Whittle index based Q-learning for restless bandits with average reward - researchr publication

researchr

You are not signed in
Sign in
Sign up

Konstantin E. Avrachenkov, Vivek S. Borkar. Whittle index based Q-learning for restless bandits with average reward. Automatica, 139:110186, 2022. [doi]

Abstract is missing.

runs on WebDSL