The following publications are possibly variants of this publication:
- Restless and uncertain: Robust policies for restless bandits via deep multi-agent reinforcement learning. Jackson A. Killian, Lily Xu, Arpita Biswas, Milind Tambe. uai 2022: 990-1000
- Networked Restless Multi-Armed Bandits for Mobile Interventions. Han-Ching Ou, Christoph Siebenbrunner, Jackson A. Killian, Meredith B. Brooks, David Kempe, Yevgeniy Vorobeychik, Milind Tambe. atal 2022: 1001-1009
- Towards Zero Shot Learning in Restless Multi-armed Bandits. Yunfan Zhao, Nikhil Behari, Edward Hughes, Edwin Zhang, Dheeraj Nagaraj, Karl Tuyls, Aparna Taneja, Milind Tambe. atal 2024: 2618-2620
- Q-Learning Lagrange Policies for Multi-Action Restless Bandits. Jackson A. Killian, Arpita Biswas, Sanket Shah, Milind Tambe. kdd 2021: 871-881