The following publications are possibly variants of this publication:
- Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive HealthcareArpita Biswas, Gaurav Aggarwal, Pradeep Varakantham, Milind Tambe. IJCAI 2021: 4039-4046 [doi]
- Optimistic Whittle Index Policy: Online Learning for Restless BanditsKai Wang 0040, Lily Xu, Aparna Taneja, Milind Tambe. AAAI 2023: 10131-10139 [doi]
- Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child HealthKai Wang, Shresth Verma, Aditya Mate, Sanket Shah, Aparna Taneja, Neha Madhiwalla, Aparna Hegde, Milind Tambe. AAAI 2023: 12138-12146 [doi]
- Restless Multi-Armed Bandits for Maternal and Child Health: Results from Decision-Focused LearningShresth Verma, Aditya Mate, Kai Wang 0040, Neha Madhiwalla, Aparna Hegde, Aparna Taneja, Milind Tambe. atal 2023: 1312-1320 [doi]
- Q-Learning Lagrange Policies for Multi-Action Restless BanditsJackson A. Killian, Arpita Biswas, Sanket Shah, Milind Tambe. kdd 2021: 871-881 [doi]