PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators

Anish Agarwal, Abdullah Alomar, Varkey Alumootil, Devavrat Shah, Dennis Shen, Zhi Xu, Cindy Yang. PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 18564-18576, 2021. [doi]

Authors

Anish Agarwal

This author has not been identified. Look up 'Anish Agarwal' in Google

Abdullah Alomar

This author has not been identified. Look up 'Abdullah Alomar' in Google

Varkey Alumootil

This author has not been identified. Look up 'Varkey Alumootil' in Google

Devavrat Shah

This author has not been identified. Look up 'Devavrat Shah' in Google

Dennis Shen

This author has not been identified. Look up 'Dennis Shen' in Google

Zhi Xu

This author has not been identified. Look up 'Zhi Xu' in Google

Cindy Yang

This author has not been identified. Look up 'Cindy Yang' in Google