A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Dong Ki Kim, Miao Liu 0001, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan P. How. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 5541-5550, PMLR, 2021. [doi]

Authors

Dong Ki Kim

This author has not been identified. Look up 'Dong Ki Kim' in Google

Miao Liu 0001

This author has not been identified. Look up 'Miao Liu 0001' in Google

Matthew Riemer

This author has not been identified. Look up 'Matthew Riemer' in Google

Chuangchuang Sun

This author has not been identified. Look up 'Chuangchuang Sun' in Google

Marwa Abdulhai

This author has not been identified. Look up 'Marwa Abdulhai' in Google

Golnaz Habibi

This author has not been identified. Look up 'Golnaz Habibi' in Google

Sebastian Lopez-Cot

This author has not been identified. Look up 'Sebastian Lopez-Cot' in Google

Gerald Tesauro

This author has not been identified. Look up 'Gerald Tesauro' in Google

Jonathan P. How

This author has not been identified. Look up 'Jonathan P. How' in Google