A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Dong Ki Kim, Miao Liu 0001, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan P. How. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 5541-5550, PMLR, 2021. [doi]

This author has not been identified. Look up 'Dong Ki Kim' in GoogleThis author has not been identified. Look up 'Miao Liu 0001' in GoogleThis author has not been identified. Look up 'Matthew Riemer' in GoogleThis author has not been identified. Look up 'Chuangchuang Sun' in GoogleThis author has not been identified. Look up 'Marwa Abdulhai' in GoogleThis author has not been identified. Look up 'Golnaz Habibi' in GoogleThis author has not been identified. Look up 'Sebastian Lopez-Cot' in GoogleThis author has not been identified. Look up 'Gerald Tesauro' in GoogleThis author has not been identified. Look up 'Jonathan P. How' in Google

runs on WebDSL