Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning

Songtao Lu, Kaiqing Zhang, Tianyi Chen, Tamer Basar, Lior Horesh. Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021. pages 8767-8775, AAAI Press, 2021. [doi]

Authors

Songtao Lu

This author has not been identified. Look up 'Songtao Lu' in Google

Kaiqing Zhang

This author has not been identified. Look up 'Kaiqing Zhang' in Google

Tianyi Chen

This author has not been identified. Look up 'Tianyi Chen' in Google

Tamer Basar

This author has not been identified. Look up 'Tamer Basar' in Google

Lior Horesh

This author has not been identified. Look up 'Lior Horesh' in Google