Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Songtao Lu, Kaiqing Zhang, Tianyi Chen, Tamer Basar, Lior Horesh. Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning. In Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, Thirty-Third Conference on Innovative Applications of Artificial Intelligence, IAAI 2021, The Eleventh Symposium on Educational Advances in Artificial Intelligence, EAAI 2021, Virtual Event, February 2-9, 2021. pages 8767-8775, AAAI Press, 2021. [doi]

This author has not been identified. Look up 'Songtao Lu' in GoogleThis author has not been identified. Look up 'Kaiqing Zhang' in GoogleThis author has not been identified. Look up 'Tianyi Chen' in GoogleThis author has not been identified. Look up 'Tamer Basar' in GoogleThis author has not been identified. Look up 'Lior Horesh' in Google

runs on WebDSL