RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning

Wei Qiu 0001, Xiao Ma 0006, Bo An 0001, Svetlana Obraztsova, Shuicheng Yan, Zhongwen Xu. RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

Authors

Wei Qiu 0001

This author has not been identified. Look up 'Wei Qiu 0001' in Google

Xiao Ma 0006

This author has not been identified. Look up 'Xiao Ma 0006' in Google

Bo An 0001

This author has not been identified. Look up 'Bo An 0001' in Google

Svetlana Obraztsova

This author has not been identified. Look up 'Svetlana Obraztsova' in Google

Shuicheng Yan

This author has not been identified. Look up 'Shuicheng Yan' in Google

Zhongwen Xu

This author has not been identified. Look up 'Zhongwen Xu' in Google