Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration

Haowen Dou, Lujuan Dang, Zhirong Luan, Badong Chen. Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration. In Amir Globerson, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024.

@inproceedings{DouDLC24,
  title = {Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration},
  author = {Haowen Dou and Lujuan Dang and Zhirong Luan and Badong Chen},
  year = {2024},
  url = {http://papers.nips.cc/paper_files/paper/2024/hash/8bb7d93ee3ce2c75da68ebeb51508111-Abstract-Conference.html},
  researchr = {https://researchr.org/publication/DouDLC24},
  cites = {0},
  citedby = {0},
  booktitle = {Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024},
  editor = {Amir Globerson and Lester Mackey and Danielle Belgrave and Angela Fan and Ulrich Paquet and Jakub M. Tomczak and Cheng Zhang},
}