Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies

Muhammad A. Masood, Finale Doshi-Velez. Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies. In Sarit Kraus, editor, Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019. pages 5923-5929, ijcai.org, 2019. [doi]

@inproceedings{MasoodD19-0,
  title = {Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies},
  author = {Muhammad A. Masood and Finale Doshi-Velez},
  year = {2019},
  doi = {10.24963/ijcai.2019/821},
  url = {https://doi.org/10.24963/ijcai.2019/821},
  researchr = {https://researchr.org/publication/MasoodD19-0},
  cites = {0},
  citedby = {0},
  pages = {5923-5929},
  booktitle = {Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019},
  editor = {Sarit Kraus},
  publisher = {ijcai.org},
}