Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Ryuichi Takanobu, Runze Liang, Minlie Huang. Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition. In Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel R. Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020. pages 625-638, Association for Computational Linguistics, 2020. [doi]

Authors

Ryuichi Takanobu

This author has not been identified. Look up 'Ryuichi Takanobu' in Google

Runze Liang

This author has not been identified. Look up 'Runze Liang' in Google

Minlie Huang

This author has not been identified. Look up 'Minlie Huang' in Google