Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning

Yangyang Zhao, Kai Yin, Zhenyu Wang 0001, Mehdi Dastani, Shihan Wang 0001. Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning. IEEE Transactions on Audio, Speech & Language Processing, 32:1380-1391, 2024.

Authors

Yangyang Zhao

Kai Yin

Zhenyu Wang 0001

Mehdi Dastani

Shihan Wang 0001
