Two-phase selective decentralization to improve reinforcement learning systems with MDP

Thanh Nguyen 0005, Snehasis Mukhopadhyay. Two-phase selective decentralization to improve reinforcement learning systems with MDP. AI Commun., 31(4):319-337, 2018. [doi]

Authors

Thanh Nguyen 0005

This author has not been identified. Look up 'Thanh Nguyen 0005' in Google

Snehasis Mukhopadhyay

This author has not been identified. Look up 'Snehasis Mukhopadhyay' in Google