Yong Joo Do, Deuksun Hong, Hyungbo Shim. Distributed Q-Learning on Multi-agent Markov Decision Process with Heterogeneous State Transition Probabilities. In 64th IEEE Conference on Decision and Control, CDC 2025, Rio de Janeiro, Brazil, December 9-12, 2025. Pages 1492-1499, IEEE, 2025.