Q-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control

Dejan V. Djonin, Vikram Krishnamurthy. Q-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control. IEEE Transactions on Signal Processing, 55(5-2):2170-2181, 2007.
