Leonid Lyubchyk, Olena Akhiiezer, Nataliia Protsai. Reinforcement Learning Control of Markov Chains Based on Stochastic Gradient Descent. In 15th International Conference on Advanced Computer Information Technologies, ACIT 2025, Sibenik, Croatia, September 17-19, 2025. pages 66-69, IEEE, 2025.