PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning

Marcio Eduardo G. Silva, João Viana da Fonseca Neto, Francisco das Chagas de Souza. PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning. In David Al-Dabass, Alessandra Orsoni, Richard Cant, Jasmy Yunus, Zuwairie Ibrahim, Ismail Saad, editors, UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSim 2014, Cambridge, United Kingdom, March 26-28, 2014. pages 69-76, IEEE, 2014. [doi]

Abstract

Abstract is missing.