A temporal difference method for multi-objective reinforcement learning

Manuela Ruiz-Montiel, Lawrence Mandow, José-Luis Pérez-de-la-Cruz. A temporal difference method for multi-objective reinforcement learning. Neurocomputing, 263:15-25, 2017.
