Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

researchr

explore
calendar
search

You are not signed in
Sign in
Sign up

Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal. Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm. J. Artif. Intell. Res. (JAIR), 74:1565-1597, 2022. [doi]

@article{BaiAA22,
  title = {Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm},
  author = {Qinbo Bai and Mridul Agarwal and Vaneet Aggarwal},
  year = {2022},
  doi = {10.1613/jair.1.13981},
  url = {https://doi.org/10.1613/jair.1.13981},
  researchr = {https://researchr.org/publication/BaiAA22},
  cites = {0},
  citedby = {0},
  journal = {J. Artif. Intell. Res. (JAIR)},
  volume = {74},
  pages = {1565-1597},
}

External Links

Cite Key

Statistics

PDF

Researchr

Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm