Proximal Policy Optimization With Policy Feedback

Yang Gu, Yuhu Cheng, C. L. Philip Chen, Xuesong Wang. Proximal Policy Optimization With Policy Feedback. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 52(7):4600-4610, 2022. doi:10.1109/TSMC.2021.3098451

@article{GuCCW22,
  title = {Proximal Policy Optimization With Policy Feedback},
  author = {Yang Gu and Yuhu Cheng and C. L. Philip Chen and Xuesong Wang},
  year = {2022},
  doi = {10.1109/TSMC.2021.3098451},
  url = {https://doi.org/10.1109/TSMC.2021.3098451},
  researchr = {https://researchr.org/publication/GuCCW22},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Systems, Man, and Cybernetics: Systems},
  volume = {52},
  number = {7},
  pages = {4600--4610},
}