Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning

Stefan Elfwing, Eiji Uchibe, Kenji Doya, Henrik I. Christensen. Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning. Adaptive Behaviour, 16(6):400-412, 2008. [doi]

@article{ElfwingUDC08,
  title = {Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning},
  author = {Stefan Elfwing and Eiji Uchibe and Kenji Doya and Henrik I. Christensen},
  year = {2008},
  doi = {10.1177/1059712308092835},
  url = {http://dx.doi.org/10.1177/1059712308092835},
  researchr = {https://researchr.org/publication/ElfwingUDC08},
  cites = {0},
  citedby = {0},
  journal = {Adaptive Behaviour},
  volume = {16},
  number = {6},
  pages = {400-412},
}