Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems

Wacharawan Intayoad, Chayapol Kamyod, Punnarumol Temdee. Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems. Wireless Personal Communications, 115(4):2917-2932, 2020. [doi]

@article{IntayoadKT20,
  title = {Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems},
  author = {Wacharawan Intayoad and Chayapol Kamyod and Punnarumol Temdee},
  year = {2020},
  doi = {10.1007/s11277-020-07199-0},
  url = {https://doi.org/10.1007/s11277-020-07199-0},
  researchr = {https://researchr.org/publication/IntayoadKT20},
  cites = {0},
  citedby = {0},
  journal = {Wireless Personal Communications},
  volume = {115},
  number = {4},
  pages = {2917-2932},
}