Experience-efficient learning in associative bandit problems

Alexander L. Strehl, Chris Mesterharm, Michael L. Littman, Haym Hirsh. Experience-efficient learning in associative bandit problems. In William W. Cohen, Andrew Moore, editors, Machine Learning, Proceedings of the Twenty-Third International Conference (ICML 2006), Pittsburgh, Pennsylvania, USA, June 25-29, 2006. Volume 148 of ACM International Conference Proceeding Series, pages 889-896, ACM, 2006. [doi]

@inproceedings{StrehlMLH06,
  title = {Experience-efficient learning in associative bandit problems},
  author = {Alexander L. Strehl and Chris Mesterharm and Michael L. Littman and Haym Hirsh},
  year = {2006},
  doi = {10.1145/1143844.1143956},
  url = {http://doi.acm.org/10.1145/1143844.1143956},
  researchr = {https://researchr.org/publication/StrehlMLH06},
  cites = {0},
  citedby = {0},
  pages = {889-896},
  booktitle = {Machine Learning, Proceedings of the Twenty-Third International Conference (ICML 2006), Pittsburgh, Pennsylvania, USA, June 25-29, 2006},
  editor = {William W. Cohen and Andrew Moore},
  volume = {148},
  series = {ACM International Conference Proceeding Series},
  publisher = {ACM},
  isbn = {1-59593-383-2},
}