Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms

Lihong Li, Wei Chu, John Langford, Xuanhui Wang. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms. In Irwin King, Wolfgang Nejdl, Hang Li, editors, Proceedings of the Forth International Conference on Web Search and Web Data Mining, WSDM 2011, Hong Kong, China, February 9-12, 2011. pages 297-306, ACM, 2011. [doi]

@inproceedings{LiCLW11,
  title = {Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms},
  author = {Lihong Li and Wei Chu and John Langford and Xuanhui Wang},
  year = {2011},
  doi = {10.1145/1935826.1935878},
  url = {http://doi.acm.org/10.1145/1935826.1935878},
  tags = {rule-based, recommendation algorithm},
  researchr = {https://researchr.org/publication/LiCLW11},
  cites = {0},
  citedby = {0},
  pages = {297-306},
  booktitle = {Proceedings of the Forth International Conference on Web Search and Web Data Mining, WSDM 2011, Hong Kong, China, February 9-12, 2011},
  editor = {Irwin King and Wolfgang Nejdl and Hang Li},
  publisher = {ACM},
  isbn = {978-1-4503-0493-1},
}