The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information

John Langford, Tong Zhang. The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information. In John C. Platt, Daphne Koller, Yoram Singer, Sam T. Roweis, editors, Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007. pages 817-824, MIT Press, 2007. [doi]

@inproceedings{LangfordZ07,
  title = {The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information},
  author = {John Langford and Tong Zhang},
  year = {2007},
  url = {http://books.nips.cc/papers/files/nips20/NIPS2007_0785.pdf},
  researchr = {https://researchr.org/publication/LangfordZ07},
  cites = {0},
  citedby = {0},
  pages = {817-824},
  booktitle = {Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007},
  editor = {John C. Platt and Daphne Koller and Yoram Singer and Sam T. Roweis},
  publisher = {MIT Press},
}