Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits

Piotr Januszewski, Dominik Grzegorzek, Pawel Czarnul. Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits. In Ana Paula Rocha 0001, Luc Steels, H. Jaap van den Herik, editors, Proceedings of the 16th International Conference on Agents and Artificial Intelligence, ICAART 2024, Volume 2, Rome, Italy, February 24-26, 2024. pages 87-98, SCITEPRESS, 2024. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.