Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits

Kaushik Roy, Qi Zhang, Manas Gaur, Amit P. Sheth. Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits. In Nuria Oliver, Fernando Pérez-Cruz, Stefan Kramer, Jesse Read, José Antonio Lozano, editors, Machine Learning and Knowledge Discovery in Databases. Research Track - European Conference, ECML PKDD 2021, Bilbao, Spain, September 13-17, 2021, Proceedings, Part I. Volume 12975 of Lecture Notes in Computer Science, pages 35-50, Springer, 2021.

Abstract

Abstract is missing.