Kaushik Roy, Qi Zhang, Manas Gaur, Amit P. Sheth. Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits. In Nuria Oliver, Fernando Pérez-Cruz, Stefan Kramer, Jesse Read, José Antonio Lozano, editors, Machine Learning and Knowledge Discovery in Databases. Research Track - European Conference, ECML PKDD 2021, Bilbao, Spain, September 13-17, 2021, Proceedings, Part I. Volume 12975 of Lecture Notes in Computer Science, pages 35-50, Springer, 2021. [doi]
Abstract is missing.