Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits

Kaushik Roy, Qi Zhang, Manas Gaur, Amit P. Sheth. Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits. In Nuria Oliver, Fernando Pérez-Cruz, Stefan Kramer, Jesse Read, José Antonio Lozano, editors, Machine Learning and Knowledge Discovery in Databases. Research Track - European Conference, ECML PKDD 2021, Bilbao, Spain, September 13-17, 2021, Proceedings, Part I. Volume 12975 of Lecture Notes in Computer Science, pages 35-50, Springer, 2021.

Authors

Kaushik Roy

Qi Zhang

Manas Gaur

Amit P. Sheth
