Human-in-the-Loop Generative Policy Learning from Demonstrations and Preferences

Eiji Uchibe. Human-in-the-Loop Generative Policy Learning from Demonstrations and Preferences. In Tadahiro Taniguchi, Chi Sing Andrew Leung, Tadashi Kozuno, Junichiro Yoshimoto, Mufti Mahmud, Maryam Doborjeh, Kenji Doya, editors, Neural Information Processing - 32nd International Conference, ICONIP 2025, Okinawa, Japan, November 20-24, 2025, Proceedings, Part V. Volume 16313 of Lecture Notes in Computer Science, pages 317-330, Springer, 2025. [doi]

Abstract

Abstract is missing.