Semi-Supervised Off-Policy Reinforcement Learning and Value Estimation for Dynamic Treatment Regimes

Aaron Sonabend W., Nilanjana Laha, Ashwin N. Ananthakrishnan, Tianxi Cai, Rajarshi Mukherjee. Semi-Supervised Off-Policy Reinforcement Learning and Value Estimation for Dynamic Treatment Regimes. Journal of Machine Learning Research, 24, 2023.

Abstract

Abstract is missing.