Aaron Sonabend W., Nilanjana Laha, Ashwin N. Ananthakrishnan, Tianxi Cai, Rajarshi Mukherjee. Semi-Supervised Off-Policy Reinforcement Learning and Value Estimation for Dynamic Treatment Regimes. Journal of Machine Learning Research, 24, 2023.