TORA: Train Once, Realign Anytime for Offline Multi-Objective Reinforcement Learning

Weichen Li, Waleed Mustafa, Marcio Monteiro, Puyu Wang, Marius Kloft, Sophie Fellenz. TORA: Train Once, Realign Anytime for Offline Multi-Objective Reinforcement Learning. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. pages 37609-37617, AAAI Press, 2026.

Authors

Weichen Li

Waleed Mustafa

Marcio Monteiro

Puyu Wang

Marius Kloft

Sophie Fellenz
