Axioms for AI Alignment from Human Feedback

Luise Ge, Daniel Halpern 0002, Evi Micha, Ariel D. Procaccia, Itai Shapira, Yevgeniy Vorobeychik, Junlin Wu 0001. Axioms for AI Alignment from Human Feedback. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang 0005, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024. [doi]

Authors

Luise Ge

This author has not been identified. Look up 'Luise Ge' in Google

Daniel Halpern 0002

This author has not been identified. Look up 'Daniel Halpern 0002' in Google

Evi Micha

This author has not been identified. Look up 'Evi Micha' in Google

Ariel D. Procaccia

This author has not been identified. Look up 'Ariel D. Procaccia' in Google

Itai Shapira

This author has not been identified. Look up 'Itai Shapira' in Google

Yevgeniy Vorobeychik

This author has not been identified. Look up 'Yevgeniy Vorobeychik' in Google

Junlin Wu 0001

This author has not been identified. Look up 'Junlin Wu 0001' in Google