Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu 0003, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul F. Christiano. Learning to summarize with human feedback. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Nisan Stiennon

This author has not been identified. Look up 'Nisan Stiennon' in Google

Long Ouyang

This author has not been identified. Look up 'Long Ouyang' in Google

Jeffrey Wu 0003

This author has not been identified. Look up 'Jeffrey Wu 0003' in Google

Daniel M. Ziegler

This author has not been identified. Look up 'Daniel M. Ziegler' in Google

Ryan Lowe

This author has not been identified. Look up 'Ryan Lowe' in Google

Chelsea Voss

This author has not been identified. Look up 'Chelsea Voss' in Google

Alec Radford

This author has not been identified. Look up 'Alec Radford' in Google

Dario Amodei

This author has not been identified. Look up 'Dario Amodei' in Google

Paul F. Christiano

This author has not been identified. Look up 'Paul F. Christiano' in Google