Aligning AI With Shared Human Values

Dan Hendrycks, Collin Burns, Steven Basart, Andrew Critch, Jerry Li 0001, Dawn Song, Jacob Steinhardt. Aligning AI With Shared Human Values. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. [doi]

Authors

Dan Hendrycks

This author has not been identified. Look up 'Dan Hendrycks' in Google

Collin Burns

This author has not been identified. Look up 'Collin Burns' in Google

Steven Basart

This author has not been identified. Look up 'Steven Basart' in Google

Andrew Critch

This author has not been identified. Look up 'Andrew Critch' in Google

Jerry Li 0001

This author has not been identified. Look up 'Jerry Li 0001' in Google

Dawn Song

This author has not been identified. Look up 'Dawn Song' in Google

Jacob Steinhardt

This author has not been identified. Look up 'Jacob Steinhardt' in Google