The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models

Cassidy Laidlaw, Anca D. Dragan. The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022.

Authors

Cassidy Laidlaw

Anca D. Dragan
