CLadder: A Benchmark to Assess Causal Reasoning Capabilities of Language Models

Zhijing Jin, Yuen Chen, Felix Leeb, Luigi Gresele, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez Adauto, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf. CLadder: A Benchmark to Assess Causal Reasoning Capabilities of Language Models. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10-16, 2023.

Authors

Zhijing Jin

Yuen Chen

Felix Leeb

Luigi Gresele

Ojasv Kamal

Zhiheng Lyu

Kevin Blin

Fernando Gonzalez Adauto

Max Kleiman-Weiner

Mrinmaya Sachan

Bernhard Schölkopf
