LILA: A Unified Benchmark for Mathematical Reasoning

Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan. LILA: A Unified Benchmark for Mathematical Reasoning. In Yoav Goldberg, Zornitsa Kozareva, Yue Zhang, editors, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11. pages 5807-5832, Association for Computational Linguistics, 2022. [doi]

Authors

Swaroop Mishra

This author has not been identified. Look up 'Swaroop Mishra' in Google

Matthew Finlayson

This author has not been identified. Look up 'Matthew Finlayson' in Google

Pan Lu

This author has not been identified. Look up 'Pan Lu' in Google

Leonard Tang

This author has not been identified. Look up 'Leonard Tang' in Google

Sean Welleck

This author has not been identified. Look up 'Sean Welleck' in Google

Chitta Baral

This author has not been identified. Look up 'Chitta Baral' in Google

Tanmay Rajpurohit

This author has not been identified. Look up 'Tanmay Rajpurohit' in Google

Oyvind Tafjord

This author has not been identified. Look up 'Oyvind Tafjord' in Google

Ashish Sabharwal

This author has not been identified. Look up 'Ashish Sabharwal' in Google

Peter Clark

This author has not been identified. Look up 'Peter Clark' in Google

Ashwin Kalyan

This author has not been identified. Look up 'Ashwin Kalyan' in Google