CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning

Hao Cui, Zahra Shamsi, Gowoon Cheon, Xuejian Ma, Shutong Li, Maria Tikhanovskaya, Peter Christian Norgaard, Nayantara Mudur, Martyna Beata Plomecka, Paul Raccuglia, Yasaman Bahri, Victor V. Albert, Pranesh Srinivasan, Haining Pan, Philippe Faist, Brian Rohr, Michael J. Statt, Dan Morris, Drew Purves, Elise Kleeman, et al. CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025.

@inproceedings{CuiSCMLTNMPRBAS25,
  title = {CURIE: Evaluating LLMs on Multitask Scientific Long-Context Understanding and Reasoning},
  author = {Hao Cui and Zahra Shamsi and Gowoon Cheon and Xuejian Ma and Shutong Li and Maria Tikhanovskaya and Peter Christian Norgaard and Nayantara Mudur and Martyna Beata Plomecka and Paul Raccuglia and Yasaman Bahri and Victor V. Albert and Pranesh Srinivasan and Haining Pan and Philippe Faist and Brian Rohr and Michael J. Statt and Dan Morris and Drew Purves and Elise Kleeman and others},
  year = {2025},
  url = {https://openreview.net/forum?id=jw2fC6REUB},
  researchr = {https://researchr.org/publication/CuiSCMLTNMPRBAS25},
  cites = {0},
  citedby = {0},
  booktitle = {The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025},
  publisher = {OpenReview.net},
}