LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs

Arash Gholami Davoodi, Seyed Pouyan Mousavi Davoudi, Pouya Pezeshkpour. LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs. In Luis Chiruzzo, Alan Ritter, Lu Wang, editors, Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2025 - Volume 1: Long Papers, Albuquerque, New Mexico, USA, April 29 - May 4, 2025. pages 3127-3140, Association for Computational Linguistics, 2025.

@inproceedings{DavoodiDP25,
  title = {LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs},
  author = {Arash Gholami Davoodi and Seyed Pouyan Mousavi Davoudi and Pouya Pezeshkpour},
  year = {2025},
  url = {https://aclanthology.org/2025.naacl-long.161/},
  researchr = {https://researchr.org/publication/DavoodiDP25},
  cites = {0},
  citedby = {0},
  pages = {3127-3140},
  booktitle = {Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2025 - Volume 1: Long Papers, Albuquerque, New Mexico, USA, April 29 - May 4, 2025},
  editor = {Luis Chiruzzo and Alan Ritter and Lu Wang},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-189-6},
}
