Automated L2 Proficiency Scoring: Weak Supervision, Large Language Models, and Statistical Guarantees

Aitor Arronte Alvarez, Naiyi Xie Fincham. Automated L2 Proficiency Scoring: Weak Supervision, Large Language Models, and Statistical Guarantees. In Ekaterina Kochmar, Bashar Alhafni, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anaïs Tack, Victoria Yaneva, Zheng Yuan 0003, editors, Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2025, Vienna, Austria, July 31 - August 1, 2025. pages 384-397, Association for Computational Linguistics, 2025. [doi]

Authors

Aitor Arronte Alvarez

This author has not been identified. Look up 'Aitor Arronte Alvarez' in Google

Naiyi Xie Fincham

This author has not been identified. Look up 'Naiyi Xie Fincham' in Google