Testing ChatGPT for Stability and Reasoning: A Case Study Using Italian Medical Specialty Tests

Silvia Casola, Tiziano Labruna, Alberto Lavelli, Bernardo Magnini. Testing ChatGPT for Stability and Reasoning: A Case Study Using Italian Medical Specialty Tests. In Federico Boschetti, Gianluca E. Lebani, Bernardo Magnini, Nicole Novielli, editors, Proceedings of the 9th Italian Conference on Computational Linguistics, Venice, Italy, November 30 - December 2, 2023. Volume 3596 of CEUR Workshop Proceedings, CEUR-WS.org, 2023. [doi]

Abstract

Abstract is missing.