Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3

Ahmed R. Sadik, Siddhata Govind. Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3. In Muhammad Ali Babar, Ayse Tosun, Stefan Wagner, Viktoria Stray, editors, Proceedings of the 29th International Conference on Evaluation and Assessment in Software Engineering, EASE 2025, Istanbul, Turkey, June 17-20, 2025, pages 969-975. ACM, 2025.
