Temperature check: theory and practice for training models with softmax-cross-entropy losses

Atish Agarwala, Samuel Stern Schoenholz, Jeffrey Pennington, Yann N. Dauphin. Temperature check: theory and practice for training models with softmax-cross-entropy losses. Trans. Mach. Learn. Res., 2023, 2023.

Abstract

Abstract is missing.