MB-GLOM: An attentive GLOM with multi-head projection and bottleneck residual

Hui Yang, Dazhong Mu, Ru Zeng, Yan Song. MB-GLOM: An attentive GLOM with multi-head projection and bottleneck residual. Neural Networks, 202:109054, 2026. [doi]

Abstract

Abstract is missing.