Modeling Multimodal Uncertainties via Probability Distribution Encoders Included Vision-Language Models

Junjie Wang, Yatai Ji, Yuxiang Zhang, Yanru Zhu, Tetsuya Sakai. Modeling Multimodal Uncertainties via Probability Distribution Encoders Included Vision-Language Models. IEEE Access, 12:420-434, 2024.

Abstract

Abstract is missing.