Deep sequential collaborative cognition of vision and language based model for video description

Pengjie Tang, Yunlan Tan, Jiewu Xia. Deep sequential collaborative cognition of vision and language based model for video description. Multimedia Tools Appl., 82(23):36207-36230, September 2023. [doi]

Abstract

Abstract is missing.