Transform, contrast and tell: Coherent entity-aware multi-image captioning - researchr publication

researchr

You are not signed in
Sign in
Sign up

Jingqiang Chen. Transform, contrast and tell: Coherent entity-aware multi-image captioning. Computer Vision and Image Understanding, 238:103878, January 2024. [doi]

Abstract is missing.

runs on WebDSL