Attention-Based Multimodal Deep Learning on Vision-Language Data: Models, Datasets, Tasks, Evaluation Metrics and Applications

Priyankar Bose, Pratip Rana, Preetam Ghosh. Attention-Based Multimodal Deep Learning on Vision-Language Data: Models, Datasets, Tasks, Evaluation Metrics and Applications. IEEE Access, 11:80624-80646, 2023. [doi]

Authors

Priyankar Bose

This author has not been identified. Look up 'Priyankar Bose' in Google

Pratip Rana

This author has not been identified. Look up 'Pratip Rana' in Google

Preetam Ghosh

This author has not been identified. Look up 'Preetam Ghosh' in Google