Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering

Hung-Ting Su, Yulei Niu, Xudong Lin 0003, Winston H. Hsu, Shih-Fu Chang. Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Workshops, Vancouver, BC, Canada, June 17-24, 2023. Pages 4951-4960, IEEE, 2023.

Authors

Hung-Ting Su

Yulei Niu

Xudong Lin 0003

Winston H. Hsu

Shih-Fu Chang
