What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions

A. S. M. Iftekhar, Hao Chen, Kaustav Kundu, Xinyu Li, Joseph Tighe, Davide Modolo. What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022, pages 5343-5353. IEEE, 2022.
