BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers

Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao 0001, Jifeng Dai. BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers. In Shai Avidan, Gabriel J. Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner, editors, Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part IX. Volume 13669 of Lecture Notes in Computer Science, pages 1-18, Springer, 2022.

Authors

Zhiqi Li

Wenhai Wang

Hongyang Li

Enze Xie

Chonghao Sima

Tong Lu

Yu Qiao 0001

Jifeng Dai
