Abstract is missing.
- DFNet: Enhance Absolute Pose Regression with Direct Feature MatchingShuai Chen, Xinghui Li, Zirui Wang, Victor Adrian Prisacariu. 1-17  [doi]
- Cornerformer: Purifying Instances for Corner-Based DetectorsHaoran Wei, Xin Chen, Lingxi Xie, Qi Tian 0001. 18-34  [doi]
- PillarNet: Real-Time and High-Performance Pillar-Based 3D Object DetectionGuangsheng Shi, Ruifeng Li, Chao Ma. 35-52  [doi]
- Robust Object Detection with Inaccurate Bounding BoxesChengxin Liu, Kewei Wang, Hao Lu 0003, Zhiguo Cao 0001, Ziming Zhang. 53-69  [doi]
- Efficient Decoder-Free Object Detection with TransformersPeixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen. 70-86  [doi]
- Cross-Modality Knowledge Distillation Network for Monocular 3D Object DetectionYu Hong, Hang Dai, Yong Ding 0003. 87-104  [doi]
- ReAct: Temporal Action Detection with Relational QueriesDingfeng Shi, Yujie Zhong, Qiong Cao, Jing Zhang, Lin Ma 0002, Jia Li 0003, Dacheng Tao. 105-121  [doi]
- Towards Accurate Active Camera LocalizationQihang Fang, Yingda Yin, Qingnan Fan, Fei Xia, Siyan Dong, Sheng Wang, Jue Wang, Leonidas J. Guibas, Baoquan Chen. 122-139  [doi]
- Camera Pose Auto-encoders for Improving Pose RegressionYoli Shavit, Yosi Keller. 140-157  [doi]
- Improving the Intra-class Long-Tail in 3D Detection via Rare Example MiningChiyu Max Jiang, Mahyar Najibi, Charles R. Qi, Yin Zhou, Dragomir Anguelov. 158-175  [doi]
- Bagging Regional Classification Activation Maps for Weakly Supervised Object LocalizationLei Zhu, Qian Chen, Lujia Jin, Yunfei You, Yanye Lu. 176-192  [doi]
- UC-OWOD: Unknown-Classified Open World Object DetectionZhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu. 193-210  [doi]
- RayTran: 3D Pose Estimation and Shape Reconstruction of Multiple Objects from Videos with Ray-Traced TransformersMichal J. Tyszkiewicz, Kevis-Kokitsi Maninis, Stefan Popov, Vittorio Ferrari. 211-228  [doi]
- GTCaR: Graph Transformer for Camera Re-localizationXinyi Li, Haibin Ling. 229-246  [doi]
- 3D Object Detection with a Self-supervised Lidar Scene Flow BackboneEmeç Erçelik, Ekim Yurtsever, Mingyu Liu, Zhijie Yang, Hanzhen Zhang, Pinar Topçam, Maximilian Listl, Yilmaz Kaan Çayli, Alois C. Knoll. 247-265  [doi]
- Open Vocabulary Object Detection with Pseudo Bounding-Box LabelsMingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li 0001, Ran Xu, Wenhao Liu, Caiming Xiong. 266-282  [doi]
- Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words RepresentationsWenjie Pei, Shuang Wu, Dianwen Mei, Fanglin Chen 0001, Jiandong Tian, Guangming Lu. 283-299  [doi]
- SALISA: Saliency-Based Input Sampling for Efficient Video Object DetectionBabak Ehteshami Bejnordi, AmirHossein Habibian, Fatih Porikli, Amir Ghodrati. 300-316  [doi]
- ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine RefinementDongli Tan, Jiang-Jiang Liu 0001, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji. 317-334  [doi]
- Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint VotingYangzheng Wu, Mohsen Zand, Ali Etemad, Michael A. Greenspan. 335-352  [doi]
- Long-Tailed Instance Segmentation Using Gumbel Optimized LossKonstantinos Panagiotis Alexandridis, Jiankang deng, Anh Nguyen, Shan Luo. 353-369  [doi]
- DetMatch: Two Teachers are Better than One for Joint 2D and 3D Semi-Supervised Object DetectionJinhyung Park, Chenfeng Xu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan. 370-389  [doi]
- ObjectBox: From Centers to Boxes for Anchor-Free Object DetectionMohsen Zand, Ali Etemad, Michael A. Greenspan. 390-406  [doi]
- Is Geometry Enough for Matching in Visual Localization?Qunjie Zhou, Sérgio Agostinho, Aljosa Osep, Laura Leal-Taixé. 407-425  [doi]
- SWFormer: Sparse Window Transformer for 3D Object Detection in Point CloudsPei Sun, Mingxing Tan, Weiyue Wang 0002, Chenxi Liu, Fei Xia, Zhaoqi Leng, Dragomir Anguelov. 426-442  [doi]
- PCR-CG: Point Cloud Registration via Deep Explicit Color and GeometryYu Zhang, Junle Yu, Xiaolin Huang, WenHui Zhou, Ji Hou. 443-459  [doi]
- GLAMD: Global and Local Attention Mask Distillation for Object DetectorsYounHo Jang, Wheemyung Shin, Jinbeom Kim, Simon S. Woo, Sung-Ho Bae. 460-476  [doi]
- FCAF3D: Fully Convolutional Anchor-Free 3D Object DetectionDanila Rukhovich, Anna Vorontsova, Anton Konushin 0002. 477-493  [doi]
- Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw PuzzlesGuodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang 0001. 494-511  [doi]
- Class-Agnostic Object Detection with Multi-modal TransformerMuhammad Maaz, Hanoona Abdul Rasheed, Salman Khan 0001, Fahad Shahbaz Khan, Rao Muhammad Anwer, Ming-Hsuan Yang 0001. 512-531  [doi]
- Enhancing Multi-modal Features Using Local Self-attention for 3D Object DetectionHao Li, Zehan Zhang, Xian Zhao, Yulong Wang, Yuxi Shen, Shiliang Pu, Hui Mao. 532-549  [doi]
- Object Detection as Probabilistic Set PredictionGeorg Hess, Christoffer Petersson, Lennart Svensson. 550-566  [doi]
- Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic ActionsZhi Li, Lu He, Huijuan Xu 0006. 567-584  [doi]
- Neural Correspondence Field for Object Pose EstimationLin Huang 0004, Tomas Hodan, Lingni Ma, Linguang Zhang, Luan Tran, Christopher D. Twigg, Po-Chen Wu, Junsong Yuan, Cem Keskin, Robert Wang 0002. 585-603  [doi]
- On Label Granularity and Object LocalizationElijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge J. Belongie, Andrew G. Howard, Oisin Mac Aodha. 604-620  [doi]
- OIMNet++: Prototypical Normalization and Localization-Aware Learning for Person SearchSanghoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham. 621-637  [doi]
- Out-of-Distribution Identification: Let Detector Tell Which I Am Not SureRuoqi Li, Chongyang Zhang, Hao Zhou, Chao Shi, Yan Luo. 638-654  [doi]
- Learning with Free Object Segments for Long-Tailed Instance SegmentationCheng Zhang 0014, Tai-Yu Pan, Tianle Chen, Jike Zhong, Wenjin Fu, Wei-Lun Chao. 655-672  [doi]
- Autoregressive Uncertainty Modeling for 3D Bounding Box PredictionYuxuan Liu 0001, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen 0022. 673-694  [doi]
- 3D Random Occlusion and Multi-layer Projection for Deep Multi-camera Pedestrian LocalizationRui Qiu, Ming Xu 0011, Yuyao Yan, Jeremy S. Smith, Xi Yang 0008. 695-710  [doi]
- A Simple Single-Scale Vision Transformer for Object Detection and Instance SegmentationWuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou. 711-727  [doi]
- Simple Open-Vocabulary Object DetectionMatthias Minderer, Alexey A. Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani 0001, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby. 728-755  [doi]