- BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers. Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao 0001, Jifeng Dai. pp. 1-18
- Category-Level 6D Object Pose and Size Estimation Using Self-supervised Deep Prior Deformation Networks. Jiehong Lin, Zewei Wei, Changxing Ding, Kui Jia. pp. 19-34
- Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection. Hongyu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, Jian Sun 0001. pp. 35-50
- Point-to-Box Network for Accurate Object Detection via Single Point Supervision. Pengfei Chen, Xuehui Yu, Xumeng Han, Najmul Hassan, Kai Wang, Jiachen Li 0003, Jian Zhao, Humphrey Shi, Zhenjun Han, Qixiang Ye. pp. 51-67
- Domain Adaptive Hand Keypoint and Pixel Localization in the Wild. Takehiko Ohkawa, Yu-Jhe Li, Qichen Fu, Ryosuke Furuta, Kris M. Kitani, Yoichi Sato. pp. 68-87
- Towards Data-Efficient Detection Transformers. Wen Wang, Jing Zhang, Yang Cao 0010, Yongliang Shen 0001, Dacheng Tao. pp. 88-105
- Open-Vocabulary DETR with Conditional Matching. Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy. pp. 106-122
- Prediction-Guided Distillation for Dense Object Detection. Chenhongyi Yang, Mateusz Ochal, Amos J. Storkey, Elliot J. Crowley. pp. 123-138
- Multimodal Object Detection via Probabilistic Ensembling. Yi-Ting Chen, Jinghao Shi, Zelin Ye, Christoph Mertz, Deva Ramanan, Shu Kong. pp. 139-158
- Exploiting Unlabeled Data with Vision and Language Models for Object Detection. Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao 0003, B. G. Vijay Kumar, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris N. Metaxas. pp. 159-175
- CPO: Change Robust Panorama to Point Cloud Localization. Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim 0001. pp. 176-192
- INT: Towards Infinite-Frames 3D Detection with an Efficient Framework. Jianyun Xu, Zhenwei Miao, Da Zhang, Hongyu Pan, Kaixuan Liu, Peihan Hao, Jun Zhu, Zhengyang Sun, Hongmin Li, Xin Zhan. pp. 193-209
- End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution. Mingxiang Liao, Fang Wan, Yuan Yao, Zhenjun Han, Jialing Zou, Yuze Wang, Bailan Feng, Peng Yuan, Qixiang Ye. pp. 210-226
- Calibration-Free Multi-view Crowd Counting. Qi Zhang 0041, Antoni B. Chan. pp. 227-244
- Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-training. Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang. pp. 245-262
- SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud. Xiangrui Zhao, Sheng Yang, Tianxin Huang, Jun Chen, Teng Ma, Mingyang Li, Yong Liu 0007. pp. 263-279
- Exploring Plain Vision Transformer Backbones for Object Detection. Yanghao Li, Hanzi Mao, Ross B. Girshick, Kaiming He. pp. 280-296
- Adversarially-Aware Robust Object Detector. Ziyi Dong, Pengxu Wei, Liang Lin. pp. 297-313
- HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors. Luting Wang 0001, Xiaojie Li, Yue Liao, Zeren Jiang, Jianlong Wu, Fei Wang 0032, Chen Qian 0006, Si Liu 0001. pp. 314-331
- You Should Look at All Objects. Zhenchao Jin, Dongdong Yu, Luchuan Song, Zehuan Yuan, Lequan Yu. pp. 332-349
- Detecting Twenty-Thousand Classes Using Image-Level Supervision. Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra. pp. 350-368
- DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation. Hongyang Li, Jiehong Lin, Kui Jia. pp. 369-385
- Monocular 3D Object Detection with Depth from Motion. Tai Wang, Jiangmiao Pang, Dahua Lin. pp. 386-403
- DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation. Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang 0048, Zheng Wang 0002, Taku Komura, Wenping Wang. pp. 404-421
- Distilling Object Detectors with Global Knowledge. Sanli Tang, Zhongyu Zhang, Zhanzhan Cheng, Jing Lu 0004, Yunlu Xu, Yi Niu, Fan He. pp. 422-438
- Unifying Visual Perception by Dispersible Points Learning. Jianming Liang, Guanglu Song, Biao Leng, Yu Liu 0015. pp. 439-456
- PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection. Gang Li, Xiang Li, Yujie Wang, Yichao Wu, Ding Liang, Shanshan Zhang. pp. 457-472
- Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection. Ziteng Cui, Yingying Zhu, Lin Gu 0003, Guo-Jun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada. pp. 473-491
- Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features. Wufei Ma, Angtian Wang, Alan L. Yuille, Adam Kortylewski. pp. 492-508
- Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection. Maoxun Yuan, Yinyan Wang, Xingxing Wei. pp. 509-525
- RFLA: Gaussian Receptive Field Based Label Assignment for Tiny Object Detection. Chang Xu, Jinwang Wang, Wen Yang 0001, Huai-yu, Lei Yu 0006, Gui-Song Xia. pp. 526-543
- Rethinking IoU-based Optimization for Single-stage 3D Object Detection. Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua 0001, Minjian Zhao 0001, Gim Hee Lee. pp. 544-561
- TD-Road: Top-Down Road Network Extraction with Holistic Graph Construction. Yang He, Ravi Garg, Amber Roy-Chowdhury. pp. 562-577
- Multi-faceted Distillation of Base-Novel Commonality for Few-Shot Object Detection. Shuang Wu, Wenjie Pei, Dianwen Mei, Fanglin Chen 0001, Jiandong Tian, Guangming Lu. pp. 578-594
- PointCLM: A Contrastive Learning-based Framework for Multi-instance Point Cloud Registration. Mingzhi Yuan, Zhihao Li, Qiuye Jin, Xinrong Chen, Manning Wang. pp. 595-611
- Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration. Haotian Bai, Ruimao Zhang, Jiong Wang, Xiang Wan. pp. 612-628
- MTTrans: Cross-domain Object Detection with Mean Teacher Transformer. Jinze Yu, Jiaming Liu, Xiaobao Wei, Haoyi Zhou, Yohei Nakata, Denis A. Gudovskiy, Tomoyuki Okuno, Jianxin Li 0002, Kurt Keutzer, Shanghang Zhang. pp. 629-645
- Multi-domain Multi-definition Landmark Localization for Small Datasets. David Ferman, Gaurav Bharaj. pp. 646-663
- DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection. Abhinav Kumar 0004, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu 0002. pp. 664-683
- Label-Guided Auxiliary Training Improves 3D Object Detector. Yaomin Huang, Xinmei Liu, Yichen Zhu, Zhiyuan Xu, Chaomin Shen, Zhengping Che, Guixu Zhang, Yaxin Peng, Feifei Feng, Jian Tang 0008. pp. 684-700
- PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images. Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma 0002. pp. 701-717
- Densely Constrained Depth Estimator for Monocular 3D Object Detection. Yingyan Li, YunTao Chen, Jiawei He 0002, Zhaoxiang Zhang. pp. 718-734
- Polarimetric Pose Prediction. Daoyi Gao, Yitong Li, Patrick Ruhkamp, Iuliia Skobleva, Magdalena Wysock, Hyunjun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam. pp. 735-752