Abstract is missing.
- ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCOSanghyuk Chun, Wonjae Kim, Song Park, Minsuk Chang, Seong Joon Oh. 1-19 [doi]
- MOTCOM: The Multi-Object Tracking Dataset Complexity MetricMalte Pedersen, Joakim Bruslund Haurum, Patrick Dendorfer, Thomas B. Moeslund. 20-37 [doi]
- How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset?Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng 0001. 38-55 [doi]
- A Real World Dataset for Multi-view 3D ReconstructionRakesh Shrestha, Siqi Hu, Minghao Gou, Ziyuan Liu, Ping Tan. 56-73 [doi]
- REALY: Rethinking the Evaluation of 3D Face ReconstructionZenghao Chai, Haoxian Zhang, Jing Ren, Di Kang, Zhengzhuo Xu, Xuefei Zhe, Chun Yuan, Linchao Bao. 74-92 [doi]
- Capturing, Reconstructing, and Simulating: The UrbanScene3D DatasetLiqiang Lin, Yilin Liu, Yue Hu, Xingguang Yan, Ke Xie 0001, Hui Huang 0004. 93-109 [doi]
- 3D CoMPaT: Composition of Materials on Parts of 3D ThingsYuchen Li, Ujjwal Upadhyay, Habib Slim, Ahmed Abdelreheem 0002, Arpit Prajapati, Suhail Pothigara, Peter Wonka, Mohamed Elhoseiny. 110-127 [doi]
- PartImageNet: A Large, High-Quality Dataset of PartsJu He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jieneng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan L. Yuille. 128-145 [doi]
- A-OKVQA: A Benchmark for Visual Question Answering Using World KnowledgeDustin Schwenk, Apoorv Khandelwal 0001, Christopher Clark, Kenneth Marino, Roozbeh Mottaghi. 146-162 [doi]
- OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural ImagesBingchen Zhao, Shaozuo Yu, Wufei Ma, Mingxin Yu, Shenxiao Mei, Angtian Wang, Ju He, Alan L. Yuille, Adam Kortylewski. 163-180 [doi]
- Facial Depth and Normal Estimation Using Single Dual-Pixel CameraMinjun Kang, Jaesung Choe, Hyowon Ha, Hae-Gon Jeon, Sunghoon Im, In-So Kweon, Kuk-Jin Yoon. 181-200 [doi]
- The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video EditingDawit Mureja Argaw, Fabian Caba Heilbron, Joon-Young Lee, Markus Woodson, In-So Kweon. 201-218 [doi]
- StyleBabel: Artistic Style Tagging and CaptioningDan Ruta, Andrew Gilbert, Pranav Aggarwal, Naveen Marri, Ajinkya Kale, Jo Briggs, Chris Speed, Hailin Jin, Baldo Faieta, Alex Filipkowski, Zhe Lin 0001, John P. Collomosse. 219-236 [doi]
- PANDORA: A Panoramic Detection Dataset for Object with OrientationHang Xu, Qiang Zhao 0005, Yike Ma, Xiaodong Li, Peng Yuan, Bailan Feng, Chenggang Yan 0001, Feng Dai. 237-252 [doi]
- FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in ContextPinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song. 253-270 [doi]
- Exploring Fine-Grained Audiovisual Categorization with the SSW60 DatasetGrant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge J. Belongie. 271-289 [doi]
- The Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and CountingJustin Kay, Peter Kulits, Suzanne Stathatos, Siqi Deng, Erik Young, Sara Beery, Grant Van Horn, Pietro Perona. 290-311 [doi]
- A Dataset for Interactive Vision-Language Navigation with Unknown Command FeasibilityAndrea Burns, Deniz Arsan, Sanjna Agrawal, Ranjitha Kumar, Kate Saenko, Bryan A. Plummer. 312-328 [doi]
- BRACE: The Breakdancing Competition Dataset for Dance Motion SynthesisDavide Moltisanti, Jinyi Wu, Bo Dai, Chen Change Loy. 329-344 [doi]
- Dress Code: High-Resolution Multi-category Virtual Try-OnDavide Morelli, Matteo Fincato, Marcella Cornia, Federico Landi, Fabio Cesari, Rita Cucchiara. 345-362 [doi]
- A Data-Centric Approach for Improving Ambiguous Labels with Combined Semi-supervised Classification and ClusteringLars Schmarje, Monty Santarossa, Simon-Martin Schröder, Claudius Zelenka, Rainer Kiko, Jenny Stracke, Nina Volkmann, Reinhard Koch. 363-380 [doi]
- ClearPose: Large-scale Transparent Object Dataset and BenchmarkXiaotong Chen, Huijie Zhang, Zeren Yu, Anthony Opipari, Odest Chadwicke Jenkins. 381-396 [doi]
- When Deep Classifiers Agree: Analyzing Correlations Between Learning Order and Image StatisticsIuliia Pliushch, Martin Mundt, Nicolas Lupp, Visvanathan Ramesh. 397-413 [doi]
- AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head ReenactmentKangyeol Kim, Sunghyun Park, Jaeseong Lee, Sunghyo Chung, Junsoo Lee, Jaegul Choo. 414-430 [doi]
- MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENerationThomas Hayes, Songyang Zhang, Xi Yin 0008, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh. 431-449 [doi]
- A Dense Material Segmentation Dataset for Indoor and Outdoor Scene ParsingPaul Upchurch, Ransen Niu. 450-466 [doi]
- MimicME: A Large Scale Diverse 4D Database for Facial Expression AnalysisAthanasios Papaioannou, Baris Gecer, Shiyang Cheng 0001, Grigorios Chrysos 0002, Jiankang deng, Eftychia Fotiadou, Christos Kampouris, Dimitrios Kollias, Stylianos Moschoglou, Kritaphat Songsri-in, Stylianos Ploumpis, George Trigeorgis, Panagiotis Tzirakis, Evangelos Ververas, Yuxiang Zhou, Allan Ponniah, Anastasios Roussos, Stefanos Zafeiriou. 467-484 [doi]
- Delving into Universal Lesion Segmentation: Method, Dataset, and BenchmarkYu Qiu, Jing Xu 0008. 485-503 [doi]
- Large Scale Real-World Multi-person TrackingBing Shuai, Alessandro Bergamo, Uta Büchler, Andrew G. Berneshawi, Alyssa Boden, Joseph Tighe. 504-521 [doi]
- D2-TPred: Discontinuous Dependency for Trajectory Prediction Under Traffic LightsYuzhen Zhang, Wentong Wang, Weizhi Guo, Pei Lv, Mingliang Xu, Wei Chen 0001, Dinesh Manocha. 522-539 [doi]
- The Missing Link: Finding Label Relations Across DatasetsJasper R. R. Uijlings, Thomas Mensink, Vittorio Ferrari. 540-556 [doi]
- Learning Omnidirectional Flow in 360$^\circ $ Video via Siamese RepresentationKeshav Bhandari, Bin Duan, Gaowen Liu, Hugo Latapie, Ziliang Zong, Yan Yan 0002. 557-574 [doi]
- VizWiz-FewShot: Locating Objects in Images Taken by People with Visual ImpairmentsYu-Yun Tseng, Alexander Bell, Danna Gurari. 575-591 [doi]
- TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual EnvironmentsShubham Dokania, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar. 592-608 [doi]
- Trapped in Texture Bias? A Large Scale Comparison of Deep Instance SegmentationJohannes Theodoridis, Jessica Hofmann, Johannes Maucher, Andreas Schilling 0001. 609-627 [doi]
- Deformable Feature Aggregation for Dynamic Multi-modal 3D Object DetectionZehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao. 628-644 [doi]
- WeLSA: Learning to Predict 6D Pose from Weakly Labeled Data Using Shape AlignmentShishir Reddy Vutukur, Ivan Shugurov, Benjamin Busam, Andreas Hutter, Slobodan Ilic. 645-661 [doi]
- Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local GraphHonghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang 0001, Wei Qian, Xiaofei He 0001, Deng Cai 0001. 662-679 [doi]
- MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object DetectionXuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka-Chun Cheung, Hang Xu, Hongsheng Li 0001. 680-697 [doi]
- Long-tail Detection with Effective Class-MarginsJang Hyun Cho, Philipp Krähenbühl. 698-714 [doi]
- Semi-supervised Monocular 3D Object Detection by Multi-view ConsistencyQing Lian, Yanbo Xu, Weilong Yao, Yingcong Chen, Tong Zhang. 715-731 [doi]
- PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object DetectionHan Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song 0001. 732-747 [doi]