- Megahertz Light Steering Without Moving Parts. Adithya Pediredla, Srinivasa G. Narasimhan, Maysamreza Chamanzar, Ioannis Gkioulekas. 1-12
- Affordances from Human Videos as a Versatile Representation for Robotics. Shikhar Bahl, Russell Mendonca, Lili Chen, Unnat Jain, Deepak Pathak. 1-13
- RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension. Lei Jin, Gen Luo, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji. 1-10
- Robust Dynamic Radiance Fields. Yu-Lun Liu 0001, Chen Gao, Andreas Meuleman, Hung-Yu Tseng, Ayush Saraf, Changil Kim 0001, Yung-Yu Chuang, Johannes Kopf 0001, Jia-Bin Huang 0001. 13-23
- DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields. Yu Chen, Gim Hee Lee. 24-34
- VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization. Bingfan Zhu, Yanchao Yang 0001, Xulong Wang, Youyi Zheng, Leonidas J. Guibas. 35-45
- AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training. Yifan Jiang 0001, Peter Hedman, Ben Mildenhall, Dejia Xu, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue. 46-55
- SeaThru-NeRF: Neural Radiance Fields in Scattering Media. Deborah Levy, Amit Peleg, Naama Pearl, Dan Rosenbaum, Derya Akkaynak, Simon Korman, Tali Treibitz. 56-65
- Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields. Brian K. S. Isaac-Medina, Chris G. Willcocks, Toby P. Breckon. 66-75
- Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos. Liao Wang, Qiang Hu, Qihan He, Ziyu Wang, Jingyi Yu, Tinne Tuytelaars, Lan Xu, Minye Wu. 76-87
- Plen-VDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering. Han Yan, Celong Liu, Chao Ma, Xing Mei. 88-96
- Local Implicit Ray Function for Generalizable Radiance Field Representation. Xin Huang, Qi Zhang, Ying Feng, Xiaoyu Li, Xuan Wang, Qing Wang. 97-107
- SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes. Yiming Gao 0007, Yan-Pei Cao, Ying Shan. 108-118
- Frequency-Modulated Point Cloud Rendering with Easy Editing. Yi Zhang, Xiaoyang Huang, Bingbing Ni, Wenjun Zhang, Teng Li. 119-129
- HexPlane: A Fast Representation for Dynamic Scenes. Ang Cao, Justin Johnson 0001. 130-141
- Differentiable Shadow Mapping for Efficient Inverse Graphics. Markus Worchel, Marc Alexa. 142-153
- Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur. Peng Dai, Yinda Zhang 0001, Xin Yu 0004, Xiaoyang Lyu, Xiaojuan Qi. 154-164
- TensoIR: Tensorial Inverse Rendering. Haian Jin, Isabella Liu, Peijia Xu, Xiaoshuai Zhang, Songfang Han, Sai Bi, Xiaowei Zhou, Zexiang Xu, Hao Su 0001. 165-174
- ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision. Jingwang Ling, Zhibo Wang 0003, Feng Xu 0005. 175-185
- Realistic Saliency Guided Image Enhancement. S. Mahdi H. Miangoleh, Zoya Bylinskii, Eric Kee, Eli Shechtman, Yagiz Aksoy. 186-194
- LightPainter: Interactive Portrait Relighting with Freehand Scribble. Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, Hyunjoon Jung, Vishal M. Patel 0001. 195-205
- A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance. Xianmin Xu, Yuxin Lin, Haoyang Zhou, Chong Zeng, Yaxin Yu, Kun Zhou 0001, Hongzhi Wu. 206-215
- Learning Visibility Field for Detailed 3D Human Reconstruction and Relighting. Ruichen Zheng, Peng Li, Haoqian Wang, Tao Yu. 216-226
- Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses. Junbong Jang, Kwonmoo Lee, Tae-Kyun Kim. 227-236
- NeUDF: Leaning Neural Unsigned Distance Fields with Volume Rendering. Yu-Tao Liu, Li Wang, Jie Yang, Weikai Chen 0001, Xiaoxu Meng, Bo Yang, Lin Gao. 237-247
- NeAT: Learning Neural Implicit Surfaces with Arbitrary Topologies from Multi-View Images. Xiaoxu Meng, Weikai Chen 0001, Bo Yang. 248-258
- ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction. Zhen Wang, Shijie Zhou, Jeong-Joon Park, Despoina Paschalidou, Suya You, Gordon Wetzstein, Leonidas J. Guibas, Achuta Kadambi. 259-270
- Controllable Mesh Generation Through Sparse Latent Point Diffusion Models. Zhaoyang Lyu, Jinyi Wang, Yuwei An, Ya Zhang, Dahua Lin, Bo Dai 0002. 271-280
- Photo Pre-Training, But for Sketch. Ke Li 0004, Kaiyue Pang, Yi-Zhe Song. 275-285
- Power Bundle Adjustment for Large-Scale 3D Reconstruction. Simon Weber 0002, Nikolaus Demmel, Tin Chon Chan, Daniel Cremers. 281-289
- Neural Pixel Composition for 3D-4D View Synthesis from Multi-Views. Aayush Bansal, Michael Zollhöfer. 290-299
- Magic3D: High-Resolution Text-to-3D Content Creation. Chen-Hsuan Lin, Jun Gao 0004, Luming Tang, Towaki Takikawa, Xiaohui Zeng, Xun Huang, Karsten Kreis, Sanja Fidler, Ming-Yu Liu 0001, Tsung-Yi Lin. 300-309
- 3D Video Loops from Asynchronous Input. Li Ma, Xiaoyu Li, Jing Liao 0001, Pedro V. Sander. 310-320
- High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization. Jiaxin Xie, Hao Ouyang, Jingtan Piao, Chenyang Lei, Qifeng Chen. 321-331
- Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field. Leheng Li, Qing Lian, Luozhou Wang, Ningning Ma, Yingcong Chen. 332-341
- 3D GAN Inversion with Facial Symmetry Prior. Fei Yin, Yong Zhang 0034, Xuan Wang, Tengfei Wang 0002, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Öztireli, Yujiu Yang. 342-351
- StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping. Diqiong Jiang, Dan Song, Ruofeng Tong 0001, Min Tang 0001. 352-361
- FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction. Haoran Bai, Di Kang, Haoxian Zhang, Jinshan Pan, Linchao Bao. 362-371
- Robust Model-based Face Reconstruction through Weakly-Supervised Outlier Segmentation. Chunlu Li, Andreas Morel-Forster, Thomas Vetter, Bernhard Egger, Adam Kortylewski. 372-381
- Learning Neural Proto-Face Field for Disentangled 3D Face Modeling in the Wild. Zhenyu Zhang 0005, Renwang Chen, Weijian Cao, Ying Tai, Chengjie Wang. 382-393
- A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images. Biwen Lei, Jianqiang Ren, Mengyang Feng, Miaomiao Cui, Xuansong Xie. 394-403
- BlendFields: Few-Shot Example-Driven Facial Modeling. Kacper Kania, Stephan J. Garbin, Andrea Tagliasacchi, Virginia Estellers, Kwang Moo Yi, Julien Valentin, Tomasz Trzcinski, Marek Kowalski. 404-415
- Implicit Neural Head Synthesis via Controllable Local Deformation Fields. Chuhan Chen, Matthew O'Toole, Gaurav Bharaj, Pablo Garrido 0001. 416-426
- DPE: Disentanglement of Pose and Expression for General Video Portrait Editing. Youxin Pang, Yong Zhang, Weize Quan, Yanbo Fan, Xiaodong Cun, Ying Shan, Dong-Ming Yan 0001. 427-436
- GANHead: Towards Generative Animatable Neural Head Avatars. Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai. 437-447
- EDGE: Editable Dance Generation From Music. Jonathan Tseng, Rodrigo Castellon, C. Karen Liu. 448-458
- Unsupervised Volumetric Animation. Aliaksandr Siarohin, Willi Menapace, Ivan Skorokhodov, Kyle Olszewski, Jian Ren, Hsin-Ying Lee, Menglei Chai, Sergey Tulyakov. 458-469
- Blowing in the Wind: CycleNet for Human Cinemagraphs from Still Images. Hugo Bertiche, Niloy J. Mitra, Kuldeep Kulkarni, Chun-Hao Paul Huang, Tuanfeng Y. Wang, Meysam Madadi, Sergio Escalera, Duygu Ceylan. 459-468
- Generating Holistic 3D Human Motion from Speech. Hongwei Yi, Hualin Liang, YiFei Liu, Qiong Cao, YanDong Wen, Timo Bolkart, Dacheng Tao, Michael J. Black. 469-480
- Avatars Grow Legs: Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model. Yuming Du, Robin Kips, Albert Pumarola, Sebastian Starke, Ali K. Thabet, Artsiom Sanakoyeu. 481-490
- Learning Anchor Transformations for 3D Garment Animation. Fang Zhao, Zekun Li, Shaoli Huang, Junwu Weng, Tianfei Zhou, Guo-Sen Xie, Jue Wang, Ying Shan. 491-500
- CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition. Hongwen Zhang 0001, Siyou Lin, Ruizhi Shao, Yuxiang Zhang 0006, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu. 501-511
- ECON: Explicit Clothed humans Optimized via Normal integration. Yuliang Xiu, Jinlong Yang, Xu Cao, Dimitrios Tzionas, Michael J. Black. 512-523
- PersonNeRF: Personalized Reconstruction from Photo Collections. Chung-Yi Weng, Pratul P. Srinivasan, Brian Curless, Ira Kemelmacher-Shlizerman. 524-533
- 3D Human Mesh Estimation from Virtual Markers. Xiaoxuan Ma, Jiajun Su, Chunyu Wang, Wentao Zhu, Yizhou Wang 0001. 534-543
- Overcoming the TradeOff between Accuracy and Plausibility in 3D Hand Shape Reconstruction. Ziwei Yu, Chen Li 0038, Linlin Yang, Xiaoxu Zheng, Michael Bi Mi, Gim Hee Lee, Angela Yao. 544-553
- Recovering 3D Hand Mesh Sequence from a Single Blurry Image: A New Dataset and Temporal Unfolding. Yeonguk Oh, Joonkyu Park, Jaeha Kim, Gyeongsik Moon, Kyoung Mu Lee. 554-563
- MeMaHand: Exploiting Mesh-Mano Interaction for Single Image Two-Hand Reconstruction. Congyi Wang, Feida Zhu 0005, Shilei Wen. 564-573
- PLIKS: A Pseudo-Linear Inverse Kinematic Solver for 3D Human Body Estimation. Karthik Shetty, Annette Birkhold, Srikrishna Jaganathan, Norbert Strobel, Markus Kowarschik, Andreas K. Maier, Bernhard Egger. 574-584
- CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis. Juntian Zheng, Qingyuan Zheng, Lixing Fang, Yun Liu, Li Yi. 585-594
- Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream. Yuheng Jiang, Kaixin Yao, Zhuo Su 0006, Zhehao Shen, Haimin Luo, Lan Xu. 595-605
- BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects. Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Müller 0013, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield. 606-617
- Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes. Xuan Ju, Ailing Zeng, Jianan Wang, Qiang Xu, Lei Zhang 0001. 618-629
- Omnimatte3D: Associating Objects and Their Effects in Unconstrained Monocular Video. Mohammed Suhail, Erika Lu, Zhengqi Li, Noah Snavely, Leonid Sigal, Forrester Cole. 630-639
- On the Benefits of 3D Pose and Tracking for Human Action Recognition. Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Christoph Feichtenhofer, Jitendra Malik. 640-649
- Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization. Li'an Zhuo, Jian Cao, Qi Wang, Bang Zhang, Liefeng Bo. 650-659
- Human Pose as Compositional Tokens. Zigang Geng, Chunyu Wang, Yixuan Wei, Ze Liu, Houqiang Li, Han Hu 0001. 660-671
- PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation. Qihao Liu, Adam Kortylewski, Alan L. Yuille. 672-681
- SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments. Yudi Dai, Yitai Lin, Xiping Lin, Chenglu Wen, Lan Xu, Hongwei Yi, Siqi Shen, Yuexin Ma, Cheng Wang. 682-692
- Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module. Linzhi Huang, Yulong Li, Hongbo Tian, Yue Yang, Xiangang Li, Weihong Deng, Jieping Ye. 693-703
- Human Pose Estimation in Extremely Low-Light Conditions. Sohyun Lee, Jaesung Rim, Boseung Jeong, Geonu Kim, Byungju Woo, Haechan Lee, Sunghyun Cho, Suha Kwak. 704-714
- Flexible-Cm GAN: Towards Precise 3D Dose Prediction in Radiotherapy. Riqiang Gao, Bin Lou, Zhoubing Xu, Dorin Comaniciu, Ali Kamen. 715-725
- DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium. Antyanta Bangunharcana, Ahmed Magd, Kyung Soo Kim. 726-738
- A Rotation-Translation-Decoupled Solution for Robust and Efficient Visual-Inertial Initialization. Yijia He, Bo Xu 0022, Zhanpeng Ouyang, Hongdong Li. 739-748
- Semidefinite Relaxations for Robust Multiview Triangulation. Linus Härenstam-Nielsen, Niclas Zeller, Daniel Cremers. 749-757
- A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image. Zheheng Jiang, Hossein Rahmani, Sue Black 0002, Bryan M. Williams 0001. 758-768
- Instant Multi-View Head Capture through Learnable Registration. Timo Bolkart, Tianye Li, Michael J. Black. 768-779
- On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks. Hyunjun Jung, Patrick Ruhkamp, Guangyao Zhai, Nikolas Brasch, Yitong Li, Yannick Verdie, Jifei Song, Yiren Zhou, Anil Armagan, Slobodan Ilic, Ales Leonardis, Nassir Navab, Benjamin Busam. 780-791
- Learning 3D Scene Priors with 2D Supervision. Yinyu Nie, Angela Dai, Xiaoguang Han 0001, Matthias Nießner. 792-802
- OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation. Tong Wu, Jiarui Zhang, Xiao Fu, Yuxin Wang, Jiawei Ren, Liang Pan, Wayne Wu, Lei Yang, Jiaqi Wang, Chen Qian, Dahua Lin, Ziwei Liu. 803-814
- OpenScene: 3D Scene Understanding with Open Vocabularies. Songyou Peng, Kyle Genova, Chiyu "Max" Jiang, Andrea Tagliasacchi, Marc Pollefeys, Thomas A. Funkhouser. 815-824
- Multi-View Azimuth Stereo via Tangent Space Consistency. Xu Cao, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita. 825-834
- Progressive Transformation Learning for Leveraging Virtual Images in Training. Yi-Ting Shen, Hyungtae Lee, Heesung Kwon, Shuvra S. Bhattacharyya. 835-844
- Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries. Yuanwen Yue, Theodora Kontogianni, Konrad Schindler, Francis Engelmann. 845-854
- NeRF-Supervised Deep Stereo. Fabio Tosi, Alessio Tonioni, Daniele De Gregorio, Matteo Poggi. 855-866
- Semantic Scene Completion with Cleaner Self. Fengyun Wang, Dong Zhang, Hanwang Zhang, Jinhui Tang 0001, Qianru Sun. 867-877
- PanelNet: Understanding 360 Indoor Environment via Panel Representation. Haozheng Yu, Lu He, Bing Jian, Weiwei Feng, Shan Liu 0001. 878-887
- Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates. Avinash Paliwal, Andrii Tsarov, Nima Khademi Kalantari. 888-898
- Depth Estimation from Indoor Panoramas with Neural Scene Representation. Wenjie Chang, Yueyi Zhang, Zhiwei Xiong. 899-908
- NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation. Zehan Zheng, Danni Wu, Ruisi Lu, Fan Lu, Guang Chen 0001, Changjun Jiang. 909-918
- RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo. Changjiang Cai, Pan Ji, Qingan Yan, Yi Xu. 919-928
- NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization. Shitao Tang, Sicong Tang, Andrea Tagliasacchi, Ping Tan, Yasutaka Furukawa. 929-939
- MACARONS: Mapping and Coverage Anticipation with RGB Online Self-Supervision. Antoine Guédon, Tom Monnier, Pascal Monasse, Vincent Lepetit. 940-951
- vMAP: Vectorised Object Mapping for Neural Field SLAM. Xin Kong, Shikun Liu, Marwan Taher, Andrew J. Davison. 952-961
- Seeing a Rose in Five Thousand Ways. Yunzhi Zhang, Shangzhe Wu, Noah Snavely, Jiajun Wu 0001. 962-971
- Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking. Yihao Wang, Zhigang Wang 0002, Bin Zhao 0001, Dong Wang, Mulin Chen, Xuelong Li 0001. 972-981
- Seeing With Sound: Long-Range Acoustic Beamforming for Multimodal Scene Understanding. Praneeth Chakravarthula, Jim Aldon D'Souza, Ethan Tseng, Joe Bartusek, Felix Heide. 982-991
- Distilling Focal Knowledge from Imperfect Expert for 3D Object Detection. Jia Zeng, Li Chen, Hanming Deng, Lewei Lu, Junchi Yan, Yu Qiao, Hongyang Li. 992-1001
- AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers. Zechuan Li, Hongshan Yu, Zhengeng Yang, Tom Tongjia Chen, Naveed Akhtar. 1012-1021
- Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving. Yinpeng Dong, Caixin Kang, Jinlai Zhang, Zijian Zhu, Yikai Wang 0001, Xiao Yang, Hang Su, Xingxing Wei, Jun Zhu. 1022-1032
- Gaussian Label Distribution Learning for Spherical Image Object Detection. Hang Xu, Xinyuan Liu, Qiang Zhao 0005, Yike Ma, Chenggang Yan 0001, Feng Dai. 1033-1042
- Deep Depth Estimation from Thermal Image. Ukcheol Shin, Jinsun Park, In-So Kweon. 1043-1053
- LidarGait: Benchmarking 3D Gait Recognition with Point Clouds. Chuanfu Shen, Fan Chao, Wei Wu 0041, Rui Wang, George Q. Huang, Shiqi Yu 0001. 1054-1063
- Generalized UAV Object Detection via Frequency Domain Disentanglement. Kunyu Wang, Xueyang Fu, Yukun Huang, Chengzhi Cao, Gege Shi, Zheng-Jun Zha. 1064-1073
- Learning Compact Representations for LiDAR Completion and Generation. Yuwen Xiong, Wei-Chiu Ma, Jingkang Wang, Raquel Urtasun. 1074-1083
- CXTrack: Improving 3D Point Cloud Tracking with Contextual Information. Tian-Xing Xu, Yuanchen Guo, Yu-Kun Lai, Song-Hai Zhang. 1084-1093
- Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline. Wei Ji, Jingjing Li, Cheng Bian, Zongwei Zhou, Jiaying Zhao, Alan L. Yuille, Li Cheng 0001. 1094-1104
- LinK: Linear Kernel for LiDAR-based 3D Perception. Tao Lu, Xiang Ding, Haisong Liu, Gangshan Wu, Limin Wang. 1105-1115
- Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting. Tarasha Khurana, Peiyun Hu, David Held, Deva Ramanan. 1116-1124
- Curricular Object Manipulation in LiDAR-based Object Detection. Ziyue Zhu, Qiang Meng, Xiao Wang, Ke Wang, Liujiang Yan, Jian Yang. 1125-1135
- Delivering Arbitrary-Modal Semantic Segmentation. Jiaming Zhang 0001, Ruiping Liu, Hao Shi, Kailun Yang 0001, Simon Reiß, Kunyu Peng, Haodong Fu, Kaiwei Wang, Rainer Stiefelhagen. 1136-1147
- Robust Outlier Rejection for 3D Registration with Variational Bayes. Haobo Jiang, Zheng Dang, Zhen Wei, Jin Xie, Jian Yang, Mathieu Salzmann. 1148-1157
- 3D Human Keypoints Estimation from Point Clouds in the Wild without Human Labels. Zhenzhen Weng, Alexander S. Gorban, Jingwei Ji, Mahyar Najibi, Yin Zhou, Dragomir Anguelov. 1158-1167
- Self-Supervised Pre-Training with Masked Shape Prediction for 3D Scene Understanding. Li Jiang, Zetong Yang, Shaoshuai Shi, Vladislav Golyanik, Dengxin Dai, Bernt Schiele. 1168-1178
- ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding. Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu 0001, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese. 1179-1189
- Open-Vocabulary Point-Cloud Object Detection without 3D Annotation. Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang. 1190-1199
- FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer. Zhijian Liu, Xinyu Yang, Haotian Tang, Shang Yang, Song Han 0003. 1200-1211
- PointCMP: Contrastive Mask Prediction for Self-supervised Learning on Point Cloud Videos. Zhiqiang Shen, Xiaoxiao Sheng, Longguang Wang, Yulan Guo, Qiong Liu, Xi Zhou. 1212-1222
- E2PN: Efficient SE(3)-Equivariant Point Network. Minghan Zhu, Maani Ghaffari, William A. Clark, Huei Peng. 1223-1232
- Poly-PC: A Polyhedral Network for Multiple Point Cloud Tasks at Once. Tao Xie, ShiGuang Wang, Ke Wang, Linqi Yang, Zhiqiang Jiang, Xingcheng Zhang, Kun Dai, Ruifeng Li, Jian Cheng. 1233-1243
- Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering. Nan Zhang, Zhiyi Pan, Thomas H. Li, Wei Gao 0003, Ge Li 0002. 1244-1254
- BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration. Sheng Ao, Qingyong Hu, Hanyun Wang, Kai Xu 0004, Yulan Guo. 1255-1264
- TopDiG: Class-agnostic Topological Directional Graph Extraction from Remote Sensing Images. Bingnan Yang, Mi Zhang 0004, Zhan Zhang, Zhili Zhang, Xiangyun Hu. 1265-1274
- Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants with no False Negatives and no False Positives. Daniel Widdowson, Vitaliy Kurlin. 1275-1284
- Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation. Xu Zheng, Jinjing Zhu, Yexin Liu, Zidong Cao, Chong Fu, Lin Wang. 1285-1295
- CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes. Harshil Bhatia, Edith Tretschk, Zorah Lähner, Marcel Seelbach Benkner, Michael Moeller 0001, Christian Theobalt, Vladislav Golyanik. 1296-1305
- Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints. Guilherme A. Potje, Felipe Cadar, André Araujo, Renato Martins, Erickson R. Nascimento. 1306-1315
- Understanding and Improving Features Learned in Deep Functional Maps. Souhaib Attaiki, Maks Ovsjanikov. 1316-1326
- High-Frequency Stereo Matching Network. Haoliang Zhao, Huizhou Zhou, Yongjun Zhang, Jie Chen, Yitong Yang, Yong Zhao 0001. 1327-1336
- Rethinking Optical Flow from Geometric Matching Consistent Perspective. Qiaole Dong, Chenjie Cao, Yanwei Fu. 1337-1347
- Efficient Robust Principal Component Analysis via Block Krylov Iteration and CUR Decomposition. Shun Fang, Zhengqin Xu, Shiqian Wu, Shoulie Xie. 1348-1357
- VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation. Bingchen Yang, Haiyong Jiang, Hao Pan, Jun Xiao 0005. 1358-1367
- TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving. Shaoheng Fang, Zi Wang, Yiqi Zhong, Junhao Ge, Siheng Chen. 1368-1378
- Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving. Ben Agro, Quinlan Sykora, Sergio Casas 0002, Raquel Urtasun. 1379-1388
- UniSim: A Neural Closed-Loop Sensor Simulator. Ze Yang 0003, Yun Chen 0014, Jingkang Wang, Sivabalan Manivasagam, Wei-Chiu Ma, Anqi Joyce Yang, Raquel Urtasun. 1389-1399
- FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-Tail Trajectory Prediction. Yuning Wang, Pu Zhang, Lei Bai 0001, Jianru Xue. 1400-1409
- EqMotion: Equivariant Multi-Agent Motion Prediction with Invariant Interaction Reasoning. Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang. 1410-1420
- Lookahead Diffusion Probabilistic Models for Refining Mean Estimation. Guoqiang Zhang 0003, Kenta Niwa, W. Bastiaan Kleijn. 1421-1429
- Neural Volumetric Memory for Visual Locomotion Control. Ruihan Yang, Ge Yang, Xiaolong Wang 0004. 1430-1440
- Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention. Sounak Mondal, Zhibo Yang, Seoyoung Ahn, Dimitris Samaras, Gregory J. Zelinsky, Minh Hoai. 1441-1450
- DrapeNet: Garment Generation and Self-Supervised Draping. Luca De Luigi, Ren Li, Benoît Guillard, Mathieu Salzmann, Pascal Fua. 1451-1460
- Tracking Multiple Deformable Objects in Egocentric Videos. Mingzhen Huang, Xiaoxing Li, Jun Hu, Honghong Peng, Siwei Lyu. 1461-1471
- Good is Bad: Causality Inspired Cloth-debiasing for Cloth-changing Person Re-identification. Zhengwei Yang, Meng Lin, Xian Zhong, Yu Wu, Zheng Wang 0007. 1472-1481
- Micron-BERT: BERT-Based Facial Micro-Expression Recognition. Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li 0005, Susan Gauch, Han-Seok Seo, Khoa Luu. 1482-1492
- MARLIN: Masked Autoencoder for facial video Representation LearnINg. Zhixi Cai, Shreya Ghosh 0001, Kalin Stefanov, Abhinav Dhall, Jianfei Cai 0001, Hamid Rezatofighi, Reza Haffari, Munawar Hayat. 1493-1504
- StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator. Jiazhi Guan, Zhanwang Zhang, Hang Zhou, Tianshu Hu, Kaisiyuan Wang, Dongliang He, Haocheng Feng, Jingtuo Liu, Errui Ding, Ziwei Liu 0002, Jingdong Wang 0001. 1505-1515
- REALIMPACT: A Dataset of Impact Sound Fields for Real Objects. Samuel Clarke, Ruohan Gao, Mason Wang, Mark Rau, Julia Xu, Jui-Hsien Wang, Doug L. James, Jiajun Wu 0001. 1516-1525
- STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition. Xiaoyu Zhu, Po-Yao Huang 0001, Junwei Liang 0001, Celso M. de Melo, Alexander G. Hauptmann. 1526-1536
- Progressive Spatio-temporal Alignment for Efficient Event-based Motion Estimation. Xueyan Huang, Yueyi Zhang, Zhiwei Xiong. 1537-1546
- Event-Based Shape from Polarization. Manasi Muglikar, Leonard Bauersfeld, Diederik Paul Moeys, Davide Scaramuzza 0001. 1547-1556
- Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution. Yunfan Lu, Zipeng Wang, Minjie Liu, Hongjian Wang, Lin Wang. 1557-1567
- BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation. Junheum Park, Jintae Kim, Chang-Su Kim 0001. 1568-1577
- A Unified Pyramid Recurrent Network for Video Frame Interpolation. Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-Hee Hahm. 1578-1587
- Event-based Blurry Frame Interpolation under Blind Exposure. Wenming Weng, Yueyi Zhang, Zhiwei Xiong. 1588-1598
- FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka-Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li. 1599-1610
- POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery. Ce Zheng, Xianpeng Liu, Guo-Jun Qi, Chen Chen 0001. 1611-1620
- Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo. Yuesong Wang 0001, Zhaojie Zeng, Tao Guan, Wei Yang 0011, Zhuo Chen, Wenkai Liu, Luoyuan Xu, Yawei Luo. 1621-1630
- On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer. Zhenjie Yu, Shuang Li 0008, Yirui Shen, Chi Harold Liu, Shuigen Wang. 1631-1640
- Thermal Spread Functions (TSF): Physics-Guided Material Classification. Aniket Dashpute, Vishwanath Saragadam, Emma Alexander, Florian Willomitzer, Aggelos K. Katsaggelos, Ashok Veeraraghavan, Oliver Cossairt. 1641-1650
- Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution. Xuhai Chen, Jiangning Zhang, Chao Xu, Yabiao Wang, Chengjie Wang, Yong Liu 0007. 1651-1661
- Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement. Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang 0003, Jiwei Wei, Chongyi Li, Heng Tao Shen. 1662-1671
- CutMIB: Boosting Light Field Super-Resolution via Multi-View Image Blending. Zeyu Xiao, Yutong Liu, Ruisheng Gao, Zhiwei Xiong. 1672-1682
- sRGB Real Noise Synthesizing with Neighboring Correlation-Aware Noise Model. Zixuan Fu, Lanqing Guo, Bihan Wen. 1683-1691
- Masked Image Training for Generalizable Deep Image Denoising. Haoyu Chen, Jinjin Gu, Yihao Liu 0001, Salma Abdel Magid, Chao Dong, Qiong Wang, Hanspeter Pfister, Lei Zhu. 1692-1703
- DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration. Zhixin Wang, Ziying Zhang, Xiaoyun Zhang, Huangjie Zheng, Mingyuan Zhou, Ya Zhang, Yanfeng Wang. 1704-1713
- Learning Distortion Invariant Representation for Image Restoration from a Causality Perspective. Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen 0001. 1714-1724
- Perception-Oriented Single Image Super-Resolution using Optimal Objective Estimation. Seung-Ho Park, Young-Su Moon, Nam Ik Cho. 1725-1735
- Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder. Xinmiao Lin, Yikang Li 0001, Jenhao Hsiao, Chiuman Ho, Yu Kong. 1736-1745
- MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos. Zicheng Zhang, Wei Wu, Wei Sun 0029, Danyang Tu, Wei Lu 0021, Xiongkuo Min, Ying Chen, Guangtao Zhai. 1746-1755
- CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input. Senmao Tian, Ming Lu, Jiaming Liu, Yandong Guo, Yurong Chen 0001, Shunli Zhang. 1756-1765
- Initialization Noise in Image Gradients and Saliency Maps. Ann-Christin Woerl, Jan Disselhoff, Michael Wand. 1766-1775
- Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution. Jie-En Yao, Li-Yuan Tsao, Yi-Chen Lo, Roy Tseng, Chia-Che Chang, Chun-Yi Lee. 1776-1785
- Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit. Xiaohang Wang 0004, Xuanhong Chen, Bingbing Ni, Hang Wang, Zhengyan Tong, Yutian Liu. 1786-1795
- CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution. Jiezhang Cao, Qin Wang 0013, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang 0008, Yulun Zhang, Radu Timofte, Luc Van Gool. 1796-1807
- Multiplicative Fourier Level of Detail. Yishun Dou, Zhong Zheng, Qiaoqiao Jin, Bingbing Ni. 1808-1817
- Document Image Shadow Removal Guided by Color-Aware Background. Ling Zhang, Yinghao He, Qing Zhang, Zheng Liu, Xiaolong Zhang 0002, Chunxia Xiao. 1818-1827
- StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN. Hamza Pehlivan, Yusuf Dalva, Aysegul Dundar. 1828-1837
- TopNet: Transformer-Based Object Placement Network for Image Compositing. Sijie Zhu, Zhe Lin 0001, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen 0001. 1838-1847
- VecFontSDF: Learning to Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions. Zeqing Xia, Bojun Xiong, Zhouhui Lian. 1848-1857
- CF-Font: Content Fusion for Few-Shot Font Generation. Chi Wang, Min Zhou, Tiezheng Ge, Yuning Jiang, Hujun Bao, Weiwei Xu. 1858-1867
- SIEDOB: Semantic Image Editing by Disentangling Object and Background. Wuyang Luo, Su Yang, Xinjian Zhang, Weishan Zhang. 1868-1878
- MaskSketch: Unpaired Structure-guided Masked Image Generation. Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa. 1879-1889
- Text2Scene: Text-driven Indoor Scene Stylization with Part-Aware Details. Inwoo Hwang, Hyeonwoo Kim, Young Min Kim 0001. 1890-1899
- Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models. Qiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui, Tong Yu 0001, Zhe Lin, Yang Zhang 0001, Shiyu Chang. 1900-1910
- VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models. Ajay Jain, Amber Xie, Pieter Abbeel. 1911-1920
- Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation. Narek Tumanyan, Michal Geyer, Shai Bagon, Tali Dekel. 1921-1930
- Multi-Concept Customization of Text-to-Image Diffusion. Nupur Kumari, Bingliang Zhang, Richard Zhang 0001, Eli Shechtman, Jun-Yan Zhu. 1931-1941
- Unifying Layout Generation with a Decoupled Diffusion Model. Mude Hui, Zhizheng Zhang 0004, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu. 1942-1951
- BBDM: Image-to-Image Translation with Brownian Bridge Diffusion Models. Bo Li, Kaitao Xue, Bin Liu 0057, Yu-Kun Lai. 1952-1961
- Towards Practical Plug-and-Play Diffusion Models. Hyojun Go, Yunsung Lee, Jin Young Kim, Seunghyun Lee, Myeongho Jeong, Hyun Seung Lee, Seungtaek Choi. 1962-1971
- Post-Training Quantization on Diffusion Models. Yuzhang Shang, Zhihang Yuan, Bin Xie, Bingzhe Wu, Yan Yan 0002. 1972-1981
- DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation. Shuai Shen, Wenliang Zhao, Zibin Meng, Wanhua Li 0001, Zheng Zhu, Jie Zhou 0001, Jiwen Lu. 1982-1991
- Mask-Guided Matting in the Wild. KwanYong Park, Sanghyun Woo, Seoung Wug Oh, In-So Kweon, Joon-Young Lee. 1992-2001
- Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation. Mengqi Huang, Zhendong Mao, Quan Wang, Yongdong Zhang 0001. 2002-2011
- Compression-Aware Video Super-Resolution. Yingwei Wang, Takashi Isobe, Xu Jia, Xin Tao, Huchuan Lu, Yu-Wing Tai. 2012-2021
- Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN models. Nilesh A. Ahuja, Parual Datta, Bhavya Kanzariya, V. Srinivasa Somayazulu, Omesh Tickoo. 2022-2030
- DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos. Qi Zhao, M. Salman Asif, Zhan Ma. 2031-2040
- Polynomial Implicit Neural Representations For Large Diverse Datasets. Rajhans Singh, Ankita Shukla, Pavan K. Turaga. 2041-2051
- Learning Decorrelated Representations Efficiently Using Fast Fourier Transform. Yutaro Shigeto, Masashi Shimbo, Yuya Yoshikawa, Akikazu Takeuchi. 2052-2060
- SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer. Xuanyao Chen, Zhijian Liu, Haotian Tang, Li Yi, Hang Zhao, Song Han. 2061-2070
- N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution. Haram Choi, Jeongmin Lee 0003, Jihoon Yang. 2071-2081
- Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention. Xuran Pan, Tianzhu Ye, Zhuofan Xia, Shiji Song, Gao Huang. 2082-2091
- Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers. Siyuan Wei, Tianzhu Ye, Shen Zhang, Yao Tang, Jiajun Liang. 2092-2101
- Top-Down Visual Attention from Analysis by Synthesis. Baifeng Shi, Trevor Darrell, Xin Wang. 2102-2112
- Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks. Markus Frey, Christian F. Doeller, Caswell Barry. 2113-2121
- Masked Image Modeling with Local Multi-Scale Reconstruction. Haoqing Wang, Yehui Tang, Yunhe Wang 0001, Jianyuan Guo, Zhi-Hong Deng, Kai Han 0002. 2122-2131
- Siamese Image Modeling for Self-Supervised Vision Representation Learning. Chenxin Tao, Xizhou Zhu, Weijie Su 0002, Gao Huang, Bin Li, Jie Zhou, Yu Qiao 0006, Xiaogang Wang 0001, Jifeng Dai. 2132-2141
- MAGE: MAsked Generative Encoder to Unify Representation Learning and Image SynthesisTianhong Li, Huiwen Chang, Shlok Kumar Mishra, Han Zhang, Dina Katabi, Dilip Krishnan. 2142-2152 [doi]
- Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identificationYukang Zhang, Hanzi Wang. 2153-2162 [doi]
- DistilPose: Tokenized Pose Regression with Heatmap DistillationSuhang Ye, Yingyi Zhang, Jie Hu, Liujuan Cao, Shengchuan Zhang, Lei Shen, Jun Wang, Shouhong Ding, Rongrong Ji. 2163-2172 [doi]
- Graph Transformer GANs for Graph-Constrained House GenerationHao Tang 0005, Zhenyu Zhang 0005, Humphrey Shi, Bo Li, Ling Shao 0001, Nicu Sebe, Radu Timofte, Luc Van Gool. 2173-2182 [doi]
- Automatic High Resolution Wire Segmentation and RemovalMang Tik Chiu, Xuaner Zhang, Zijun Wei, YuQian Zhou, Eli Shechtman, Connelly Barnes, Zhe Lin, Florian Kainz, Sohrab Amirghodsi, Humphrey Shi. 2183-2192 [doi]
- Tree Instance Segmentation with Temporal Contour GraphAdnan Firoze, Cameron Wingren, Raymond A. Yeh, Bedrich Benes, Daniel G. Aliaga. 2193-2202 [doi]
- Dual-Path Adaptation from Image to Video TransformersJungin Park, Jiyoung Lee, Kwanghoon Sohn. 2203-2213 [doi]
- Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video LearningA. J. Piergiovanni, Weicheng Kuo, Anelia Angelova. 2214-2224 [doi]
- Modeling Video as Stochastic Processes for Fine-Grained Video Representation LearningHeng Zhang, Daqing Liu, Qi Zheng, Bing Su 0001. 2225-2234 [doi]
- Masked Motion Encoding for Self-Supervised Video Representation LearningXinyu Sun, Peihao Chen, Liangwei Chen, Changhao Li, Thomas H. Li, Mingkui Tan, Chuang Gan. 2235-2245 [doi]
- Boosting Video Object Segmentation via Space-Time Correspondence LearningYurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang. 2246-2256 [doi]
- Two-shot Video Object SegmentationKun Yan, Xiao Li, Fangyun Wei, Jinglu Wang, Chenbin Zhang, Ping Wang, Yan Lu. 2257-2267 [doi]
- Look Before You Match: Instance Understanding Matters in Video Object SegmentationJunke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang. 2268-2278 [doi]
- Spatial-then-Temporal Self-Supervised Learning for Video CorrespondenceRui Li, Dong Liu. 2279-2288 [doi]
- Few-Shot Referring Relationships in VideosYogesh Kumar, Anand Mishra 0001. 2289-2298 [doi]
- Vision Transformers are Parameter-Efficient Audio-Visual LearnersYan-Bo Lin, Yi-Lin Sung, Jie Lei 0003, Mohit Bansal, Gedas Bertasius. 2299-2309 [doi]
- Egocentric Video Task TranslationZihui Xue, Yale Song, Kristen Grauman, Lorenzo Torresani. 2310-2320 [doi]
- QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture GenerationSicheng Yang, Zhiyong Wu 0001, Minglei Li 0001, Zhensong Zhang, Lei Hao, Weihong Bao, Haolin Zhuang. 2321-2330 [doi]
- Co-speech Gesture Synthesis by Reinforcement Learning with Contrastive Pretrained RewardsMingyang Sun, Mengchen Zhao, Yaqing Hou, Minglei Li 0001, Huang Xu, Songcen Xu, Jianye Hao. 2331-2340 [doi]
- TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action RecognitionIshan Rajendrakumar Dave, Mamshad Nayeem Rizve, Chen Chen 0001, Mubarak Shah. 2341-2352 [doi]
- How can objects help action recognition?Xingyi Zhou, Anurag Arnab, Chen Sun 0002, Cordelia Schmid. 2353-2362 [doi]
- Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action RecognitionLilang Lin, Jiahang Zhang, Jiaying Liu 0001. 2363-2372 [doi]
- Decomposed Cross-Modal Distillation for RGB-based Temporal Action DetectionPilhyeon Lee, Taeoh Kim, Minho Shim, Dongyoon Wee, Hyeran Byun. 2373-2383 [doi]
- ASPnet: Action Segmentation with Shared-Private Representation of Multiple Data SourcesBeatrice van Amsterdam, Abdolrahim Kadkhodamohammadi, Imanol Luengo, Danail Stoyanov. 2384-2393 [doi]
- Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action LocalizationHuan Ren, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang 0001. 2394-2404 [doi]
- LOGO: A Long-Form Video Dataset for Group Action Quality AssessmentShiyi Zhang, Wenxun Dai, Sujia Wang, Xiangwei Shen, Jiwen Lu, Jie Zhou 0001, Yansong Tang. 2405-2414 [doi]
- Use Your Head: Improving Long-Tail Video RecognitionToby Perrett, Saptarshi Sinha, Tilo Burghardt, Majid Mirmehdi, Dima Damen. 2415-2425 [doi]
- Conditional Generation of Audio from Video via Foley AnalogiesYuexi Du, Ziyang Chen, Justin Salamon, Bryan Russell, Andrew Owens. 2426-2436 [doi]
- Weakly Supervised Video Representation Learning with Unaligned Text for Sequential VideosSixun Dong, Huazhang Hu, Dongze Lian, Weixin Luo, Yicheng Qian, Shenghua Gao. 2437-2447 [doi]
- You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed VideosXiang Fang, Daizong Liu, Pan Zhou, Guoshun Nan. 2448-2460 [doi]
- Connecting Vision and Language with Video Localized NarrativesPaul Voigtlaender, Soravit Changpinyo, Jordi Pont-Tuset, Radu Soricut, Vittorio Ferrari. 2461-2471 [doi]
- Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation LearningPeng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu 0030, Xiangyang Ji, Li Yuan, Jie Chen 0001. 2472-2482 [doi]
- Aligning Step-by-Step Instructional Diagrams to Video DemonstrationsJiahao Zhang, Anoop Cherian, Yanbin Liu, Yizhak Ben-Shabat, Cristian Rodriguez Opazo, Stephen Gould. 2483-2492 [doi]
- Make-A-Story: Visual Memory Conditioned Consistent Story GenerationTanzila Rahman, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Shweta Mahajan, Leonid Sigal. 2493-2502 [doi]
- Test of Time: Instilling Video-Language Models with a Sense of TimePiyush Bagad, Makarand Tapaswi, Cees G. M. Snoek. 2503-2516 [doi]
- How You Feelin'? Learning Emotions and Mental States in Movie ScenesDhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi. 2517-2528 [doi]
- Continuous Sign Language Recognition with Correlation NetworkLianyu Hu 0003, Liqing Gao, Zekang Liu, Wei Feng 0005. 2529-2539 [doi]
- DIP: Dual Incongruity Perceiving Network for Sarcasm DetectionChangsong Wen, Guoli Jia, Jufeng Yang. 2540-2550 [doi]
- Gloss Attention for Gloss-free Sign Language TranslationAoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao. 2551-2562 [doi]
- Object-Goal Visual Navigation via Effective Exploration of Relations Among Historical Navigation StatesHeming Du, Lincheng Li, Zi Huang, Xin Yu 0002. 2563-2573 [doi]
- Behavioral Analysis of Vision-and-Language Navigation AgentsZijiao Yang, Arjun Majumdar, Stefan Lee. 2574-2582 [doi]
- KERM: Knowledge Enhanced Reasoning for Vision-and-Language NavigationXiangyang Li, Zihan Wang, Jiahao Yang, Yaowei Wang, Shuqiang Jiang. 2583-2592 [doi]
- Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query LocalizationMengmeng Xu, Yanghao Li, Cheng-Yang Fu, Bernard Ghanem, Tao Xiang, Juan-Manuel Pérez-Rúa. 2593-2603 [doi]
- Efficient Multimodal Fusion via Interactive PromptingYaowei Li, Ruijie Quan, Linchao Zhu, Yi Yang. 2604-2613 [doi]
- NS3D: Neuro-Symbolic Grounding of 3D Objects and RelationsJoy Hsu, Jiayuan Mao, Jiajun Wu 0001. 2614-2623 [doi]
- Dynamic Inference with Grounding Based Vision and Language ModelsBurak Uzkent, Amanmeet Garg, Wentao Zhu, Keval Doshi, Jingru Yi, Xiaolong Wang, Mohamed Omar. 2624-2633 [doi]
- Improving Commonsense in Vision-Language Models via Knowledge Graph RiddlesShuquan Ye, Yujia Xie, Dongdong Chen 0001, Yichong Xu, Lu Yuan, Chenguang Zhu 0001, Jing Liao 0001. 2634-2645 [doi]
- 3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical LearningWei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang 0015, Yanning Zhang, Qi Wu 0001. 2646-2656 [doi]
- Teaching Structured Vision & Language Concepts to Vision & Language ModelsSivan Doveh, Assaf Arbelle, Sivan Harary, Eli Schwartz, Roei Herzig, Raja Giryes, Rogério Feris, Rameswar Panda, Shimon Ullman, Leonid Karlinsky. 2657-2668 [doi]
- FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion TasksXiao Han, Xiatian Zhu, Licheng Yu, Li Zhang 0040, Yi-Zhe Song, Tao Xiang. 2669-2680 [doi]
- Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language TasksHao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai. 2691-2700 [doi]
- Learning from Unique Perspectives: User-aware Saliency ModelingShi Chen, Nachiappan Valliappan, Shaolei Shen, Xinyu Ye, Kai Kohlhoff, Junfeng He. 2701-2710 [doi]
- CRAFT: Concept Recursive Activation FacTorization for ExplainabilityThomas Fel, Agustin Picard, Louis Béthune, Thibaut Boissin, David Vigouroux, Julien Colin, Rémi Cadène, Thomas Serre. 2711-2721 [doi]
- Doubly Right Object Recognition: A Why Prompt for Visual RationalesChengzhi Mao, Revant Teotia, Amrutha Sundar, Sachit Menon, Junfeng Yang, Xin Wang, Carl Vondrick. 2722-2732 [doi]
- Sketch2Saliency: Learning to Detect Salient Objects from Human DrawingsAyan Kumar Bhunia, Subhadeep Koley, Amandeep Kumar, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song. 2733-2743 [doi]
- PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image ClassificationMeike Nauta, Jörg Schlötterer, Maurice van Keulen, Christin Seifert. 2744-2753 [doi]
- CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or NotAneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Subhadeep Koley, Tao Xiang, Yi-Zhe Song. 2765-2775 [doi]
- iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for Visual RecognitionYixuan Wei, Yue Cao 0001, Zheng Zhang 0022, Houwen Peng, Zhuliang Yao, Zhenda Xie, Han Hu 0001, Baining Guo. 2776-2786 [doi]
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person RetrievalDing Jiang, Mang Ye. 2787-2797 [doi]
- Multi-Modal Representation Learning with Text-Driven Soft MasksJaeyoo Park, Bohyung Han. 2798-2807 [doi]
- Texts as Images in Prompt Tuning for Multi-Label Image RecognitionZixian Guo, Bowen Dong, Zhilong Ji, Jinfeng Bai, Yiwen Guo, Wangmeng Zuo. 2808-2817 [doi]
- Reproducible Scaling Laws for Contrastive Language-Image LearningMehdi Cherti, Romain Beaumont, Ross Wightman, Mitchell Wortsman, Gabriel Ilharco, Cade Gordon, Christoph Schuhmann, Ludwig Schmidt, Jenia Jitsev. 2818-2829 [doi]
- Multilateral Semantic Relations Modeling for Image Text RetrievalZheng Wang 0044, Zhenwei Gao, Kangshuai Guo, Yang Yang 0002, Xiaoming Wang, Heng Tao Shen. 2830-2839 [doi]
- Smallcap: Lightweight Image Captioning Prompted with Retrieval AugmentationRita Ramos, Bruno Martins 0001, Desmond Elliott, Yova Kementchedjhieva. 2840-2849 [doi]
- Probing Sentiment-Oriented PreTraining Inspired by Human Sentiment Perception MechanismTinglei Feng, Jiaxuan Liu, Jufeng Yang. 2850-2860 [doi]
- Prefix Conditioning Unifies Language and Label SupervisionKuniaki Saito, Kihyuk Sohn, Xiang Zhang, Chun-Liang Li, Chen-Yu Lee, Kate Saenko, Tomas Pfister. 2861-2870 [doi]
- Crossing the Gap: Domain Generalization for Image CaptioningYuchen Ren, Zhendong Mao, Shancheng Fang, Yan Lu, Tong He 0004, Hao Du, Yongdong Zhang, Wanli Ouyang. 2871-2880 [doi]
- A Bag-of-Prototypes Representation for Dataset-Level ApplicationsWeijie Tu, Weijian Deng, Tom Gedeon, Liang Zheng 0001. 2881-2892 [doi]
- CrowdCLIP: Unsupervised Crowd Counting via Vision-Language ModelDingkang Liang, Jiahao Xie, Zhikang Zou, Xiaoqing Ye, Wei Xu, Xiang Bai. 2893-2903 [doi]
- D²Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-Based TransformersJianfeng He, Yuan Gao, Tianzhu Zhang, Zhe Zhang, Feng Wu 0001. 2904-2914 [doi]
- Learning to Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic SpaceYong Zhang 0056, Yingwei Pan, Ting Yao, Rui Huang 0001, Tao Mei, Chang Wen Chen. 2915-2924 [doi]
- Relational Context Learning for Human-Object Interaction DetectionSanghyun Kim, Deunsol Jung, Minsu Cho. 2925-2934 [doi]
- Learning Open-Vocabulary Semantic Segmentation Models From Natural Language SupervisionJilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie. 2935-2944 [doi]
- Side Adapter Network for Open-Vocabulary Semantic SegmentationMengde Xu, Zheng Zhang, Fangyun Wei, Han Hu, Xiang Bai. 2945-2954 [doi]
- Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion ModelsJiarui Xu, Sifei Liu, Arash Vahdat, Wonmin Byeon, Xiaolong Wang, Shalini De Mello. 2955-2966 [doi]
- IFSeg: Image-free Semantic Segmentation via Vision-Language ModelSukmin Yun, Seong Hyeon Park, Paul Hongsuck Seo, Jinwoo Shin. 2967-2977 [doi]
- PartManip: Learning Cross-Category Generalizable Part Manipulation Policy from Point Cloud ObservationsHaoran Geng, Ziming Li, Yiran Geng, Jiayi Chen, Hao Dong 0003, He Wang 0010. 2978-2988 [doi]
- OneFormer: One Transformer to Rule Universal Image SegmentationJitesh Jain, Jiachen Li 0003, Mangtik Chiu, Ali Hassani 0001, Nikita Orlov, Humphrey Shi. 2989-2998 [doi]
- Delving into Shape-aware Zero-shot Semantic SegmentationXinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou. 2999-3009 [doi]
- CoMFormer: Continual Learning in Semantic and Panoptic SegmentationFabio Cermelli, Matthieu Cord, Arthur Douillard. 3010-3020 [doi]
- Learning to Segment Every Referring Object Point by PointMengxue Qu, Yu Wu 0011, Yunchao Wei, Wu Liu, Xiaodan Liang, Yao Zhao 0001. 3021-3030 [doi]
- Unsupervised Continual Semantic Adaptation Through Neural RenderingZhizheng Liu, Francesco Milano 0001, Jonas Frey, Roland Siegwart, Hermann Blum, Cesar Cadena 0001. 3031-3040 [doi]
- Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationFeng Li, Hao Zhang 0097, Huaizhe Xu, Shilong Liu, Lei Zhang 0001, Lionel M. Ni, Heung-Yeung Shum. 3041-3050 [doi]
- Transformer Scale Gate for Semantic SegmentationHengcan Shi, Munawar Hayat, Jianfei Cai 0001. 3051-3060 [doi]
- Style Projected Clustering for Domain Generalized Semantic SegmentationWei Huang, Chang Chen, Yong Li, Jiacheng Li, Cheng Li, Fenglong Song, Youliang Yan, Zhiwei Xiong. 3061-3071 [doi]
- Rethinking Few-Shot Medical Segmentation: A Vector Quantization ViewShiqi Huang, Tingfa Xu, Ning Shen, Feng Mu, Jianan Li. 3072-3081 [doi]
- Continual Semantic Segmentation with Automatic Memory Sample SelectionLanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu. 3082-3092 [doi]
- Token Contrast for Weakly-Supervised Semantic SegmentationLixiang Ru, Heliang Zheng, Yibing Zhan, Bo Du 0001. 3093-3102 [doi]
- Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation GraphRixin Zhou, Jiafu Wei, Qian Zhang, Ruihua Qi, Xi Yang, Chuntao Li. 3103-3113 [doi]
- Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic SegmentationXiaoyang Wang, Bingfeng Zhang, Limin Yu, Jimin Xiao. 3114-3123 [doi]
- Cut and Learn for Unsupervised Object Detection and Instance SegmentationXudong Wang 0007, Rohit Girdhar, Stella X. Yu, Ishan Misra. 3124-3134 [doi]
- Extracting Class Activation Maps from Non-Discriminative Features as wellZhaozheng Chen, Qianru Sun. 3135-3144 [doi]
- BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance SegmentationTianheng Cheng, Xinggang Wang, Shaoyu Chen, Qian Zhang, Wenyu Liu 0001. 3145-3154 [doi]
- Hierarchical Fine-Grained Image Forgery Detection and LocalizationXiao Guo, Xiaohong Liu, Zhiyuan Ren, Steven Grosz, Iacopo Masi, Xiaoming Liu 0002. 3155-3165 [doi]
- Towards Professional Level Crowd Annotation of Expert Domain DataPei Wang, Nuno Vasconcelos. 3166-3175 [doi]
- Unsupervised Object Localization: Observing the Background to Discover ObjectsOriane Siméoni, Chloé Sekkat, Gilles Puy, Antonín Vobecky, Éloi Zablocki, Patrick Pérez. 3176-3186 [doi]
- Semi-supervised learning made simple with self-supervised clusteringEnrico Fini, Pietro Astolfi, Karteek Alahari, Xavier Alameda-Pineda, Julien Mairal, Moin Nabi, Elisa Ricci 0001. 3187-3197 [doi]
- Unbalanced Optimal Transport: A Unified Framework for Object DetectionHenri De Plaen, Pierre-François De Plaen, Johan A. K. Suykens, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool. 3198-3207 [doi]
- DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object DetectionJiawei Ma, Yulei Niu, Jincheng Xu, Shiyuan Huang, Guangxing Han, Shih-Fu Chang. 3208-3218 [doi]
- CLIP the Gap: A Single Domain Generalization Approach for Object DetectionVidit Vidit, Martin Engilberge, Mathieu Salzmann. 3219-3229 [doi]
- Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown ObjectsWenteng Liang, Feng Xue, Yihao Liu, Guofeng Zhong, Anlong Ming. 3230-3239 [doi]
- Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object DetectionXinjiang Wang, Xingyi Yang, Shilong Zhang, Yijiang Li, Litong Feng, Shijie Fang, Chengqi Lyu, Kai Chen, Wayne Zhang. 3240-3249 [doi]
- Optimal Proposal Learning for Deployable End-to-End Pedestrian DetectionXiaolin Song, Binghui Chen, Pengyu Li, Jun-Yan He, Biao Wang, Yifeng Geng, Xuansong Xie, Honggang Zhang. 3250-3260 [doi]
- AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object DetectionYipeng Gao, Kun-Yu Lin, Junkai Yan, Yaowei Wang, Wei-Shi Zheng 0001. 3261-3271 [doi]
- Where is My Spot? Few-shot Image Generation via Latent Subspace OptimizationChenxi Zheng, Bangzhen Liu, Huaidong Zhang, Xuemiao Xu, Shengfeng He. 3272-3281 [doi]
- Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution DetectionFan Lu, Kai Zhu 0004, Wei Zhai, Kecheng Zheng, Yang Cao. 3282-3291 [doi]
- MAESTER: Masked Autoencoder Guided Segmentation at Pixel Resolution for Accurate, Self-Supervised Subcellular Structure RecognitionRonald Xie, Kuan Pang, Gary D. Bader, Bo Wang. 3292-3301 [doi]
- Orthogonal Annotation Benefits Barely-supervised Medical Image SegmentationHeng Cai, Shumeng Li, Lei Qi, Qian Yu, Yinghuan Shi, Yang Gao 0001. 3302-3311 [doi]
- RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure PredictionDonghao Zhou, Chunbin Gu, Junde Xu, Furui Liu, Qiong Wang 0001, Guangyong Chen, Pheng-Ann Heng. 3312-3322 [doi]
- Topology-Guided Multi-Class Cell Context Generation for Digital PathologyShahira Abousamra, Rajarsi Gupta 0001, Tahsin M. Kurç, Dimitris Samaras, Joel H. Saltz, Chao Chen 0012. 3323-3333 [doi]
- Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report GenerationMingjie Li 0006, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang. 3334-3343 [doi]
- Benchmarking Self-Supervised Learning on Diverse Pathology DatasetsMingu Kang, Heon Song, Seonwook Park, Donggeun Yoo, Sérgio Pereira. 3344-3354 [doi]
- Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive LearningKangning Liu, Weicheng Zhu, Yiqiu Shen, Sheng Liu, Narges Razavian, Krzysztof J. Geras, Carlos Fernandez-Granda. 3355-3365 [doi]
- Learning Expressive Prompting With Residuals for Vision TransformersRajshekhar Das, Yonatan Dukler, Avinash Ravichandran, Ashwin Swaminathan. 3366-3377 [doi]
- Detection of Out-of-Distribution Samples Using Binary Neuron Activation PatternsBartlomiej Olber, Krystian Radlak, Adam Popowicz, Michal Szczepankiewicz, Krystian Chachula. 3378-3387 [doi]
- Decoupling MaxLogit for Out-of-Distribution DetectionZihan Zhang, Xiang Xiang. 3388-3397 [doi]
- Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete LabelsZixuan Ding, Ao Wang, Hui Chen, Qiang Zhang, Pengzhang Liu, Yongjun Bao, Weipeng Yan, Jungong Han. 3398-3407 [doi]
- Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label ClassificationYoungwook Kim, Jae-Myung Kim, Jieun Jeong, Cordelia Schmid, Zeynep Akata, Jungwoo Lee 0001. 3408-3417 [doi]
- DivClust: Controlling Diversity in Deep ClusteringIoannis Maniadis Metaxas, Georgios Tzimiropoulos, Ioannis Patras. 3418-3428 [doi]
- Deep Semi-Supervised Metric Learning with Mixed Label PropagationFuren Zhuang, Pierre Moulin. 3429-3438 [doi]
- Leveraging Inter-Rater Agreement for Classification in the Presence of Noisy LabelsMaria Sofia Bucarelli, Lucas Cassano, Federico Siciliano, Amin Mantrach, Fabrizio Silvestri. 3439-3448 [doi]
- Modeling Inter-Class and Intra-Class Constraints in Novel Class DiscoveryWenbin Li, Zhichen Fan, Jing Huo, Yang Gao. 3449-3458 [doi]
- Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class DiscoveryMuli Yang, Liancheng Wang, Cheng Deng, Hanwang Zhang. 3459-3468 [doi]
- Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency is All You NeedTong Wei, Kai Gan. 3469-3478 [doi]
- PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category DiscoverySheng Zhang, Salman Khan, Zhiqiang Shen, Muzammal Naseer, Guangyi Chen, Fahad Shahbaz Khan. 3479-3488 [doi]
- Probabilistic Knowledge Distillation of Face EnsemblesJianqing Xu, Shen Li, Ailin Deng, Miao Xiong, Jiaying Wu, Jiaxiang Wu 0002, Shouhong Ding, Bryan Hooi. 3489-3498 [doi]
- Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed RecognitionZhipeng Zhou, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Wei Gong 0001. 3499-3509 [doi]
- Promoting Semantic Connectivity: Dual Nearest Neighbors Contrastive Learning for Unsupervised Domain GeneralizationYuchen Liu 0006, Yaoming Wang, Yabo Chen, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong. 3510-3519 [doi]
- Instance Relation Graph Guided Source-Free Domain Adaptive Object DetectionVibashan VS, Poojan Oza, Vishal M. Patel 0001. 3520-3530 [doi]
- MOT: Masked Optimal Transport for Partial Domain AdaptationYou-Wei Luo, Chuan-Xian Ren. 3531-3540 [doi]
- TOPLight: Lightweight Neural Networks with Task-Oriented Pretraining for Visible-Infrared RecognitionHao Yu, Xu Cheng, Wei Peng. 3541-3550 [doi]
- OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain AdaptationYe Liu, Lingfeng Qiao, Changchong Lu, Di Yin, Chen Lin, Haoyuan Peng, Bo Ren 0002. 3551-3560 [doi]
- Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game PerspectiveJinjing Zhu, Haotian Bai, Lin Wang. 3561-3571 [doi]
- ARO-Net: Learning Implicit Fields from Anchored Radial ObservationsYizhi Wang, Zeyu Huang, Ariel Shamir, Hui Huang 0004, Hao Zhang, Ruizhen Hu. 3572-3581 [doi]
- A Probabilistic Framework for Lifelong Test-Time AdaptationDhanajit Brahma, Piyush Rai. 3582-3591 [doi]
- Distribution Shift Inversion for Out-of-Distribution PredictionRunpeng Yu, Songhua Liu, Xingyi Yang, Xinchao Wang. 3592-3602 [doi]
- Learning Joint Latent Space EBM Prior Model for Multi-layer GeneratorJiali Cui, Ying Nian Wu, Tian Han 0001. 3603-3612 [doi]
- A Data-Based Perspective on Transfer LearningSaachi Jain, Hadi Salman, Alaa Khaddaj, Eric Wong 0001, Sung Min Park, Aleksander Madry. 3613-3622 [doi]
- A Meta-Learning Approach to Predicting Performance and Data RequirementsAchin Jain, Gurumurthy Swaminathan, Paolo Favaro, Hao Yang, Avinash Ravichandran, Hrayr Harutyunyan, Alessandro Achille, Onkar Dabeer, Bernt Schiele, Ashwin Swaminathan, Stefano Soatto. 3623-3632 [doi]
- Guided Recommendation for Model Fine-TuningHao Li, Charless C. Fowlkes, Hao Yang, Onkar Dabeer, Zhuowen Tu, Stefano Soatto. 3633-3642 [doi]
- EMT-NAS: Transferring architectural knowledge between tasks from different datasetsPeng Liao, Yaochu Jin, Wenli Du. 3643-3653 [doi]
- AttriCLIP: A Non-Incremental Learner for Incremental Knowledge LearningRunqi Wang, Xiaoyue Duan, Guoliang Kang, Jianzhuang Liu, Shaohui Lin, Songcen Xu, Jinhu Lv, Baochang Zhang 0001. 3654-3663 [doi]
- Batch Model Consolidation: A Multi-Task Model Consolidation FrameworkIordanis Fostiropoulos, Jiaye Zhu, Laurent Itti. 3664-3676 [doi]
- SmartAssign: Learning A Smart Knowledge Assignment Strategy for Deraining and DesnowingYinglong Wang, Chao Ma, Jianzhuang Liu. 3677-3686 [doi]
- TinyMIM: An Empirical Study of Distilling MIM Pre-trained ModelsSucheng Ren, Fangyun Wei, Zheng Zhang, Han Hu. 3687-3697 [doi]
- Computationally Budgeted Continual Learning: What Does Matter?Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet K. Dokania, Philip H. S. Torr, Ser-Nam Lim, Bernard Ghanem, Adel Bibi. 3698-3707 [doi]
- GradMA: A Gradient-Memory-based Accelerated Federated Learning with Alleviated Catastrophic ForgettingKangyang Luo, Xiang Li, Yunshi Lan, Ming Gao. 3708-3717 [doi]
- Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space DecouplingZhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie 0006, Lizhuang Ma. 3718-3727 [doi]
- Neuro-Modulated Hebbian Learning for Fully Test-Time AdaptationYushun Tang, Ce Zhang, Heng Xu, Shuoshuo Chen, Jie Cheng, Luziwei Leng, Qinghai Guo, Zhihai He. 3728-3738 [doi]
- Generalizing Dataset Distillation via Deep Generative PriorGeorge Cazenavette, Tongzhou Wang 0001, Antonio Torralba 0001, Alexei A. Efros, Jun-Yan Zhu. 3739-3748 [doi]
- Minimizing the Accumulated Trajectory Error to Improve Dataset DistillationJiawei Du, Yidi Jiang, Vincent Y. F. Tan, Joey Tianyi Zhou, Haizhou Li 0001. 3749-3758 [doi]
- Slimmable Dataset CondensationSonghua Liu, Jingwen Ye, Runpeng Yu, Xinchao Wang. 3759-3768 [doi]
- Sharpness-Aware Gradient Matching for Domain GeneralizationPengfei Wang, Zhaoxiang Zhang, Zhen Lei, Lei Zhang. 3769-3778 [doi]
- Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network TopologiesWonhyeok Choi, Sunghoon Im. 3779-3788 [doi]
- SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision BoundariesAhmed Imtiaz Humayun, Randall Balestriero, Guha Balakrishnan, Richard G. Baraniuk. 3789-3798 [doi]
- VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue DistributionJaeill Kim, Suhyun Kang, Duhun Hwang, Jungwook Shin, Wonjong Rhee. 3799-3810 [doi]
- Efficient On-Device Training via Gradient FilteringYuedong Yang, Guihong Li, Radu Marculescu. 3811-3820 [doi]
- Are Data-Driven Explanations Robust Against Out-of-Distribution Data?Tang Li, Fengchun Qiao, Mengmeng Ma 0002, Xi Peng 0005. 3821-3831 [doi]
- BiasAdv: Bias-Adversarial Augmentation for Model DebiasingJongin Lim 0002, Youngdong Kim, Byungjai Kim, Chanho Ahn, Jinwoo Shin, Eunho Yang, Seungju Han. 3832-3841 [doi]
- Q-DETR: An Efficient Low-Bit Quantized Detection TransformerSheng Xu, Yanjing Li, Mingbao Lin, Peng Gao, Guodong Guo, Jinhu Lü, Baochang Zhang 0001. 3842-3851 [doi]
- NIPQ: Noise proxy-based Integrated Pseudo-QuantizationJuncheol Shin, Junhyuk So, Sein Park, Seungyeop Kang, Sungjoo Yoo, Eunhyeok Park. 3852-3861 [doi]
- CUDA: Convolution-Based Unlearnable DatasetsVinu Sankar Sadasivan, Mahdi Soltanolkotabi, Soheil Feizi. 3862-3871 [doi]
- KD-DLGAN: Data Limited Image Generation via Knowledge DistillationKaiwen Cui, Yingchen Yu, Fangneng Zhan, ShengCai Liao, Shijian Lu, Eric P. Xing. 3872-3882 [doi]
- Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN TrainingSiddarth Asokan, Chandra Sekhar Seelamantula. 3883-3893 [doi]
- Efficient Verification of Neural Networks Against LVM-Based SpecificationsHarleen Hanspal, Alessio Lomuscio. 3894-3903 [doi]
- Bi-directional Feature Fusion Generative Adversarial Network for Ultra-high Resolution Pathological Image Virtual Re-stainingKexin Sun, Zhineng Chen, Gongwei Wang, Jun Liu, Xiongjun Ye, Yu-Gang Jiang. 3904-3913 [doi]
- DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly DetectionXuan Zhang, Shiyu Li, Xi Li 0010, Ping Huang, Jiulong Shan, Ting Chen. 3914-3923 [doi]
- OmniAL: A Unified CNN Framework for Unsupervised Anomaly LocalizationYing Zhao. 3924-3933 [doi]
- Federated Incremental Semantic SegmentationJiahua Dong, Duzhen Zhang, Yang Cong, Wei Cong, Henghui Ding, Dengxin Dai. 3934-3943 [doi]
- Re-Thinking Federated Active Learning Based on Inter-Class DiversitySangmook Kim, Sangmin Bae, Hwanjun Song, Se-Young Yun. 3944-3953 [doi]
- Federated Domain Generalization with Generalization AdjustmentRuipeng Zhang, Qinwei Xu, Jiangchao Yao, Ya Zhang, Qi Tian 0001, Yanfeng Wang. 3954-3963 [doi]
- On the Effectiveness of Partial Variance Reduction in Federated Learning with Heterogeneous DataBo Li, Mikkel N. Schmidt, Tommy S. Alstrøm, Sebastian U. Stich. 3964-3973 [doi]
- The Resource Problem of Using Linear Layer Leakage Attack in Federated LearningJoshua C. Zhao, Ahmed Roushdy Elkordy, Atul Sharma, Yahya H. Ezzeldin, Salman Avestimehr, Saurabh Bagchi. 3974-3983 [doi]
- Unlearnable Clusters: Towards Label-Agnostic Unlearnable ExamplesJiaming Zhang 0006, Xingjun Ma, Qi Yi, Jitao Sang, Yu-Gang Jiang, Yaowei Wang, Changsheng Xu. 3984-3993 [doi]
- Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection GeneralizationShichao Dong, Jin Wang, Renhe Ji, Jiajun Liang, Haoqiang Fan, Zheng Ge. 3994-4004 [doi]
- Backdoor Defense via Adaptively Splitting Poisoned DatasetKuofeng Gao, Yang Bai, Jindong Gu, Yong Yang, Shu-Tao Xia. 4005-4014 [doi]
- How to Backdoor Diffusion Models?Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho. 4015-4024 [doi]
- TrojViT: Trojan Insertion in Vision TransformersMengxin Zheng, Qian Lou, Lei Jiang 0001. 4025-4034 [doi]
- TrojDiff: Trojan Attacks on Diffusion Models with Diverse TargetsWeixin Chen, Dawn Song, Bo Li 0026. 4035-4044 [doi]
- Ensemble-based Blackbox Attacks on Dense PredictionZikui Cai, Yaoteng Tan, M. Salman Asif. 4045-4055 [doi]
- Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based AttacksYunrui Yu, Cheng-Zhong Xu 0001. 4056-4066 [doi]
- The Best Defense is a Good Offense: Adversarial Augmentation Against Adversarial AttacksIuri Frosio, Jan Kautz. 4067-4076 [doi]
- Adversarial Robustness via Random Projection FiltersMinjing Dong, Chang Xu 0002. 4077-4086 [doi]
- Jedi: Entropy-Based Localization and Removal of Adversarial PatchesBilel Tarchoun, Anouar Ben Khalifa, Mohamed-Ali Mahjoub, Nael B. Abu-Ghazaleh, Ihsen Alouani. 4087-4095 [doi]
- Exploring the Relationship Between Architectural Design and Adversarially Robust GeneralizationAishan Liu, Shiyu Tang, Siyuan Liang, Ruihao Gong, Boxi Wu, Xianglong Liu 0001, Dacheng Tao. 4096-4107 [doi]
- Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch CorruptionsYong Guo, David Stutz, Bernt Schiele. 4108-4118 [doi]
- Towards Effective Adversarial Textured 3D Meshes on Physical Face RecognitionXiao Yang, Chang Liu, Longlong Xu, Yikai Wang, Yinpeng Dong, Ning Chen, Hang Su, Jun Zhu. 4119-4128 [doi]
- AltFreezing for More General Video Face Forgery DetectionZhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Houqiang Li. 4129-4138 [doi]
- Passive Micron-Scale Time-of-Flight with Sunlight InterferometryAlankar Kotwal, Anat Levin, Ioannis Gkioulekas. 4139-4149 [doi]
- F2-NeRF: Fast Neural Radiance Field Training with Free Camera TrajectoriesPeng Wang 0099, Yuan Liu, Zhaoxi Chen 0009, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang. 4150-4159 [doi]
- NoPe-NeRF: Optimising Neural Radiance Field with No Pose PriorWenjing Bian, Zirui Wang, Kejie Li, Jia-Wang Bian. 4160-4169 [doi]
- BAD-NeRF: Bundle Adjusted Deblur Neural Radiance FieldsPeng Wang, Lingzhe Zhao, Ruijie Ma, Peidong Liu. 4170-4179 [doi]
- DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion ModelsJamie Wynn, Daniyar Turmukhambetov. 4180-4189 [doi]
- SPARF: Neural Radiance Fields from Sparse and Noisy PosesPrune Truong, Marie-Julie Rakotosaona, Fabian Manhardt, Federico Tombari. 4190-4200 [doi]
- Interactive Segmentation of Radiance FieldsRahul Goel, Dhawal Sirikonda, Saurabh Saini, P. J. Narayanan. 4201-4211 [doi]
- Temporal Interpolation is all You Need for Dynamic Neural Radiance FieldsSungheon Park, Minjung Son 0001, Seokhwan Jang, Young Chun Ahn, Ji-Yeon Kim, Nahyup Kang. 4212-4221 [doi]
- Compressing Volumetric Radiance Fields to 1 MBLingzhi Li 0002, Zhen Shen, Zhongshu Wang, Li Shen 0003, Liefeng Bo. 4222-4231 [doi]
- Multiscale Tensor Decomposition and Rendering Equation Encoding for View SynthesisKang Han, Wei Xiang 0001. 4232-4241 [doi]
- Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene StylizationYuechen Zhang, Zexin He, Jinbo Xing, Xufeng Yao, Jiaya Jia. 4242-4251 [doi]
- Representing Volumetric Videos as Dynamic MLP MapsSida Peng, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou. 4252-4262 [doi]
- Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense GridsWei Dong, Christopher B. Choy, Charles Loop, Or Litany, Yuke Zhu, Anima Anandkumar. 4263-4272 [doi]
- DynIBaR: Neural Dynamic Image-Based RenderingZhengqi Li, Qianqian Wang, Forrester Cole, Richard Tucker 0001, Noah Snavely. 4273-4284 [doi]
- Plateau-Reduced Differentiable Path TracingMichael Fischer, Tobias Ritschel 0001. 4285-4294 [doi]
- NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect IlluminationHaoqian Wu, Zhipeng Hu, Lincheng Li, Yongqiang Zhang, Changjie Fan, Xin Yu 0002. 4295-4304 [doi]
- WildLight: In-the-wild Inverse Rendering with a FlashlightZiang Cheng, Junxuan Li, Hongdong Li. 4305-4314 [doi]
- Relightable Neural Human Assets from Multi-view Gradient IlluminationsTaotao Zhou, Kai He, Di Wu, Teng Xu 0008, Qixuan Zhang, Kuixiang Shao, Wenzheng Chen, Lan Xu, Jingyi Yu. 4315-4327 [doi]
- DiffRF: Rendering-Guided 3D Radiance Field DiffusionNorman Müller, Yawar Siddiqui, Lorenzo Porzi, Samuel Rota Bulò, Peter Kontschieder, Matthias Nießner. 4328-4338 [doi]
- Analyzing Physical Impacts Using Transient Surface Wave ImagingTianyuan Zhang, Mark Sheinin, Dorian Chan, Mark Rau, Matthew O'Toole, Srinivasa G. Narasimhan. 4339-4348 [doi]
- Neural Kaleidoscopic Space SculptingByeongjoo Ahn, Michael DeZeeuw, Ioannis Gkioulekas, Aswin C. Sankaranarayanan. 4349-4358 [doi]
- Towards Unbiased Volume Rendering of Neural Implicit Surfaces with Geometry PriorsYongqiang Zhang, Zhipeng Hu, Haoqian Wu, Minda Zhao, Lincheng Li, Zhengxia Zou, Changjie Fan. 4359-4368 [doi]
- Neural Kernel Surface ReconstructionJiahui Huang, Zan Gojcic, Matan Atzmon, Or Litany, Sanja Fidler, Francis Williams. 4369-4379 [doi]
- MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled ConsistencyMingye Xu, Mutian Xu, Tong He 0004, Wanli Ouyang, Yali Wang 0001, Xiaoguang Han 0001, Yu Qiao 0001. 4380-4390 [doi]
- Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field InversionDario Pavllo, David Joseph Tan, Marie-Julie Rakotosaona, Federico Tombari. 4391-4401 [doi]
- DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene SynthesisYinghao Xu, Menglei Chai, Zifan Shi, Sida Peng, Ivan Skorokhodov, Aliaksandr Siarohin, Ceyuan Yang, Yujun Shen, Hsin-Ying Lee, Bolei Zhou, Sergey Tulyakov. 4402-4412 [doi]
- Heat Diffusion Based Multi-Scale and Geometric Structure-Aware Transformer for Mesh SegmentationChi Chong Wong. 4413-4422 [doi]
- Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular ImageYu Deng, Baoyuan Wang, Heung-Yeung Shum. 4423-4433 [doi]
- 3D-aware Conditional Image SynthesisKangle Deng, Gengshan Yang, Deva Ramanan, Jun-Yan Zhu. 4434-4445 [doi]
- VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANsAnna Frühstück, Nikolaos Sarafianos, Yuanlu Xu, Peter Wonka, Tony Tung. 4446-4455 [doi]
- SDFusion: Multimodal 3D Shape Completion, Reconstruction, and GenerationYen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander G. Schwing, Liangyan Gui. 4456-4465 [doi]
- Generating Part-Aware Editable 3D Shapes without 3D SupervisionKonstantinos Tertikas, Despoina Paschalidou, Boxiao Pan, Jeong-Joon Park, Mikaela Angelina Uy, Ioannis Z. Emiris, Yannis Avrithis, Leonidas J. Guibas. 4466-4478 [doi]
- NeuralLift-360: Lifting an in-the-Wild 2D Photo to A 3D Object with 360° ViewsDejia Xu, Yifan Jiang 0001, Peihao Wang, Zhiwen Fan, Yi Wang, Zhangyang Wang. 4479-4489 [doi]
- Implicit Identity Driven Deepfake Face Swapping DetectionBaojin Huang, Zhongyuan Wang, Jifan Yang, Jiaxin Ai, Qin Zou 0001, Qian Wang 0002, Dengpan Ye. 4490-4499 [doi]
- Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural FieldsRohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, K. Madhava Krishna, Srinath Sridhar 0002. 4500-4510 [doi]
- Improving Fairness in Facial Albedo Estimation via Visual-Textual CuesXingyu Ren, Jiankang Deng, Chao Ma 0004, Yichao Yan, Xiaokang Yang. 4511-4520 [doi]
- High-fidelity 3D Face Generation from Natural Language DescriptionsMenghua Wu, Hao Zhu, Linjia Huang, Yiyu Zhuang, Yuanxun Lu, Xun Cao. 4521-4530 [doi]
- DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face AlignmentHeyuan Li, Bo Wang, Yu Cheng, Mohan S. Kankanhalli, Robby T. Tan. 4531-4540 [doi]
- High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative PriorsYunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan. 4541-4551 [doi]
- 3DAvatarGAN: Bridging Domains for Personalized Editable AvatarsRameen Abdal, Hsin-Ying Lee, Peihao Zhu 0001, Menglei Chai, Aliaksandr Siarohin, Peter Wonka, Sergey Tulyakov. 4552-4562 [doi]
- RODIN: A Generative Model for Sculpting 3D Digital Avatars Using DiffusionTengfei Wang, Bo Zhang 0025, Ting Zhang, Shuyang Gu, Jianmin Bao, Tadas Baltrusaitis, JingJing Shen, Dong Chen 0003, Fang Wen 0001, Qifeng Chen, Baining Guo. 4563-4573 [doi]
- Instant Volumetric Head AvatarsWojciech Zielonka, Timo Bolkart, Justus Thies. 4574-4584 [doi]
- Synthesizing Photorealistic Virtual Humans Through Cross-Modal DisentanglementSiddarth Ravichandran, Ondrej Texler, Dimitar Dinev, Hyun-Jae Kang. 4585-4594 [doi]
- 3D Cinemagraphy from a Single ImageXingyi Li, Zhiguo Cao 0001, Huiqiang Sun, Jianming Zhang 0001, Ke Xian, Guosheng Lin. 4595-4605 [doi]
- TryOnDiffusion: A Tale of Two UNetsLuyang Zhu, Dawei Yang, Tyler Zhu, Fitsum Reda, William Chan, Chitwan Saharia, Mohammad Norouzi 0002, Ira Kemelmacher-Shlizerman. 4606-4615 [doi]
- Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand DisentanglementXingqun Qi, Chen Liu, Muyi Sun, Lincheng Li, Changjie Fan, Xin Yu 0002. 4616-4626 [doi]
- Normal-guided Garment UV Prediction for Human Re-texturingYasamin Jafarian, Tuanfeng Y. Wang, Duygu Ceylan, Jimei Yang, Nathan Carr 0001, Yi Zhou, Hyun Soo Park. 4627-4636 [doi]
- REC-MV: REconstructing 3D Dynamic Cloth from Monocular VideosLingteng Qiu, Guanying Chen, Jiapeng Zhou, Mutian Xu, Junle Wang, Xiaoguang Han 0001. 4637-4646 [doi]
- SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human ReconstructionYukang Cao, Kai Han 0001, Kwan-Yee K. Wong. 4647-4657 [doi]
- Handy: Towards a High Fidelity 3D Hand Shape and Appearance ModelRolandos-Alexandros Potamias, Stylianos Ploumpis, Stylianos Moschoglou, Vasileios Triantafyllou, Stefanos Zafeiriou. 4670-4680 [doi]
- Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete CounterpartsNikolas Lamb, Cameron Palmer, Benjamin Molloy, Sean Banerjee, Natasha Kholgade Banerjee. 4681-4691 [doi]
- Distilling Neural Fields for Real-Time Articulated Shape ReconstructionJeff Tan, Gengshan Yang, Deva Ramanan. 4692-4701 [doi]
- GANmouflage: 3D Object Nondetection with Texture FieldsRui Guo, Jasmine Collins, Oscar de Lima, Andrew Owens. 4702-4712 [doi]
- 3D Human Pose Estimation via Intuitive PhysicsShashank Tripathi, Lea Müller, Chun-Hao P. Huang, Omid Taheri, Michael J. Black, Dimitrios Tzionas. 4713-4725 [doi]
- Object pop-up: Can we infer 3D objects and their poses from human interactions alone?Ilya A. Petrov 0001, Riccardo Marin, Julian Chibane, Gerard Pons-Moll. 4726-4736 [doi]
- UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned PolicyYinzhen Xu, Weikang Wan, Jialiang Zhang, Haoran Liu, Zikang Shan, Hao Shen, Ruicheng Wang, Haoran Geng, Yijia Weng, Jiayi Chen, Tengyu Liu, Li Yi, He Wang. 4737-4746 [doi]
- Constrained Evolutionary Diffusion Filter for Monocular Endoscope TrackingXiongbiao Luo. 4747-4756 [doi]
- Visibility Aware Human-Object Interaction Tracking from Single RGB CameraXianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll. 4757-4768 [doi]
- Transformer-based Unified Recognition of Two Hands Manipulating ObjectsHoseong Cho, Chanwoo Kim, Jihyeon Kim, Seongyeong Lee, Elkhan Ismayilzada, SeungRyul Baek. 4769-4778 [doi]
- HuManiFlow: Ancestor-Conditioned Normalising Flows on SO(3) Manifolds for Human Pose and Shape Distribution EstimationAkash Sengupta, Ignas Budvytis, Roberto Cipolla. 4779-4789 [doi]
- 3D Human Pose Estimation with Spatio-Temporal Criss-Cross AttentionZhenhua Tang, Zhaofan Qiu, Yanbin Hao, Richang Hong, Ting Yao. 4790-4799 [doi]
- GFPose: Learning 3D Human Pose Prior with Gradient FieldsHai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang. 4800-4810 [doi]
- JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and TrackingEdward Vendrow, Duy-Tho Le, Jianfei Cai 0001, Hamid Rezatofighi. 4811-4820 [doi]
- Analyzing and Diagnosing Pose Estimation with AttributionsQiyuan He, Linlin Yang, Kerui Gu, Qiuxia Lin, Angela Yao. 4821-4830 [doi]
- Shape-Constraint Recurrent Flow for 6D Object Pose EstimationYang Hai, Rui Song 0003, Jiaojiao Li 0001, Yinlin Hu. 4831-4840 [doi]
- TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose EstimationHanzhi Chen, Fabian Manhardt, Nassir Navab, Benjamin Busam. 4841-4852 [doi]
- Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery from Sparse Image EnsembleChun-Han Yao, Wei-Chih Hung, Yuanzhen Li, Michael Rubinstein, Ming-Hsuan Yang 0001, Varun Jampani. 4853-4862 [doi]
- Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast SolutionBangyan Liao, Delin Qu, Yifei Xue, Huiqing Zhang, Yizhen Lao. 4863-4871 [doi]
- Revisiting the P3P ProblemYaqing Ding 0001, Jian Yang, Viktor Larsson, Carl Olsson, Kalle Åström. 4872-4880 [doi]
- Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable CategoriesSamarth Sinha, Roman Shapovalov, Jeremy Reizenstein, Ignacio Rocco, Natalia Neverova, Andrea Vedaldi, David Novotný. 4881-4891 [doi]
- MobileBrick: Building LEGO for 3D Reconstruction on Mobile DevicesKejie Li, Jia-Wang Bian, Robert Castle, Philip H. S. Torr, Victor Adrian Prisacariu. 4892-4901 [doi]
- EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene SupervisionJiahui Lei, Congyue Deng, Karl Schmeckpeper, Leonidas J. Guibas, Kostas Daniilidis. 4902-4912 [doi]
- GINA-3D: Learning to Generate Implicit Neural Assets in the WildBokui Shen, Xinchen Yan, Charles R. Qi, Mahyar Najibi, Boyang Deng, Leonidas J. Guibas, Yin Zhou, Dragomir Anguelov. 4913-4926 [doi]
- Habitat-Matterport 3D Semantics DatasetKarmesh Yadav, Ram Ramrakhya, Santhosh Kumar Ramakrishnan, Théophile Gervet, John Turner, Aaron Gokaslan, Noah Maestre, Angel Xuan Chang, Dhruv Batra, Manolis Savva, Alexander William Clegg, Devendra Singh Chaplot. 4927-4936 [doi]
- BUOL: A Bottom-Up Framework with Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single ImageTao Chu, Pan Zhang, Qiong Liu, Jiaqi Wang. 4937-4946 [doi]
- Panoptic Compositional Feature Field for Editable Scene Rendering with Network-Inferred Labels via Metric LearningXinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang. 4947-4957 [doi]
- A Light Touch Approach to Teaching Transformers Multi-view GeometryYash Bhalgat, João F. Henriques, Andrew Zisserman. 4958-4969 [doi]
- Learning to Render Novel Views from Wide-Baseline Stereo PairsYilun Du, Cameron Smith, Ayush Tewari, Vincent Sitzmann. 4970-4980 [doi]
- Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and StereoLukas Mehl, Jenny Schmalfuss, Azin Jahedi, Yaroslava Nalivayko, Andrés Bruhn. 4981-4991 [doi]
- EventNeRF: Neural Radiance Fields from a Single Colour Event CameraViktor Rudnev, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik. 4992-5002 [doi]
- LightedDepth: Video Depth Estimation in Light of Limited Inference View AnglesShengjie Zhu, Xiaoming Liu 0002. 5003-5012 [doi]
- Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display CameraRuicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Jinwei Gu, Chen Change Loy. 5013-5022 [doi]
- Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel ImageDonggun Kim, Hyeonjoong Jang, Inchul Kim, Min H. Kim 0001. 5023-5032 [doi]
- Trap Attention: Monocular Depth Estimation with Manual TrapsChao Ning, Hongping Gan. 5033-5043 [doi]
- Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and PosesEric Brachmann, Tommaso Cavallari, Victor Adrian Prisacariu. 5044-5053 [doi]
- Energy-Efficient Adaptive 3D SensingBrevin Tilmon, Zhanghao Sun, Sanjeev J. Koppal, Yicheng Wu, Georgios Evangelidis 0002, Ramzi Zahreddine, Gurunandan Krishnan, Sizhuo Ma, Jian Wang. 5054-5063 [doi]
- Incremental 3D Semantic Scene Graph Prediction from RGB SequencesShun-Cheng Wu, Keisuke Tateno, Nassir Navab, Federico Tombari. 5064-5074 [doi]
- Consistent Direct Time-of-Flight Video Depth Super-ResolutionZhanghao Sun, Wei Ye, Jinhui Xiong, Gyeongmin Choe, Jialiang Wang, Shuochen Su, Rakesh Ranjan. 5075-5085 [doi]
- Learning to Zoom and UnzoomChittesh Thavamani, Mengtian Li, Francesco Ferroni, Deva Ramanan. 5086-5095 [doi]
- FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D DetectionYuqi Wang, YunTao Chen, Zhaoxiang Zhang. 5096-5105 [doi]
- 3D Video Object Detection with Learnable Object-Centric Global OptimizationJiawei He 0002, YunTao Chen, Naiyan Wang, Zhaoxiang Zhang. 5106-5115 [doi]
- UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye ViewShengchao Zhou, Weizhou Liu, Chen Hu, Shuchang Zhou 0001, Chao Ma. 5116-5125 [doi]
- ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D DataHaojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. 5126-5135 [doi]
- Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU SupervisionQi Ming, Lingjuan Miao, Zhe Ma, Lin Zhao, Zhiqiang Zhou, Xuhui Huang, Yuanpei Chen, Yufei Guo. 5136-5145 [doi]
- SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial ExamplesHan Liu, Yuhao Wu, Zhiyuan Yu, Yevgeniy Vorobeychik, Ning Zhang. 5146-5155 [doi]
- Normalizing Flow based Feature Synthesis for Outlier-Aware Object DetectionNishant Kumar 0005, Sinisa Segvic, Abouzar Eslami, Stefan Gumhold. 5156-5165 [doi]
- OcTr: Octree-Based Transformer for 3D Object DetectionChao Zhou, Yanan Zhang 0005, Jiaxin Chen, Di Huang 0001. 5166-5175 [doi]
- HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic FusionSijie Wang, Qiyu Kang, Rui She, Wei Wang, Kai Zhao, Yang Song, Wee-Peng Tay. 5176-5185 [doi]
- LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera DistillationSong Wang, Wentong Li, Wenyu Liu 0005, Xiaolu Liu, Jianke Zhu. 5186-5195 [doi]
- MSF: Motion-guided Sequential Fusion for Efficient 3D Object Detection from Point Cloud SequencesChenhang He, Ruihuang Li, Yabin Zhang, Shuai Li, Lei Zhang. 5196-5205 [doi]
- SFD2: Semantic-Guided Feature Detection and DescriptionFei Xue, Ignas Budvytis, Roberto Cipolla. 5206-5216 [doi]
- Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous DrivingLucas Nunes, Louis Wiesmann, Rodrigo Marcuzzi, Xieyuanli Chen, Jens Behley, Cyrill Stachniss. 5217-5228 [doi]
- Unsupervised 3D Point Cloud Representation Learning by Triangle Constrained Contrast for Autonomous DrivingBo Pang, Hongchi Xia, Cewu Lu. 5229-5239 [doi]
- RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous DrivingAngelika Ando, Spyros Gidaris, Andrei Bursuc, Gilles Puy, Alexandre Boulch, Renaud Marlet. 5240-5250 [doi]