Abstract is missing.
- Enhancing Material Features Using Dynamic Backward Attention on Cross-Resolution PatchesYuwen Heng, Yihong Wu, Srinandan Dasmahapatra, Hansung Kim. 4 [doi]
- MAC: Mask-Augmentation for Motion-Aware Video Representation LearningArif Akar, Ufuk Umut Senturk, Nazli Ikizler-Cinbis. 5 [doi]
- Self-distillation and Uncertainty Boosting Self-supervised Monocular Depth EstimationHang Zhou, Sarah Taylor, David Greenwood 0001, Michal Mackiewicz. 7 [doi]
- TripleDNet: Exploring Depth Estimation with Self-Supervised Representation LearningUfuk Umut Senturk, Arif Akar, Nazli Ikizler-Cinbis. 8 [doi]
- Domain Generalization Capability Enhancement for Binary Neural NetworksJianming Ye, Shunan Mao, Shiliang Zhang. 13 [doi]
- Spatio-Temporal Learnable Proposals for End-to-End Video Object DetectionKhurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal. 18 [doi]
- Deep Image Harmonization by Bridging the Reality GapJunyan Cao, Wenyan Cong, Li Niu 0002, Jianfu Zhang 0003, Liqing Zhang 0001. 23 [doi]
- Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion AugmentationGeorgios Kouros, Shubham Shrivastava, Cédric Picron, Sushruth Nagesh, Punarjay Chakravarty, Tinne Tuytelaars. 26 [doi]
- Approximating Continuous Convolutions for Deep Network CompressionTheo W. Costain, Victor Adrian Prisacariu. 27 [doi]
- EpipolarNVS: leveraging on Epipolar geometry for single-image Novel View SynthesisGaëtan Landreau, Mohamed Tamaazousti. 30 [doi]
- Disentangling 3D Attributes from a Single 2D Image: Human Pose, Shape and GarmentXue Hu, Xinghui Li, Benjamin Busam, Yiren Zhou, Ales Leonardis, Shanxin Yuan. 31 [doi]
- Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision TransformerGuglielmo Camporese, Elena Izzo, Lamberto Ballan. 32 [doi]
- TAG: Boosting Text-VQA via Text-aware Visual Question-answer GenerationJun Wang 0090, Mingfei Gao, Yuqian Hu, Ramprasaath R. Selvaraju, Chetan Ramaiah, Ran Xu, Joseph F. JáJá, Larry Davis 0001. 33 [doi]
- MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent NetworksAngel Villar-Corrales, Ani Karapetyan, Andreas Boltres, Sven Behnke. 34 [doi]
- Training Binarized Neural Networks the Easy WayAlasdair Paren, Rudra P. K. Poudel. 35 [doi]
- LOCL: Learning Object-Attribute Composition using LocalizationSatish Kumar, A. S. M. Iftekhar, Ekta Prashnani, B. S. Manjunath. 37 [doi]
- Positive Pair Distillation Considered Harmful: Continual Meta Metric Learning for Lifelong Object Re-IdentificationKai Wang, Chenshen Wu, Andy Bagdanov, Xialei Liu, Shiqi Yang, Shangling Jui, Joost van de Weijer 0001. 38 [doi]
- Deep Clustering by Semantic Contrastive LearningJiabo Huang, Shaogang Gong. 39 [doi]
- Open-vocabulary Semantic Segmentation with Frozen Vision-Language ModelsChaofan Ma, Yuhuan Yang, Yan-Feng Wang, Ya Zhang 0002, Weidi Xie. 45 [doi]
- Pay Self-Attention to Audio-Visual NavigationYinfeng Yu, Lele Cao, Fuchun Sun 0001, Xiaohong Liu, Liejun Wang. 46 [doi]
- Energy-Based Residual Latent Transport for Unsupervised Point Cloud CompletionRuikai Cui, Shi Qiu, Saeed Anwar, Jing Zhang 0052, Nick Barnes. 48 [doi]
- ShowFace: Coordinated Face Inpainting with Memory-Disentangled Refinement NetworksZhuojie Wu, Xingqun Qi, Zijian Wang, Wanting Zhou, Kun Yuan, Muyi Sun, Zhenan Sun. 52 [doi]
- SearchTrack: Multiple Object Tracking with Object-Customized Search and Motion-Aware FeaturesZhong-Min Tsai, Yu-Ju Tsai, Chien-Yao Wang, Hong-Yuan Mark Liao, Youn-Long Lin, Yung-Yu Chuang. 55 [doi]
- MorphPool: Efficient Non-linear Pooling & Unpooling in CNNsRick Groenendijk, Leo Dorst, Theo Gevers. 56 [doi]
- Detailed Annotations of Chest X-Rays via CT Projection for Report UnderstandingConstantin Marc Seibold, Simon Reiß, M. Saquib Sarfraz, Matthias A. Fink, Victoria Mayer, Jan Sellner, Moon Sung Kim, Klaus H. Maier-Hein, Jens Kleesiek, Rainer Stiefelhagen. 58 [doi]
- Propagating Difference Flows for Efficient Video Super-ResolutionRuisheng Gao, Zeyu Xiao, Zhiwei Xiong. 60 [doi]
- One-Pot Multi-Frame DenoisingLujia Jin, Shi Zhao, Lei Zhu 0012, Qian Chen, Yanye Lu. 61 [doi]
- Automatic universal taxonomies for multi-domain semantic segmentationPetra Bevandic, Sinisa Segvic. 63 [doi]
- DiffSketching: Sketch Control Image Synthesis with Diffusion ModelsQiang Wang, Di Kong, Fengyin Lin, Yonggang Qi. 67 [doi]
- Re-Attention Transformer for Weakly Supervised Object LocalizationHui Su, Yue Ye, Zhiwei Chen, Mingli Song, Lechao Cheng. 70 [doi]
- SPARC: Sparse Render-and-Compare for CAD model alignment in a single RGB ImageFlorian Langer, Gwangbin Bae, Ignas Budvytis, Roberto Cipolla. 72 [doi]
- Hybrid Cost Volume Regularization for Memory-efficient Multi-view Stereo NetworksQingtian Zhu, Zizhuang Wei, Zhongtao Wang, Yisong Chen, Guoping Wang. 73 [doi]
- Rethinking Graph Neural Networks for Unsupervised Video Object SegmentationDaizong Liu, Wei Hu. 76 [doi]
- Event-based Non-Rigid Reconstruction from ContoursYuxuan Xue, Haolong Li, Stefan Leutenegger, Joerg Stueckler. 78 [doi]
- Beyond the CLS Token: Image Reranking using Pretrained Vision TransformersChao Zhang 0023, Stephan Liwicki, Roberto Cipolla. 80 [doi]
- Motion-Aware Graph Reasoning Hashing for Self-supervised Video RetrievalZiyun Zeng, Jinpeng Wang, Bin Chen 0011, YuTing Wang, Shu-Tao Xia. 82 [doi]
- Blind Removal of Facial Foreign ShadowsYaojie Liu, Andrew Z. Hou, Xinyu Huang, Liu Ren, Xiaoming Liu 0002. 88 [doi]
- StyleFaceUV: a 3D Face UV Map Generator for View-Consistent Face Image SynthesisWei-Chieh Chung, Jiankai Zhu, I-Chao Shen, Yu-Ting Wu, Yung-Yu Chuang. 89 [doi]
- Convolutional Sparse Coding Network Via Improved Proximal Gradient For Compressed Sensing Magnetic Resonance ImagingXiaofan Wang, Yali Zhang, Pengyu Li, Jinjia Wang. 90 [doi]
- Learning to Construct 3D Building Wireframes from 3D Line CloudsYicheng Luo, Jing Ren, Xuefei Zhe, Di Kang, Yajing Xu, Peter Wonka, Linchao Bao. 91 [doi]
- OSM: An Open Set Matting Framework with OOD Detection and Few-Shot LearningYuhongze Zhou, Issam Hadj Laradji, Liguang Zhou, Derek Nowrouzezahrai. 92 [doi]
- Subtask-dominated Supervised Pretraining Transfer Learning for Person SearchChuang Liu, Hua Yang 0001, Shibao Zheng. 94 [doi]
- XCon: Learning with Experts for Fine-grained Category DiscoveryYixin Fei, Zhongkai Zhao, Siwei Yang, Bingchen Zhao. 96 [doi]
- Style2NeRF: An Unsupervised One-Shot NeRF for Semantic 3D ReconstructionJames Charles, Wim Abbeloos, Daniel Olmeda Reino, Roberto Cipolla. 104 [doi]
- Visible Watermark Removal with Dynamic Kernel and Semantic-aware PropagationXing Zhao, Li Niu 0002, Liqing Zhang 0001. 106 [doi]
- A Simple Plugin for Transforming Images to Arbitrary ScalesQinye Zhou, Ziyi Li, Weidi Xie, Xiaoyun Zhang, Yan-Feng Wang, Ya Zhang 0002. 107 [doi]
- ELDA: Using Edges to Have an Edge on Semantic Segmentation Based UDATing-Hsuan Liao, Huang-Ru Liao, Shan-Ya Yang, Jie-En Yao, Li-Yuan Tsao, Hsu-Shen Liu, Chen-Hao Chao, Bo-Wun Cheng, Chia-Che Chang, Yi-Chen Lo, Chun-Yi Lee. 108 [doi]
- UV-Based 3D Hand-Object Reconstruction with Grasp OptimizationZiwei Yu, Linlin Yang, You Xie, Ping Chen, Angela Yao. 111 [doi]
- Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal ModelingHsin-Ying Lee, Hung-Ting Su, Bing-Chen Tsai, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu. 116 [doi]
- ARCSC-Net: An Approximate Residual Convolutional Sparse Coding Network For Compressed Sensing MRIQian Wang, Pengyu Li, Jinjia Wang. 120 [doi]
- Pro-DDPM: Progressive Growing of Variable Denoising Diffusion Probabilistic Models for Faster ConvergenceRohit Gandikota, Nicholas Brown. 121 [doi]
- Few-shot Semantic Segmentation with Support-induced Graph Convolutional NetworkJie Liu 0043, Yanqi Bao, Wenzhe Ying, Haochen Wang, Yang Gao, Jan-Jakob Sonke, Efstratios Gavves. 126 [doi]
- Pose-graph via Adaptive Image Re-orderingDaniel Barath, Jana Noskova, Ivan Eichhardt, Jiri Matas. 127 [doi]
- SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene ReconstructionYitong Xia, Hao Tang 0005, Radu Timofte, Luc Van Gool. 131 [doi]
- Multi-View Multi-Person 3D Pose Estimation with Uncalibrated Camera NetworksYan Xu, Kris Kitani. 132 [doi]
- Can I see an Example? Active Learning the Long Tail of Attributes and RelationsTyler L. Hayes, Maximillian Nickel, Christopher Kanan, Ludovic Denoyer, Arthur Szlam. 134 [doi]
- Bootstrapping Human Optical Flow and PoseAritro Roy Arko, Jim Little 0001, Kwang Moo Yi. 139 [doi]
- LW-ISP: A Lightweight Model with ISP and Deep LearningHongyang Chen, Kaisheng Ma. 148 [doi]
- Font Representation Learning via Paired-glyph MatchingJunho Cho, Kyuewang Lee, Jin Young Choi 0002. 149 [doi]
- Shape Preserving Facial Landmarks with Graph Attention NetworksAndrés Prados-Torreblanca, José Miguel Buenaposada, Luis Baumela. 155 [doi]
- KPE: Keypoint Pose Encoding for Transformer-based Image GenerationSoon Yau Cheong, Armin Mustafa, Andrew Gilbert. 163 [doi]
- ISG: I can See Your Gene ExpressionYan Yang, Liyuan Pan, Liu Liu 0009, Eric A. Stone. 173 [doi]
- Two-View Left Ventricular Segmentation and Ejection Fraction Estimation in 2D EchocardiogramsFrank Cally A Tabuco, Jose Donato A Magno, Nathaniel S. Orillaza Jr., Rani Ailyna V. Domingo, Prospero C. Naval. 176 [doi]
- Meta Transferring for DeblurringPo-Sheng Liu, Fu-Jen Tsai, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin. 181 [doi]
- Debiasing Image-to-Image Translation ModelsMd. Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell. 182 [doi]
- Learning Object-level Point Augmentor for Semi-supervised 3D Object DetectionCheng-Ju Ho, Chen-Hsuan Tai, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang 0001. 185 [doi]
- Dual Decision Improves Open-Set Panoptic SegmentationHaiming Xu, Hao Chen, Lingqiao Liu, Yufei Yin. 190 [doi]
- SeA: Selective Attention for Fine-grained Visual CategorizationYajie Chen, Huan Wang, Peiwen Pan. 191 [doi]
- Inharmonious Region Localization with Auxiliary Style FeaturePenghao Wu, Li Niu 0002, Liqing Zhang 0001. 197 [doi]
- Inharmonious Region Localization via Recurrent Self-ReasoningPenghao Wu, Li Niu 0002, Jing Liang 0007, Liqing Zhang 0001. 198 [doi]
- End-to-End Learning of Multi-category 3D Pose and Shape EstimationYigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc Van Gool. 200 [doi]
- GameCodec: Neural Cloud Gaming Video CodecHoang Le, Reza Pourreza 0002, Amir Said, Guillaume Sautière, Auke Wiggers. 204 [doi]
- Multi-hop Modulated Graph Convolutional Networks for 3D Human Pose EstimationJae-Yung Lee, Igil Kim. 207 [doi]
- Learning ODINAmir Jevnisek, Shai Avidan. 210 [doi]
- Weak-shot Semantic Segmentation by Transferring Semantic Affinity and BoundarySiyuan Zhou, Li Niu 0002, Jianlou Si, Chen Qian, Liqing Zhang 0001. 211 [doi]
- Self-Supervised Robustifying Guidance for Monocular 3D Face ReconstructionHitika Tiwari, Min-Hung Chen, Yi-Min Tsai, Hsien-Kai Kuo, Hung-Jen Chen, Kevin Jou, K. S. Venkatesh, Yong-Sheng Chen. 220 [doi]
- Ki-Pode: Keypoint-based Implicit Pose Distribution Estimation of Rigid ObjectsThorbjørn Mosekjær Iversen, Rasmus Laurvig Haugaard, Anders Glent Buch. 222 [doi]
- NeRD++: Improved 3D-mirror symmetry learning from a single imageYancong Lin, Silvia-Laura Pintea, Jan C. van Gemert. 223 [doi]
- Towards Unsupervised Sketch-based Image RetrievalConghui Hu, Yongxin Yang, Yunpeng Li, Timothy M. Hospedales, Yi-Zhe Song. 224 [doi]
- Feature Embedding by Template Matching as a ResNet BlockAda Gorgun, Yeti Ziya Gürbüz, A. Aydin Alatan. 225 [doi]
- In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze EstimationBolin Lai, Miao Liu, Fiona Ryan, James M. Rehg. 227 [doi]
- Selective Colour Restoration of Underwater SurfacesChau Yi Li, Andrea Cavallaro. 228 [doi]
- Semantic Segmentation with Active Semi-Supervised Representation LearningAneesh Rangnekar, Christopher Kanan, Matthew J. Hoffman 0001. 229 [doi]
- Privacy Vulnerability of Split Computing to Data-Free Model Inversion AttacksXin Dong, Hongxu Yin, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov, H. T. Kung 0001. 230 [doi]
- Hybrid-Learning Video Moment Retrieval across Multi-Domain LabelsWeitong Cai, Jiabo Huang, Shaogang Gong. 231 [doi]
- Unsupervised Domain Adaptive Fundus Image Segmentation with Few Labeled Source DataQianbi Yu, Dongnan Liu, Chaoyi Zhang, Xinwen Zhang, Weidong Cai 0001. 237 [doi]
- You Only Need 90K Parameters to Adapt Light: a Light Weight Transformer for Image Enhancement and Exposure CorrectionZiteng Cui, Kunchang Li, Lin Gu 0003, Shenghan Su, Peng Gao, Zhengkai Jiang 0001, Yu Qiao 0001, Tatsuya Harada. 238 [doi]
- Learnable Descriptive Convolutional Network for Face Anti-SpoofingPei-Kai Huang, Hui-Yu Ni, Yanqin Ni, Chiou-Ting Hsu. 239 [doi]
- Trident Pyramid Networks for Object DetectionCédric Picron, Tinne Tuytelaars. 241 [doi]
- CICC: Channel Pruning via the Concentration of Information and Contributions of ChannelsYihao Chen, Zhishan Li, Yingqing Yang, Lei Xie 0007, Yong Liu 0007, Longhua Ma, Shanqi Liu, Guanzhong Tian. 243 [doi]
- Content-Diverse Comparisons improve IQAWilliam Thong, José Costa Pereira, Sarah Parisot, Ales Leonardis, Steven McDonagh. 244 [doi]
- A Tri-Layer Plugin to Improve Occluded DetectionGuanqi Zhan, Weidi Xie, Andrew Zisserman. 250 [doi]
- Dress Well via Fashion Cognitive LearningKaicheng Pang, Xingxing Zou, Waikeung Wong. 251 [doi]
- EAPruning: Evolutionary Pruning for Vision Transformers and CNNsQingyuan Li, Bo Zhang 0046, Xiangxiang Chu. 258 [doi]
- Check Your Other Door! Creating Backdoor Attacks in the Frequency DomainHasan Abed Al Kader Hammoud, Bernard Ghanem. 259 [doi]
- USB: Universal-Scale Object Detection BenchmarkYosuke Shinya. 261 [doi]
- Edge Detection of Motion-Blurred Images based on GANsFeng Li, Jiyu Li. 266 [doi]
- LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion ModelsParamanand Chandramouli, Kanchana Vaishnavi Gandikota. 267 [doi]
- Exploring Localization for Self-supervised Fine-grained Contrastive LearningDi Wu, Siyuan Li, Zelin Zang, Stan Z. Li. 268 [doi]
- Learning to Wear: Details-Preserved Virtual Try-on via Disentangling Clothes and WearerSangho Lee, Seoyoung Lee, Joonseok Lee. 272 [doi]
- Rethinking the Evaluation of Unbiased Scene Graph GenerationXingchen Li, Long Chen 0016, Jian Shao, Shaoning Xiao, Songyang Zhang, Jun Xiao 0001. 279 [doi]
- Improving Gradient Paths for Binary Convolutional Neural NetworksBaozhou Zhu, H. Peter Hofstee, Jinho Lee, Zaid Al-Ars. 281 [doi]
- Dual Pyramid Generative Adversarial Networks for Semantic Image SynthesisShijie Li, Ming-Ming Cheng, Jürgen Gall. 285 [doi]
- Region-of-Interest Based Neural Video CompressionYura Perugachi-Diaz, Guillaume Sautière, Davide Abati, Yang Yang 0010, AmirHossein Habibian, Taco S. Cohen. 288 [doi]
- Compressing Video Calls using Synthetic Talking HeadsMadhav Agarwal, Anchit Gupta, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar. 289 [doi]
- Implicit texture mapping for multi-view video synthesisMohamed Ilyes Lakhal, Oswald Lanz, Andrea Cavallaro. 290 [doi]
- TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature GraftingMuhammad Hamza Sharif, Dmitry Demidov, Asif Hanif, Mohammad Yaqub, Min Xu. 293 [doi]
- Exemplar Learning for Medical Image SegmentationQing En, Yuhong Guo. 296 [doi]
- APSNet: Attention Based Point Cloud SamplingYang Ye, Xiulong Yang, Shihao Ji. 298 [doi]
- Rethinking Prototypical Contrastive Learning through Alignment, Uniformity and CorrelationShentong Mo, Zhun Sun, Chao Li 0013. 299 [doi]
- Learning visual representations for transfer learning by suppressing textureShlok Kumar Mishra, Anshul Shah, Ankan Bansal, Janit Anjaria, Jonghyun Choi, Abhinav Shrivastava, Abhishek Sharma 0001, David Jacobs 0001. 300 [doi]
- Towards Efficient Neural Scene Graphs by Learning Consistency FieldsYeji Song, Chaerin Kong, Seoyoung Lee, Nojun Kwak, Joonseok Lee. 302 [doi]
- Casual Indoor HDR Radiance Capture from Omnidirectional ImagesPulkit Gera, Mohammad Reza Karimi Dastjerdi, Charles Renaud, P. J. Narayanan, Jean-François Lalonde. 305 [doi]
- Towards Scalable Spectral Clustering via Spectrum-Preserving SparsificationYongyu Wang, Zhuo Feng. 307 [doi]
- G2Net: Generic Game-Theoretic Network for Partial-Label Image ClassificationRabab Abdelfattah, Xin Zhang, Mostafa M. Fouda, Xiaofeng Wang, Song Wang 0002. 309 [doi]
- Track Targets by Dense Spatio-Temporal Position EncodingJinkun Cao, Hao Wu, Kris Kitani. 311 [doi]
- Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic CountingWei Lin 0018, Kunlin Yang, Xinzhu Ma, Junyu Gao 0001, Lingbo Liu, Shinan Liu, Jun Hou, Shuai Yi, Antoni B. Chan. 313 [doi]
- Spatio-Temporal Fusion-based Monocular 3D Lane DetectionYin Wang, Qiuyi Guo, Peiwen Lin, Guangliang Cheng, Jian Wu. 314 [doi]
- Task Generalizable Spatial and Texture Aware Image Downsizing NetworkLin Ma 0002, Weiming Li, Hongsheng Li 0001, Qiang Wang, Ji-Yeon Kim. 315 [doi]
- Zero-shot Visual Commonsense Immorality PredictionYujin Jeong, Seongbeom Park, Suhong Moon, Jinkyu Kim. 320 [doi]
- Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language RecognitionYoungjoon Jang, Youngtaek Oh, Jae-Won Cho, Dong-Jin Kim 0003, Joon Son Chung, In-So Kweon. 322 [doi]
- Dist2: Distribution-Guided Distillation for Object DetectionTianchu Guo, Pengyu Li, Wei Liu, Bin Luo, Biao Wang. 323 [doi]
- Humans need not label more humans: Occlusion Copy & Paste for Occluded Human Instance SegmentationEvan Ling, Dezhao Huang, Minhoe Hur. 329 [doi]
- Learning Clothes-irrelevant Cues for Clothes-Changing Person Re-identificationJingyi Mu, Yong Li, Jun Li, Jian Yang. 337 [doi]
- DisPositioNet: Disentangled Pose and Identity in Semantic Image ManipulationAzade Farshad, Yousef Yeganeh, Helisa Dhamo, Federico Tombari, Nassir Navab. 340 [doi]
- VID-Trans-ReID: Enhanced Video Transformers for Person Re-identificationAishah Alsehaim, Toby P. Breckon. 342 [doi]
- Non-uniform Sampling Strategies for NeRF on 360° imagesTakashi Otonari, Satoshi Ikehata, Kiyoharu Aizawa. 344 [doi]
- ScannerNet: A Deep Network for Scanner-Quality Document Images under Complex IlluminationChih-Jou Hsu, Yu-Ting Wu, Ming-Sui Lee, Yung-Yu Chuang. 345 [doi]
- Resolving Semantic Confusions for Improved Zero-Shot DetectionSandipan Sarma, Sushil Kumar, Arijit Sur. 347 [doi]
- T4DT: Tensorizing Time for Learning Temporal 3D Visual DataMikhail Usvyatsov, Rafael Ballester, Lina Bashaeva, Konrad Schindler, Gonzalo Ferrer, Ivan V. Oseledets. 348 [doi]
- Unsupervised Flow Refinement near Motion BoundariesShuzhi Yu, Hannah Halin Kim, Shuai Yuan, Carlo Tomasi. 351 [doi]
- Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical AlignmentMustafa Shukor, Guillaume Couairon, Matthieu Cord. 353 [doi]
- Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action RecognitionKiyoon Kim, Shreyank N. Gowda, Oisin Mac Aodha, Laura Sevilla-Lara. 355 [doi]
- An Action Is Worth Multiple Words: Handling Ambiguity in Action RecognitionKiyoon Kim, Davide Moltisanti, Oisin Mac Aodha, Laura Sevilla-Lara. 356 [doi]
- Geometry Driven Progressive Warping for One-Shot Face AnimationYatao Zhong, Faezeh Amjadi, Ilya Zharkov. 357 [doi]
- Biologically Plausible Variational Policy Gradient with Spiking Recurrent Winner-Take-All NetworksZhile Yang, Shangqi Guo, Ying Fang, Jian Liu. 358 [doi]
- Dual-lens Reference Image Super-ResolutionJing Zhu, Wenbo Li, Hongxia Jin. 359 [doi]
- MaterialNet: Multi-scale Texture Hierarchy and Multi-view Surface Reflectance for Material Type RecognitionDongJin Lee, Seungkyu Lee. 361 [doi]
- Multiple Object Tracking from appearance by hierarchically clustering trackletsAndreu Girbau, Ferran Marqués, Shin'ichi Satoh 0001. 362 [doi]
- LcT: Locally-Enhanced Cross-Window Vision TransformerCanhui Wei, Huiwei Wang. 364 [doi]
- TetGAN: A Convolutional Neural Network for Tetrahedral Mesh GenerationWilliam Gao, April Wang, Gal Metzer, Raymond A. Yeh, Rana Hanocka. 365 [doi]
- Boosting Adversarial Robustness From The Perspective of Effective Margin RegularizationZiquan Liu, Antoni B. Chan. 367 [doi]
- Less is More: Facial Landmarks can Recognize a Spontaneous SmileMd. Tahrim Faroque, Yan Yang, Md. Zakir Hossain, Sheikh Motahar Naim, Nabeel Mohammed, Shafin Rahman. 369 [doi]
- CounTR: Transformer-based Generalised Visual CountingChang Liu, Yujie Zhong, Andrew Zisserman, Weidi Xie. 370 [doi]
- BOAT: Bilateral Local Attention Vision TransformerTan Yu, Gangming Zhao, Ping Li 0001, Yizhou Yu. 371 [doi]
- SSR: An Efficient and Robust Framework for Learning with Unknown Label NoiseChen Feng, Georgios Tzimiropoulos, Ioannis Patras. 372 [doi]
- Unsupervised Low Light Image Enhancement Transformer Based on Dual Contrastive LearningFengji Ma, Jinping Sun. 373 [doi]
- iiTransformer: A Unified Approach to Exploiting Local and Non-local Information for Image RestorationSoo-Min Kang, Youngchan Song, Hanul Shin, Tammy Lee. 377 [doi]
- Free-form 3D Scene Inpainting with Dual-stream GANRu-Fen Jheng, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu. 378 [doi]
- PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints DetectionShenwei Xie, Wanfeng Zheng, Zhenglin Xian, Junli Yang, Chuang Zhang, Ming Wu 0001. 381 [doi]
- Finding Directions in GAN's Latent Space for Neural Face ReenactmentStella Bounareli, Vasileios Argyriou, Georgios Tzimiropoulos. 383 [doi]
- Information Theoretic Representation DistillationRoy Miles, Adrián López Rodríguez, Krystian Mikolajczyk. 385 [doi]
- Ranking Aggregation with Interactive Feedback for Collaborative Person Re-identificationJi Huang, Chao Liang, Yue Zhang, Zhongyuan Wang 0001, Chunjie Zhang. 386 [doi]
- A Memory Transformer Network for Incremental LearningAhmet Iscen, Thomas Bird, Mathilde Caron, Alireza Fathi, Cordelia Schmid. 388 [doi]
- TaylorSwiftNet: Taylor Driven Temporal Modeling for Swift Future Frame PredictionMohammad Saber Pourheydari, Emad Bahrami Rad, Mohsen Fayyaz, Gianpiero Francesca, Mehdi Noroozi, Jürgen Gall. 389 [doi]
- CroCPS: Addressing Photometric Challenges in Self-Supervised Category-Level 6D Object Poses with Cross-Modal LearningPengyuan Wang 0002, Lorenzo Garattoni, Sven Meier, Nassir Navab, Benjamin Busam. 390 [doi]
- Robust Action Segmentation from Timestamp SupervisionYaser Souri, Yazan Abu Farha, Emad Bahrami Rad, Gianpiero Francesca, Jürgen Gall. 392 [doi]
- Variational Simultaneous Stereo Matching and Defogging in Low VisibilityYining Ding, Andrew M. Wallace, Sen Wang. 394 [doi]
- Sparse in Space and Time: Audio-visual Synchronisation with Trainable SelectorsVladimir Iashin, Weidi Xie, Esa Rahtu, Andrew Zisserman. 395 [doi]
- Segmentation Assisted U-shaped Multi-scale Transformer for Crowd CountingYifei Qian, Liangfei Zhang, Xiaopeng Hong, Carl Donovan, Ognjen Arandjelovic. 397 [doi]
- MUAD: Multiple Uncertainties for Autonomous Driving, a benchmark for multiple uncertainty types and tasksGianni Franchi, Xuanlong Yu, Andrei Bursuc, Angel Tena, Rémi Kazmierczak, Séverine Dubuisson, Emanuel Aldea, David Filliat. 398 [doi]
- Instance Segmentation of Dense and Overlapping Objects via LayeringLong Chen 0014, Yuli Wu, Dorit Merhof. 400 [doi]
- Semi-Supervised Object Detection with Object-wise Contrastive Learning and Regression UncertaintyHonggyu Choi, Zhixiang Chen 0003, Xuepeng Shi, Tae-Kyun Kim. 405 [doi]
- Revisiting Self-Supervised Contrastive Learning for Facial Expression RecognitionYuxuan Shu, Xiao Gu 0003, Guang-Zhong Yang, Benny P. L. Lo. 406 [doi]
- Flynet: Max it, Excite it, Quantize itLuis Guerra, Tom Drummond. 407 [doi]
- HSPA: Hough Space Pattern Analysis as an Answer to Local Description Ambiguities for 3D Pose EstimationFabrice Mayran de Chamisso, Boris Meden, Mohamed Tamaazousti. 411 [doi]
- Masked Supervised Learning for Semantic SegmentationHasib Zunair, Abdessamad Ben Hamza. 417 [doi]
- Fill in Fabrics: Body-Aware Self-Supervised Inpainting for Image-Based Virtual Try-OnHasib Zunair, Yan Gobeil, Samuel Mercier, Abdessamad Ben Hamza. 418 [doi]
- Selective Partial Domain AdaptationPengxin Guo, Jinjing Zhu, Yu Zhang. 420 [doi]
- Unified Negative Pair Generation toward Well-discriminative Feature Space for Face RecognitionJunuk Jung, Seonhoon Lee, Heung-Seon Oh, Yongjun Park 0003, Sungbin Son, Joochan Park. 421 [doi]
- Pyramid Region-based Slot Attention Network for Temporal Action Proposal GenerationShuaicheng Li, Feng Zhang, Rui-Wei Zhao, Kunlin Yang, Lingbo Liu, Rui Feng, Jun Hou. 424 [doi]
- RGB-T Multi-Modal Crowd Counting Based on TransformerZhengyi Liu, Wei Wu, Yacheng Tan, Guanghui Zhang. 427 [doi]
- Disentangling based Environment-Robust Feature Learning for Person ReIDYifan Liu, Yali Li 0001, Shengjin Wang. 428 [doi]
- STPLS3D: A Large-Scale Synthetic and Real Aerial Photogrammetry 3D Point Cloud DatasetMeida Chen, Qingyong Hu, Zifan Yu, Hugues Thomas, Andrew Feng, Yu Hou, Kyle McCullough, Fengbo Ren, Lucio Soibelman. 429 [doi]
- Dual-Pixel Raindrop RemovalYizhou Li, Yusuke Monno, Masatoshi Okutomi. 439 [doi]
- Disentangling Content and Motion for Text-Based Neural Video ManipulationLevent Karacan, Tolga Kerimoglu, Ismail Inan, Tolga Birdal, Erkut Erdem, Aykut Erdem. 443 [doi]
- MagFormer: Hybrid Video Motion Magnification Transformer from Eulerian and Lagrangian PerspectivesSicheng Gao, Yutang Feng, Linlin Yang, Xuhui Liu, Zichen Zhu, David S. Doermann, Baochang Zhang 0001. 444 [doi]
- Defect Transfer GAN: Diverse Defect Synthesis for Data AugmentationRuyu Wang, Sabrina Hoppe, Eduardo Monari, Marco F. Huber. 445 [doi]
- Towards Robust In-domain and Out-of-Domain Generalization: Contrastive Learning with Prototype Alignment and Collaborative AttentionYuan-Jhe Kuo, Cheng-Yu Yang, Chiou-Ting Hsu. 446 [doi]
- A Unified Mixture-View Framework for Unsupervised Representation LearningXiangxiang Chu, Xiaohang Zhan, Bo Zhang 0046. 447 [doi]
- CLAD: A Contrastive Learning based Approach for Background DebiasingKe Wang, Harshitha Machiraju, Oh-Hyeon Choung, Michael H. Herzog, Pascal Frossard. 449 [doi]
- Fractional Optimization Model for Infrared and Visible Image FusionKang Zhang, Shiwei Wu, Zhiliang Wu, Xia Yuan, Chunxia Zhao. 458 [doi]
- Doubly Contrastive End-to-End Semantic Segmentation for Autonomous Driving under Adverse WeatherJongoh Jeong, Jong-Hwan Kim. 460 [doi]
- Object Tracking Network Based on Deformable Attention MechanismKexin Chen, Baojie Fan, Xiaobin Guo. 469 [doi]
- Trans2k: Unlocking the Power of Deep Models for Transparent Object TrackingAlan Lukezic, Ziga Trojer, Jiri Matas, Matej Kristan. 470 [doi]
- Multi-body Self-CalibrationAndrea Porfiri Dal Cin, Giacomo Boracchi, Luca Magri. 471 [doi]
- Anomaly Detection and Localization Using Attention-Guided Synthetic Anomaly and Test-Time AdaptationBehzad Bozorgtabar, Dwarikanath Mahapatra, Jean-Philippe Thiran. 472 [doi]
- K-Space Transformer for Undersampled MRI ReconstructionZiheng Zhao, Tianjiao Zhang, Weidi Xie, Yan-Feng Wang, Ya Zhang. 473 [doi]
- SGENet: Spatial Guided Enhancement Network for Image Motion DeblurringYu-Chieh Wang, Chia-Hung Yeh. 474 [doi]
- On the Importance of Image Encoding in Automated Chest X-Ray Report GenerationOtabek Nazarov, Mohammad Yaqub, Karthik Nandakumar. 475 [doi]
- IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its UncertaintyGwangbin Bae, Ignas Budvytis, Roberto Cipolla. 476 [doi]
- Neighbor Regularized Bayesian Optimization for Hyperparameter OptimizationLei Cui, Yangguang Li, Xin Lu, Dong An, Fenggang Liu. 479 [doi]
- Fixed Point Layers for Geodesic Morphological OperationsSantiago Velasco-Forero, Ayoub Rhim, Jesús Angulo. 480 [doi]
- Unleashing the Potential of Vision-Language Models for Long-Tailed Visual RecognitionTeli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li 0001, Baochang Zhang 0001, Peng Gao, Yu Qiao 0001. 481 [doi]
- BaseTransformers: Attention over base data-points for One Shot LearningMayug Maniparambil, Kevin McGuinness, Noel E. O'Connor. 482 [doi]
- SAGE: Saliency-Guided Mixup with Optimal RearrangementsAvery Ma, Nikita Dvornik, Ran Zhang, Leila Pishdad, Konstantinos G. Derpanis, Afsaneh Fazly. 484 [doi]
- Polycentric Clustering and Structural Regularization for Source-free Unsupervised Domain AdaptationXinyu Guan, Han Sun, Ningzhong Liu, Huiyu Zhou 0001. 485 [doi]
- Cluster-level pseudo-labelling for source-free cross-domain facial expression recognitionAlessandro Conti, Paolo Rota, Yiming Wang, Elisa Ricci 0001. 486 [doi]
- Information Removal at the bottleneck in Deep Neural NetworksEnzo Tartaglione. 488 [doi]
- Enhancing Person Synthesis in Complex Scenes via Intrinsic and Contextual Structure ModelingXi Tian, Yongliang Yang, Qi Wu. 491 [doi]
- Classification of Biomedical Journal Images using Retargeting-Based Data Augmentation and Visually Explainable Attention PriorsVinit Veerendraveer Singh, Chandra Kambhamettu. 497 [doi]
- Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image RetrievalAbhra Chaudhuri, Massimiliano Mancini, Yanbei Chen, Zeynep Akata, Anjan Dutta 0001. 499 [doi]
- Beyond Deterministic Translation for Unsupervised Domain AdaptationEleni Chiou, Eleftheria Panagiotaki, Iasonas Kokkinos. 501 [doi]
- Stating Comparison Score Uncertainty and Verification Decision Confidence Towards Transparent Face RecognitionMarco Huber, Philipp Terhörst, Florian Kirchbuchner, Naser Damer, Arjan Kuijper. 506 [doi]
- Why Do Self-Supervised Models Transfer? On the Impact of Invariance on Downstream TasksLinus Ericsson, Henry Gouk, Timothy M. Hospedales. 509 [doi]
- CNeRV: Content-adaptive Neural Representation for Visual DataHao Chen 0066, Matthew Gwilliam, Bo He, Ser-Nam Lim, Abhinav Shrivastava. 510 [doi]
- Teaching StyleGAN to Read: Improving Text-to-image Synthesis with U2C Transfer LearningVinicius G. Pereira, Jonatas Wehrmann. 512 [doi]
- An Empirical Verification of Wide Networks TheoryDario Balboni, Davide Bacciu. 517 [doi]
- Face editing using a regression-based approach in the StyleGAN latent spaceSaeid Motiian, Siavash Khodadadeh, Shabnam Ghadar, Baldo Faieta, Ladislau Bölöni. 522 [doi]
- ORA3D: Overlap Region Aware Multi-view 3D Object DetectionWonseok Roh, Gyusam Chang, Seokha Moon, Giljoo Nam, Chan Young Kim, Younghyun Kim, Sangpil Kim, Jinkyu Kim. 526 [doi]
- Robust normalizing flows using Bernstein-type polynomialsSameera Ramasinghe, Kasun Fernando, Salman Khan 0001, Nick Barnes. 532 [doi]
- Unconditional Image-Text Pair Generation with Multimodal Cross QuantizerHyunGyung Lee, Sungjin Park, Joonseok Lee, Edward Choi. 533 [doi]
- Distilling Representational Similarity using Centered Kernel Alignment (CKA)Aninda Saha, Alina Bialkowski, Sara Khalifa. 535 [doi]
- Centered Symmetric Quantization for Hardware-Efficient Low-Bit Neural NetworksFaaiz Asim, Jaewoo Park, Azat Azamat, Jongeun Lee. 538 [doi]
- On Temporal Granularity in Self-Supervised Video Representation LearningRui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu 0005, Matthew Brown, Serge J. Belongie, Ming-Hsuan Yang 0001, Hartwig Adam, Yin Cui. 541 [doi]
- RORD: A Real-world Object Removal DatasetMin-Cheol Sagong, Yoon-Jae Yeo, Seung-Won Jung, Sung Jea Ko. 542 [doi]
- CLIPFont: Text Guided Vector WordArt GenerationYiren Song, Yuxuan Zhang. 543 [doi]
- Learning to Segment Object Affordances on Synthetic Data for Task-oriented Robotic HandoversAlbert Christensen, Daniel Lehotský, Marius W. Jørgensen, Dimitris Chrysostomou. 544 [doi]
- Adversarial Pixel Restoration as a Pretext Task for Transferable PerturbationsHashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Naseer, Salman Khan 0001, Fahad Shahbaz Khan. 546 [doi]
- Towards Self-Supervised Gaze EstimationArya Farkhondeh, Cristina Palmero, Simone Scardapane, Sergio Escalera. 549 [doi]
- Multi-View Neural Surface Reconstruction with Structured LightChunyu Li, Taisuke Hashimoto, Eiichi Matsumoto, Hiroharu Kato. 550 [doi]
- Are we pruning the correct channels in image-to-image translation models?Yiyong Li, Zhun Sun, Chao Li 0013. 551 [doi]
- Local Feature Extraction from Salient Regions by Feature Map TransformationYerim Jung, Nur Suriza Syazwany, Sang-Chul Lee. 552 [doi]
- Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating MotionSubhabrata Choudhury, Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht 0001. 554 [doi]
- Masked Vision-Language Transformers for Scene Text RecognitionJie Wu, Ying Peng, Shengming Zhang, Weigang Qi, Jian Zhang. 555 [doi]
- SP-ViT: Learning 2D Spatial Priors for Vision TransformersYuxuan Zhou, Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Lei Zhang 0006, Margret Keuper, Xian-Sheng Hua 0001. 564 [doi]
- Flow-based GAN for 3D Point Cloud Generation from a Single ImageYao Wei, George Vosselman, Michael Ying Yang. 569 [doi]
- Membership Privacy-Preserving GANHeonseok Ha, Uiwon Hwang, Jaehee Jang, Ho Bae, Sungroh Yoon. 576 [doi]
- Event Transformer FlowNet for optical flow estimationYi Tian, Juan Andrade-Cetto. 577 [doi]
- Robustifying the Multi-Scale Representation of Neural Radiance FieldsNishant Jain, Suryansh Kumar, Luc Van Gool. 578 [doi]
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained ModelsOmiros Pantazis, Gabriel J. Brostow, Kate E. Jones, Oisin Mac Aodha. 580 [doi]
- Image-to-Image Translation with Text GuidanceBowen Li 0001, Philip H. S. Torr, Thomas Lukasiewicz. 581 [doi]
- SalLiDAR: Saliency Knowledge Transfer Learning for 3D Point Cloud UnderstandingGuanqun Ding, Nevrez Imamoglu, Ali Caglayan, Masahiro Murakawa, Ryosuke Nakamura. 584 [doi]
- DUDA: Online-Offline Dual Domain Adaption for Semantic SegmentationAn-tao Pan, Yawei Luo, Yi Yang 0001, Jun Xiao 0001. 585 [doi]
- Towards Unified Multi-Excitation for Unsupervised Video PredictionJunyan Wang, Likun Qin, Peng Zhang 0058, Yang Long 0001, BingZhang Hu, Maurice Pagnucco, Shizheng Wang, Yang Song 0001. 587 [doi]
- Class-Prototypes for Contrastive Learning in Weakly-Supervised 3D Point Cloud SegmentationRong Li, Anh-Quan Cao, Raoul de Charette. 589 [doi]
- Semantics-Adding Flaw-Erasing Network for Semantic Human MattingJiayu Sun, Zhanghan Ke, Ke Xu 0010, Fan Shao, Lihe Zhang, Huchuan Lu, Rynson W. H. Lau. 592 [doi]
- clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIPJustin N. M. Pinkney, Chuan Li. 594 [doi]
- Polishing Network for Decoding of Higher-Quality Diverse Image CaptionsYue Zheng, Ya-Li Li 0001, Shengjin Wang. 601 [doi]
- HDR Reconstruction from Bracketed Exposures and EventsRichard Shaw, Sibi Catley-Chandar, Ales Leonardis, Eduardo Pérez-Pellitero. 603 [doi]
- Improving Interpretability by Information Bottleneck Saliency Guided LocalizationHao Zhou, Keyang Cheng, Yu Si, Liuyang Yan. 605 [doi]
- Distilling and Refining Domain-Specific Knowledge for Semi-Supervised Domain AdaptationJu-Hyun Kim, Ba-Hung Ngo, Jae-Hyeon Park, Jung Eun Kwon, Ho Sub Lee, Sung In Cho. 606 [doi]
- GLAMI-1M: A Multilingual Image-Text Fashion DatasetVaclav Kosar, Antonín Hoskovec, Milan Sulc, Radek Bartyzal. 607 [doi]
- Weakly-supervised Fingerspelling Recognition in British Sign Language VideosK. R. Prajwal, Hannah Bull, Liliane Momeni, Samuel Albanie, Gül Varol, Andrew Zisserman. 609 [doi]
- VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose EstimationMegh Shukla, Roshan Roy, Pankaj Singh, Shuaib Ahmed, Alexandre Alahi. 610 [doi]
- Part-based Face Recognition with Vision TransformersZhonglin Sun, Georgios Tzimiropoulos. 611 [doi]
- Layer Folding: Neural Network Depth Reduction using Activation LinearizationAmir Ben Dror, Niv Zehngut, Avraham Raviv, Evgeny Artyomov, Ran Vitek. 612 [doi]
- Wide Feature Projection with Fast and Memory-Economic Attention for Efficient Image Super-ResolutionMinghao Fu, Dongyang Zhang, Min Lei, Kun He, Changyu Li, Jie Shao. 615 [doi]
- FoGMesh: 3D Human Mesh Recovery in Videos with Focal Transformer and GRUYihao He, Xiaoning Song, Tianyang Xu, Yang Hua, Xiao-Jun Wu 0001. 618 [doi]
- AssocFormer: Association Transformer for Multi-label ClassificationXin-xing, Chong Peng, Yu Zhang 0094, Ai-Ling Lin, Nathan Jacobs. 619 [doi]
- Turbo Training with Token DropoutTengda Han, Weidi Xie, Andrew Zisserman. 622 [doi]
- FIND: An Unsupervised Implicit 3D Model of Articulated Human FeetOliver Boyne, James Charles, Roberto Cipolla. 630 [doi]
- D-STEP: Dynamic Spatio-Temporal PruningAvraham Raviv, Yonatan Dinai, Igor Drozdov, Niv Zehngut, Ishay Goldin. 632 [doi]
- Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and GenerationMohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi. 636 [doi]
- Personalised CLIP or: how to find your vacation videosBruno Korbar, Andrew Zisserman. 639 [doi]
- Consistency-CAM: Towards Improved Weakly Supervised Semantic SegmentationSai Rajeswar, Issam Hadj Laradji, Pau Rodríguez, David Vázquez 0001, Aaron C. Courville. 644 [doi]
- Scaling up Instance Segmentation using Approximately Localized PhrasesKaran Desai, Ishan Misra, Justin Johnson 0001, Laurens van der Maaten. 648 [doi]
- Partially-Supervised Novel Object Captioning Using Context from Paired DataShashank Bujimalla, Mahesh Subedar, Omesh Tickoo. 649 [doi]
- Self-Improving SLAM in Dynamic Environments: Learning When to MaskAdrian Bojko, Romain Dupont, Mohamed Tamaazousti, Hervé Le Borgne. 654 [doi]
- Two-Stream Transformer Architecture for Long Form Video UnderstandingEdward Fish, Jon Weinbren, Andrew Gilbert. 660 [doi]
- Attention Distillation: self-supervised vision transformer students need more guidanceKai Wang, Fei Yang 0004, Joost van de Weijer 0001. 666 [doi]
- A Closer Look at Temporal Ordering in the Segmentation of Instructional VideosAnil Batra, Shreyank N. Gowda, Frank Keller, Laura Sevilla-Lara. 669 [doi]
- MoBYv2AL: Self-supervised Active Learning for Image ClassificationRazvan Caramalau, Binod Bhattarai, Danail Stoyanov, Tae-Kyun Kim. 674 [doi]
- Shifting Transformation Learning for Robust Out-of-Distribution DetectionSina Mohseni, Arash Vahdat, Jay Yadawa. 679 [doi]
- Imagining Hidden Supporting Objects using Volumetric Conditional GANs and Differentiable Stability ScoresHector Basevi, Ales Leonardis. 682 [doi]
- Towards Device Efficient Conditional Image GenerationNisarg A. Shah, Gaurav Bharaj. 689 [doi]
- Rethinking Group Fisher Pruning for Efficient Label-Free Network CompressionJong-Ryul Lee, Yong-Hyuk Moon. 693 [doi]
- Improving Dense Representation Learning by Superpixelization and Contrasting Cluster AssignmentRobin Karlsson, Tomoki Hayashi, Keisuke Fujii 0001, Alexander Carballo, Kento Ohtani, Kazuya Takeda. 699 [doi]
- LIIF-GAN: Learning Representation With Local Implicit Image Function and GAN for Realistic Images on a Continuous ScaleJun Seok Kang, Sang Chul Ahn. 703 [doi]
- Multi-task Curriculum Learning based on Gradient SimilarityHiroaki Igarashi, Kenichi Yoneji, Kohta Ishikawa, Rei Kawakami, Teppei Suzuki, Shingo Yashima, Ikuro Sato. 705 [doi]
- VoRF: Volumetric Relightable FacesPramod Rao, Mallikarjun B. R. 0001, Gereon Fox, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Ayush Tewari, Christian Theobalt, Mohamed Elgharib. 708 [doi]
- Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive LearningShuaicheng Li, Feng Zhang, Kunlin Yang, Lingbo Liu, Shinan Liu, Jun Hou, Shuai Yi. 709 [doi]
- AISFormer: Amodal Instance Segmentation with TransformerMinh Q. Tran, Khoa Vo 0001, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le. 712 [doi]
- One-shot Network Pruning at Initialization with Discriminative Image PatchesYinan Yang 0001, Yu Wang 0018, Ying Ji, Heng Qi, Jien Kato. 715 [doi]
- Spatio-temporal tendency reasoning for human body pose and shape estimation from videosBoyang Zhang, Suping Wu, Hu Cao, Kehua Ma, Pan Li, Lei Lin. 719 [doi]
- PPL: Pairwise Prototype Learning for Masked Face RecognitionMinsoo Kim, Gi Pyo Nam, Yu-Jin Hong, Ig-Jae Kim. 723 [doi]
- Dual consistency assisted multi-confident learning for the hepatic vessel segmentation using noisy labelsNam Nguyen Phuong, Tuan Van Vo, Soan T. M. Duong, Chanh D. Tr. Nguyen, Trung Bui, Steven Quoc Hung Truong. 725 [doi]
- Memory-Driven Text-to-Image GenerationBowen Li 0001, Philip H. S. Torr, Thomas Lukasiewicz. 726 [doi]
- Handling Class-Imbalance for Improved Zero-Shot Domain GeneralizationAhmad Arfeen, Titir Dutta, Soma Biswas. 728 [doi]
- Improving Local Features with Relevant Spatial Information by Vision Transformer for Crowd CountingNguyen Hoang Tran, Ta Duc Huy, Soan T. M. Duong, Nguyen-Phan, Dao Huu Hung, Chanh D. Tr. Nguyen, Trung Bui, Steven Quoc Hung Truong. 729 [doi]
- How to Train Vision Transformer on Small-scale Datasets?Hanan Gani, Muzammal Naseer, Mohammad Yaqub. 731 [doi]
- Revisiting single-gated Mixtures of ExpertsAmelie Royer, Ilia Karmanov, Andrii Skliar, Babak Ehteshami Bejnordi, Tijmen Blankevoort. 736 [doi]
- PAUMER: Patch Pausing Transformer for Semantic SegmentationEvann Courdier, Prabhu Teja Sivaprasad, François Fleuret. 737 [doi]
- ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance FieldsOctave Mariotti, Oisin Mac Aodha, Hakan Bilen. 740 [doi]
- Low Light Video Enhancement by Learning on Static Videos with Cross-Frame AttentionShivam Chhirolya, Sameer Malik, Rajiv Soundararajan. 743 [doi]
- Explorable Data Consistent CT ReconstructionHannah Dröge, Yuval Bahat, Felix Heide, Michael Moeller 0001. 746 [doi]
- Siamese U-Net for Image Anomaly Detection and Segmentation with Contrastive LearningChia-Ying Lin, Shang-Hong Lai. 752 [doi]
- Face Pyramid Vision TransformerKhawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood. 758 [doi]
- Scale-Equivariant U-NetMateus Sangalli, Samy Blusseau, Santiago Velasco-Forero, Jesús Angulo. 763 [doi]
- Dual Space Multiple Instance Representative Learning for Medical Image ClassificationXiaoxian Zhang, Sheng Huang 0001, Yi Zhang, Xiaohong Zhang 0002, Mingchen Gao, Chen Liu 0026. 768 [doi]
- Parallel and Robust Text Rectifier for Scene Text RecognitionBingcong Li, Xin Tang, Jun Wang, Liang Diao, Rui Fang, Guotong Xie, Weifu Chen. 770 [doi]
- Visual-Semantic Transformer for Scene Text RecognitionLiang Diao, Xin Tang, Jun Wang, Rui Fang, Guotong Xie, Weifu Chen. 772 [doi]
- Anatomically constrained CT image translation for heterogeneous blood vessel segmentationGiammarco La Barbera, Haithem Boussaid, Francesco Maso, Sabine Sarnacki, Laurence Rouet, Pietro Gori, Isabelle Bloch. 776 [doi]
- Robust Target Training for Multi-Source Domain AdaptationZhongying Deng, Da Li 0001, Yi-Zhe Song, Tao Xiang. 778 [doi]
- Morphological Network: How Far Can We Go with Morphological Neurons?Ranjan Mondal, Sanchayan Santra, Soumendu Sundar Mukherjee, Bhabatosh Chanda. 779 [doi]
- XDGAN: Multi-Modal 3D Shape Generation in 2D SpaceHassan Abu Alhaija, Alara Dirik, André Knörig, Sanja Fidler, Maria Shugrina. 782 [doi]
- Self-Supervised Learning of Inlier Events for Event-based Optical FlowJun Nagata, Yoshimitsu Aoki. 785 [doi]
- Universal Perturbation Attack on Differentiable No-Reference Image- and Video-Quality MetricsEkaterina Shumitskaya, Anastasia Antsiferova, Dmitriy S. Vatolin. 790 [doi]
- Structured Spatial Reasoning for Human Pose EstimationYing Huang 0003, Shanfeng Hu, Zi-Ke Zhang. 797 [doi]
- Knowledge Diversification in Ensembles of Identical Neural NetworksBishshoy Das, Sumantra Dutta Roy. 798 [doi]
- Re-examining Distillation for Continual Object DetectionEli Verwimp, Kuo Yang, Sarah Parisot, Lanqing Hong, Steven McDonagh, Eduardo Pérez-Pellitero, Matthias De Lange, Tinne Tuytelaars. 807 [doi]
- Search for Concepts: Learning Visual Concepts Using Direct OptimizationPradyumna Reddy, Paul Guerrero, Niloy J. Mitra. 810 [doi]
- AVisT: A Benchmark for Visual Object Tracking in Adverse VisibilityMubashir Noman, Wafa Al Ghallabi, Daniya Kareem, Christoph Mayer 0007, Akshay Dudhane, Martin Danelljan, Hisham Cholakkal, Salman Khan 0001, Luc Van Gool, Fahad Shahbaz Khan. 817 [doi]
- $S^2$-Flow: Joint Semantic and Style Editing of Facial ImagesKrishnakant Singh, Simone Schaub-Meyer, Stefan Roth 0001. 821 [doi]
- Efficient Feature Extraction for High-resolution Video Frame InterpolationMoritz Nottebaum, Stefan Roth 0001, Simone Schaub-Meyer. 825 [doi]
- HiFECap: Monocular High-Fidelity and Expressive Capture of Human PerformancesYue Jiang, Marc Habermann, Vladislav Golyanik, Christian Theobalt. 826 [doi]
- Pseudo-Label Noise Suppression Techniques for Semi-Supervised Semantic SegmentationSebastian A. Scherer, Robin Schön, Rainer Lienhart. 829 [doi]
- Analysis of Training Object Detection Models with Synthetic DataBram Vanherle, Steven Moonen, Frank Van Reeth, Nick Michiels. 833 [doi]
- Unifying the Visual Perception of Humans and Machines on Fine-Grained Texture SimilarityWeibo Wang, Xinghui Dong. 839 [doi]
- A Cascade Dense Connection Fusion Network for Depth CompletionRizhao Fan, Zhigen Li, Matteo Poggi, Stefano Mattoccia. 843 [doi]
- Correlation between Alignment-Uniformity and Performance of Dense Contrastive RepresentationsJong Hak Moon, Wonjae Kim, Edward Choi. 844 [doi]
- Wide-Range MRI Artifact Removal with TransformersLennart Alexander Van der Goten, Kevin Smith. 846 [doi]
- Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained ClassificationZhenxin Wu, Qingliang Chen, Yongjian Huang. 847 [doi]
- Animal Pose Refinement in 2D Images with 3D ConstraintsXiaowei Dai, Shuiwang Li, Qijun Zhao, Hongyu Yang. 848 [doi]
- G-CMP: Graph-enhanced Contextual Matrix Profile for unsupervised anomaly detection in sensor-based remote health monitoringNivedita Bijlani, Oscar Mendez Maldonado, Samaneh Kouchaki. 854 [doi]
- Global Filter Pruning with Self-Attention for Real-Time UAV TrackingMengyuan Liu, Yuelong Wang, Qiangyu Sun, Shuiwang Li. 861 [doi]
- Self-adversarial Multi-scale Contrastive Learning for Semantic Segmentation of Thermal Facial ImagesJitesh Joshi, Nadia Berthouze, Youngjun Cho. 864 [doi]
- Prior-Aware Synthetic Data to the Rescue: Animal Pose Estimation with Very Limited Real DataLe Jiang, Shuangjun Liu, Xiangyu Bai, Sarah Ostadabbas. 868 [doi]
- Adaptive-TTA: accuracy-consistent weighted test time augmentation method for the uncertainty calibration of deep learning classifiersPedro Conde, Cristiano Premebida. 869 [doi]
- Dual-Curriculum Teacher for Domain-Inconsistent Object Detection in Autonomous DrivingLonghui Yu, Yifan Zhang, Lanqing Hong, Fei Chen 0013, Zhenguo Li. 872 [doi]
- Adaptive Task Sampling and Variance Reduction for Gradient-Based Meta-LearningZhuoqun Liu, Yuankun Jiang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong. 876 [doi]
- Anatomy-Aware Self-Supervised Learning for Aligned Multi-Modal Medical DataHongyu Hu, Tiancheng Lin 0001, Yuanfan Guo, Chunxiao Li, Rong Wu, Yi Xu. 877 [doi]
- Multi-Scale Adversarial Learning and Difficult Supervision for Kidney and Kidney Tumor SegmentationShenhai Zheng, Qiuyu Sun, Xin Ye, Weisheng Li, Laquan Li. 879 [doi]
- Estimating water turbidity from a smartphone cameraLina M. Lozano Wilches, Chotiwat Jantarakasem, Laure Sioné, Michael Templeton, Krystian Mikolajczyk. 880 [doi]
- Performance Limiting Factors of Deep Neural Networks for Pedestrian DetectionYasin Bayzidi, Alen Smajic, Jan David Schneider, Fabian Hüger, Ruby Moritz, Alois C. Knoll. 883 [doi]
- SVS: Adversarial refinement for sparse novel view synthesisVioleta Menéndez González, Andrew Gilbert, Graeme Phillipson, Stephen Jolly, Simon Hadfield. 886 [doi]
- Learning to Augment via Implicit Differentiation for Domain GeneralizationTingwei Wang, Da Li 0001, Kaiyang Zhou, Tao Xiang, Yi-Zhe Song. 888 [doi]
- Efficient Self-Ensemble for Semantic SegmentationWalid Bousselham, Guillaume Thibault, Lucas Pagano, Archana Machireddy, Joe W. Gray, Young-Hwan Chang, Xubo Song. 892 [doi]
- Copy-Pasting Coherent Depth Regions Improves Contrastive Learning for Urban-Scene SegmentationLiang Zeng, Attila Lengyel 0001, Nergis Tomen, Jan C. van Gemert. 893 [doi]
- Class-Balanced Loss Based on Class Volume for Long-Tailed Object RecognitionZhijian Zheng, Teck Khim Ng. 896 [doi]
- CASAPose: Class-Adaptive and Semantic-Aware Multi-Object Pose EstimationNiklas Gard, Anna Hilsmann, Peter Eisert. 899 [doi]
- Revisiting Deep Fisher Vectors: Using Fisher Information to Improve Object ClassificationSarah Ahmed, Tayyaba Azim, Joseph Early, Sarvapali D. Ramchurn. 900 [doi]
- Maximizing Mutual Shape InformationMd. Amirul Islam, Matthew Kowal, Patrick Esser, Björn Ommer, Konstantinos G. Derpanis, Neil D. B. Bruce. 909 [doi]
- DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object DetectionZiyuan Zhao, Mingxi Xu, Peisheng Qian, Ramanpreet Singh Pahwa, Richard Chang 0002. 916 [doi]
- Global Contextual Complementary Network for Multi-View StereoYongrong Cao, Suping Wu, Xing Zheng, Bin Wang, Pan Li, Zhixiang Yuan, Lei Lin, Yuxin Peng. 919 [doi]
- Towards a more efficient few-shot learning-based human gesture recognition via dynamic vision sensorsLinglin Jing, Yifan Wang 0008, Tailin Chen, Shirin Dora, Zhigang Ji, Hui Fang 0003. 938 [doi]
- FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding TasksSantiago Castro, Fabian Caba. 939 [doi]
- GLPose: Global-Local Attention Network with Feature Interpolation Regularization for Head Pose Estimation of People Wearing Facial MasksHsueh-Wei Chen, Yi Chen, Pei-Yung Hsiao, Li-Chen Fu, Zirong Ding. 946 [doi]
- Global Proxy-based Hard Mining for Visual Place RecognitionAmar Ali-bey, Brahim Chaib-draa, Philippe Giguère. 958 [doi]
- BIO-CC: Biologically inspired color constancyOguzhan Ulucan, Diclehan Ulucan, Marc Ebner. 960 [doi]
- Dual Moving Average Pseudo-Labeling for Source-Free Inductive Domain AdaptationHao Yan, Yuhong Guo. 965 [doi]
- Multi-Task Edge Prediction in Temporally-Dynamic Video GraphsOsman Ülger, Julian Wiederer, Mohsen Ghafoorian, Vasileios Belagiannis, Pascal Mettes. 968 [doi]
- Reading Chinese in Natural Scenes with a Bag-of-Radicals PriorYongbin Liu, Qingjie Liu, Jiaxin Chen, Yunhong Wang. 969 [doi]
- Quantitative Metrics for Evaluating Explanations of Video DeepFake DetectorsFederico Baldassarre, Quentin Debard, Gonzalo Fiz Pontiveros, Tri Kurniawan Wijaya. 972 [doi]
- Distilling Knowledge from Self-Supervised Teacher by Embedding Graph AlignmentYuchen Ma, Yanbei Chen, Zeynep Akata. 973 [doi]
- Contrastive Learning for Controllable Blind Video RestorationGivi Meishvili, Abdelaziz Djelouah, Shinobu Hattori, Christopher Schroers. 974 [doi]
- Semantic Segmentation under Adverse Conditions: A Weather and Nighttime-aware Synthetic Data-based ApproachAbdulrahman Kerim, Felipe C. Chamone, Washington L. S. Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang 0001. 977 [doi]
- Adapting branched networks to realise progressive intelligenceJack Dymond, Sebastian Stein 0001, Steve R. Gunn. 990 [doi]
- PatchSwap: A Regularization Technique for Vision TransformersSachin Chhabra, Hemanth Venkateswara, Baoxin Li. 996 [doi]
- Adversarial Vision Transformer for Medical Image Semantic Segmentation with Limited AnnotationsZiyang Wang, Will Zhao, Zixuan Ni, Yuchen Zheng. 1002 [doi]
- Domain Adaptation for the Segmentation of Confidential Medical ImagesSerban Stan, Mohammad Rostami. 1007 [doi]
- Analysing Training-Data Leakage from Gradients through Linear Systems and Gradient MatchingCangxiong Chen, Neill D. F. Campbell. 1009 [doi]
- Overcoming Catastrophic Forgetting for Continual Learning via Feature PropagationXuejun Han, Yuhong Guo. 1011 [doi]
- Group Graph Convolutional Networks for 3D Human Pose EstimationZijian Zhang. 1019 [doi]
- Hugs Are Better Than Handshakes: Unsupervised Cross-Modal Transformer Hashing with Multi-granularity AlignmentJinpeng Wang, Ziyun Zeng, Bin Chen 0011, YuTing Wang, Dongliang Liao, Gongfu Li, Yiru Wang, Shu-Tao Xia. 1035 [doi]
- Program Generation from Diverse Video DemonstrationsAnthony Manchin, Jamie Sherrah, Qi Wu 0001, Anton van den Hengel. 1039 [doi]
- Data Augmentation-free Unsupervised Learning for 3D Point Cloud UnderstandingGuofeng Mei, Cristiano Saltori, Fabio Poiesi, Jian Zhang, Elisa Ricci 0001, Nicu Sebe, Qiang Wu. 1049 [doi]
- Mutual Conditional Probability for Self-Supervised LearningTakumi Kobayashi 0001. 1052 [doi]
- COAT: Correspondence-driven Object Appearance TransferSangryul Jeon, Zhifei Zhang, Zhe Lin 0001, Scott Cohen, Zhihong Ding, Kwanghoon Sohn. 1053 [doi]
- Anatomical prior-inspired label refinement for weakly supervised liver tumor segmentation with volume-level labelsFei Lyu 0004, Andy J. Ma, Pong Chi Yuen. 1054 [doi]
- Continuous Hand Gesture Recognition using Deep Coarse and Fine Hand FeaturesHazem Wannous, Jean-Philippe Vandeborre. 1055 [doi]
- Dense Contrastive Loss for Instance SegmentationHang Chen, Chufeng Tang, Xiaolin Hu 0001. 1062 [doi]
- Joint Reconstruction and Super Resolution of Hyper-Spectral CTIS ImagesMazen Mel, Alexander Gatto, Pietro Zanuttigh. 1063 [doi]
- Mutual Contrastive Low-rank Learning to Disentangle Whole Slide Image Representations for Glioma GradingLipei Zhang, Yiran Wei 0002, Ying Fu 0001, Stephen J. Price, Carola-Bibiane Schönlieb, Chao Li 0031. 1071 [doi]
- Sampling Based On Natural Image Statistics Improves Local Surrogate ExplainersRicardo Kleinlein, Alexander Hepburn, Raúl Santos-Rodríguez, Fernando Fernández-Martínez. 1083 [doi]