Abstract is missing.
- CNN2Graph: Building Graphs for Image ClassificationVivek Trivedy, Longin Jan Latecki. 1-11 [doi]
- Token Pooling in Vision Transformers for Image ClassificationDmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel. 12-21 [doi]
- 2F2WOD: Learning Object Proposals for Weakly-Supervised Object Detection via Progressive Domain AdaptationYuting Wang 0004, Ricardo Guerrero, Vladimir Pavlovic 0001. 22-31 [doi]
- ML-Decoder: Scalable and Versatile Classification HeadTal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben Baruch, Asaf Noy. 32-41 [doi]
- Large-Scale Open-Set Classification Protocols for ImageNetAndres Palechor, Annesha Bhoumik, Manuel Günther. 42-51 [doi]
- Composite Relationship Fields with Transformers for Scene Graph GenerationGeorge Adaimi, David Mizrahi, Alexandre Alahi. 52-64 [doi]
- CoKe: Contrastive Learning for Robust Keypoint DetectionYutong Bai, Angtian Wang, Adam Kortylewski, Alan L. Yuille. 65-74 [doi]
- Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient?Quentin Bouniot, Angélique Loesch, Amaury Habrard, Romaric Audigier. 75-84 [doi]
- Scaling Novel Object Detection with Weakly Supervised Detection TransformersTyler Labonte, Yale Song, Xin Wang, Vibhav Vineet, Neel Joshi. 85-96 [doi]
- Dense Prediction with Attentive Feature AggregationYung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bulò, Peter Kontschieder, Fisher Yu. 97-106 [doi]
- Boosting vision transformers for image retrievalChull Hwan Song, Jooyoung Yoon, Shunghyun Choi, Yannis Avrithis. 107-117 [doi]
- Is your noise correction noisy? PLS: Robustness to label noise with two stage detectionPaul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness. 118-127 [doi]
- Two-level Data Augmentation for Calibrated Multi-view DetectionMartin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua. 128-136 [doi]
- TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained VideosSoufiane Belharbi, Ismail Ben Ayed, Luke McCaffrey, Eric Granger. 137-146 [doi]
- LAVA:Label-efficient Visual Learning and AdaptationIslam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari. 147-156 [doi]
- GEMS: Scene Expansion using Generative Models of GraphsRishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay. 157-166 [doi]
- Dynamic Mixture of Counter Network for Location-Agnostic Crowd CountingMingjie Wang, Hao Cai, Yong Dai, Minglun Gong. 167-177 [doi]
- Simultaneous Acquisition of High Quality RGB Image and Polarization Information using a Sparse Polarization SensorTeppei Kurita, Yuhi Kondo, Legong Sun, Yusuke Moriuchi. 178-188 [doi]
- DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style EditingsBingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi. 189-197 [doi]
- Lossy Image Compression with Quantized Hierarchical VAEsZhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu 0001. 198-207 [doi]
- Keys to Better Image Inpainting: Structure and Texture Go Hand in HandJitesh Jain, YuQian Zhou, Ning Yu, Humphrey Shi. 208-217 [doi]
- Frame Interpolation for Dynamic Scenes with Implicit Flow EncodingPedro Figueirêdo, Avinash Paliwal, Nima Khademi Kalantari. 218-228 [doi]
- I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field ImagesJiwan Hur, Jae Young Lee, Jaehyun Choi, Junmo Kim. 229-238 [doi]
- Burst Reflection Removal using Reflection Motion Aggregation CuesB. H. Pawan Prasad, Green Rosh K. S, R. B. Lokesh, Kaushik Mitra. 239-248 [doi]
- Line Search-Based Feature Transformation for Fast, Stable, and Tunable Content-Style Control in Photorealistic Style TransferTai-Yin Chiu, Danna Gurari. 249-258 [doi]
- Panoptic-aware Image-to-Image TranslationLiyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura. 259-268 [doi]
- SimGlim: Simplifying glimpse based active visual reconstructionAbhishek Jha, Soroush Seifi, Tinne Tuytelaars. 269-278 [doi]
- Evaluating generative networks using Gaussian mixtures of image featuresLorenzo Luzi, Carlos Ortiz Marrero, Nile Wynar, Richard G. Baraniuk, Michael J. Henry. 279-288 [doi]
- More Control for Free! Image Synthesis with Semantic Diffusion GuidanceXihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell. 289-299 [doi]
- Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven KeyframesJames F. Mullen Jr., Divya Kothandaraman, Aniket Bera, Dinesh Manocha. 300-310 [doi]
- Surface normal estimation from optimized and distributed light sources using DNN-based photometric stereoTakafumi Iwaguchi, Hiroshi Kawasaki. 311-320 [doi]
- Interpolated SelectionConv for Spherical Images and SurfacesDavid Hart, Michael Whitney, Bryan S. Morse. 321-330 [doi]
- RAST: Restorable Arbitrary Style Transfer via Multi-restorationYingnan Ma, Chenqiu Zhao, Anup Basu, Xudong Li. 331-340 [doi]
- On Quantizing Implicit Neural RepresentationsCameron Gordon, Shin-Fang Chng, Lachlan E. MacDonald, Simon Lucey. 341-350 [doi]
- Lightweight Video Denoising using Aggregated Shifted Window AttentionLydia Lindner, Alexander Effland, Filip Ilic, Thomas Pock, Erich Kobler. 351-360 [doi]
- Federated Domain Generalization for Image Recognition via Cross-Client Style TransferJunming Chen, Meirui Jiang, Qi Dou 0001, Qifeng Chen. 361-370 [doi]
- CTrGAN: Cycle Transformers GAN for Gait TransferShahar Mahpod, Noam Gaash, Hay Hoffman, Gil Ben-Artzi. 371-381 [doi]
- SALAD : Source-free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and DetectionDivya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha. 382-391 [doi]
- Backprop Induced Feature Weighting for Adversarial Domain Adaptation with Iterative Label Distribution AlignmentThomas Westfechtel, Hao-Wei Yeh, Qier Meng, Yusuke Mukuta, Tatsuya Harada. 392-401 [doi]
- Semi-Supervised Domain Adaptation with Auto-Encoder via Simultaneous LearningMd Mahmudur Rahman 0005, Rameswar Panda, Mohammad Arif Ul Alam. 402-411 [doi]
- Cross-identity Video Motion Retargeting with Joint Transformation and SynthesisHaomiao Ni, Yihao Liu 0003, Sharon X. Huang, Yuan Xue 0002. 412-422 [doi]
- ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based MixingGiulio Mattolin, Luca Zanella, Elisa Ricci 0001, Yiming Wang 0002. 423-433 [doi]
- Improving Diversity with Adversarially Learned Transformations for Domain GeneralizationTejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou Yang. 434-443 [doi]
- Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated LearningDonald Shenaj, Eros Fanì, Marco Toldo, Debora Caldarola, Antonio Tavera, Umberto Michieli, Marco Ciccone, Pietro Zanuttigh, Barbara Caputo. 444-454 [doi]
- CellTranspose: Few-shot Domain Adaptation for Cellular Instance SegmentationMatthew R. Keaton, Ram J. Zaveri, Gianfranco Doretto. 455-466 [doi]
- CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head RedirectionSwati Jindal, Xin Eric Wang. 467-477 [doi]
- Towards Online Domain Adaptive Object DetectionVibashan VS, Poojan Oza, Vishal M. Patel. 478-488 [doi]
- Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object MixingKyusik Cho, Suhyeon Lee 0002, Hongje Seong, Euntai Kim. 489-498 [doi]
- Empirical Generalization Study: Unsupervised Domain Adaptation vs. Domain Generalization Methods for Semantic Segmentation in the WildFabrizio J. Piva, Daan de Geus, Gijs Dubbelman. 499-508 [doi]
- Intra-Source Style Augmentation for Improved Domain GeneralizationYumeng Li, Dan Zhang 0017, Margret Keuper, Anna Khoreva. 509-519 [doi]
- TVT: Transferable Vision Transformer for Unsupervised Domain AdaptationJinyu Yang, Jingjing Liu, Ning Xu, JunZhou Huang. 520-530 [doi]
- Learning Classifiers of Prototypes and Reciprocal Points for Universal Domain AdaptationSungsu Hur, Inkyu Shin, KwanYong Park, Sanghyun Woo, In-So Kweon. 531-540 [doi]
- Auxiliary Task-Guided CycleGAN for Black-Box Model Domain AdaptationMichael Essich, Markus Rehmann, Cristóbal Curio. 541-550 [doi]
- NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point CloudsWeiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi. 551-560 [doi]
- Mobile Robot Manipulation using Pure Object DetectionBrent Griffin. 561-571 [doi]
- SGPCR: Spherical Gaussian Point Cloud Representation and its Application to Object Registration and RetrievalDriton Salihu, Eckehard G. Steinbach. 572-581 [doi]
- GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic SegmentationMin-Seok Lee, Seok Woo Yang, Sung Won Han. 582-591 [doi]
- PointInverter: Point Cloud Reconstruction and Editing via a Generative Model with Shape PriorsJaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai Kit Yeung. 592-601 [doi]
- 3D-SpLineNet: 3D Traffic Line Detection using Parametric Spline RepresentationsMaximilian Pittner, Alexandru Condurache, Joel Janai. 602-611 [doi]
- Domain Adaptive Object Detection for Autonomous Driving under Foggy WeatherJinlong Li, Runsheng Xu, Jin Ma, Qin Zou 0001, Jiaqi Ma, Hongkai Yu. 612-622 [doi]
- SAILOR: Scaling Anchors via Insights into Latent Object RepresentationDusan Malic, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof. 623-632 [doi]
- Class-Level Confidence Based 3D Semi-Supervised LearningZhimin Chen, Longlong Jing, Liang Yang, Yingwei Li, Bing Li 0008. 633-642 [doi]
- MonoEdge: Monocular 3D Object Detection Using Local PerspectivesMinghan Zhu, Lingting Ge, Panqu Wang, Huei Peng. 643-652 [doi]
- Cross-Modality Feature Fusion Network for Few-Shot 3D Point Cloud ClassificationMinmin Yang, Jiajing Chen, Senem Velipasalar. 653-662 [doi]
- Dense Voxel Fusion for 3D Object DetectionAnas Mahmoud, Jordan S. K. Hu, Steven L. Waslander. 663-672 [doi]
- Real-time Concealed Weapon Detection on 3D Radar Images for Walk-through Screening SystemNagma S. Khan, Kazumine Ogura, Eric Cosatto, Masayuki Ariyoshi. 673-681 [doi]
- Resolving Class Imbalance for LiDAR-based Object Detector by Dynamic Weight Average and Contextual Ground Truth SamplingDaeun Lee, Jinkyu Kim. 682-691 [doi]
- Far3Det: Towards Far-Field 3D DetectionShubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong. 692-701 [doi]
- UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translationDmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren. 702-712 [doi]
- Splatting-based Synthesis for Video Frame InterpolationSimon Niklaus, Ping Hu, Jiawen Chen. 713-723 [doi]
- CG-NeRF: Conditional Generative Neural Radiance Fields for 3D-aware Image SynthesisKyungmin Jo, Gyumin Shim, Sanghun Jung, Soyoung Yang, Jaegul Choo. 724-733 [doi]
- Spatially Multi-conditional Image GenerationNikola Popovic, Ritika Chakraborty, Danda Pani Paudel, Thomas Probst, Luc Van Gool. 734-743 [doi]
- WHFL: Wavelet-Domain High Frequency Loss for Sketch-to-Image TranslationMin Woo Kim, Nam Ik Cho. 744-754 [doi]
- DDNeRF: Depth Distribution Neural Radiance FieldsDavid Dadon, Ohad Fried, Yacov Hel-Or. 755-763 [doi]
- Multi-scale Contrastive Learning for Complex Scene GenerationHanbit Lee, Youna Kim, Sang-goo Lee. 764-774 [doi]
- SIRA: Relightable Avatars from a Single ImagePol Caselles, Eduard Ramon, Jaime Garcia Giraldez, Xavier Giró i Nieto, Francesc Moreno-Noguer, Gil Triginer. 775-784 [doi]
- Learning Graph Variational Autoencoders with Constraints and Structured Priors for Conditional Indoor 3D Scene GenerationAditya Chattopadhyay, Xi Zhang, David Paul Wipf, Himanshu Arora, René Vidal. 785-794 [doi]
- Beyond RGB: Scene-Property Synthesis with Neural Radiance FieldsMingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang. 795-805 [doi]
- Vision Transformer for NeRF-Based View Synthesis from a Single Input ImageKai-En Lin, Yen-Chen Lin, Wei-Sheng Lai, Tsung-Yi Lin, Yi-Chang Shih, Ravi Ramamoorthi. 806-815 [doi]
- ScanNeRF: a Scalable Benchmark for Neural Radiance FieldsLuca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi di Stefano. 816-825 [doi]
- Controllable 3D Generative Adversarial Face Model via Disentangling Shape and AppearanceFariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando De la Torre, Steven Song, Aayush Prakash, Daeil Kim. 826-836 [doi]
- Ev-NeRF: Event Based Neural Radiance FieldInwoo Hwang, Junho Kim, Young Min Kim 0001. 837-847 [doi]
- Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image ManipulationChaerin Kong, Dong Hyeon Jeon, Ohjoon Kwon, Nojun Kwak. 848-857 [doi]
- Creating a Forensic Database of Shoeprints from Online Shoe-Tread PhotosSamia Shafique, Bailey Kong, Shu Kong, Charless C. Fowlkes. 858-868 [doi]
- Can Shadows Reveal Biometric InformationƒSafa C. Medin, Amir Weiss, Frédo Durand, William T. Freeman, Gregory W. Wornell. 869-879 [doi]
- Patch-level Gaze Distribution Prediction for Gaze FollowingQiaomu Miao, Minh Hoai, Dimitris Samaras. 880-889 [doi]
- Searching Efficient Neural Architecture with Multi-resolution Fusion Transformer for Appearance-based Gaze EstimationVikrant Nagpure, Kenji Okuma. 890-899 [doi]
- DeformIrisNet: An Identity-Preserving Model of Iris Texture DeformationSiamul Karim Khan, Patrick J. Tinsley, Adam Czajka. 900-908 [doi]
- Gait Recognition Using 3-D Human Body Shape InferenceHaidong Zhu, Zhaoheng Zheng, Ram Nevatia. 909-918 [doi]
- CAST: Conditional Attribute Subsampling Toolkit for Fine-grained EvaluationWes Robbins, Steven Zhou, Aman Bhatta, Chad Mello, Vítor Albiero, Kevin W. Bowyer, Terrance E. Boult. 919-929 [doi]
- Physically Plausible Animation of Human Upper Body from a Single ImageZiyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu 0001, C. Karen Liu. 930-939 [doi]
- Online Adaptive Temporal Memory with Certainty Estimation for Human Trajectory PredictionManh Huynh, Gita Alaghband. 940-949 [doi]
- Context-empowered Visual Attention Prediction in Pedestrian ScenariosIgor Vozniak, Philipp Müller, Lorena Hell, Nils Lipp, Ahmed Abouelazm, Christian Müller. 950-960 [doi]
- Misclassifications of Contact Lens Iris PAD Algorithms: Is it Gender Bias or Environmental Conditions?Akshay Agarwal 0001, Nalini K. Ratha, Afzel Noore, Richa Singh 0001, Mayank Vatsa. 961-970 [doi]
- Synthetic Latent Fingerprint GeneratorAndré Brasil Vieira Wyzykowski, Anil K. Jain 0001. 971-980 [doi]
- UPAR: Unified Pedestrian Attribute Recognition and Person RetrievalAndreas Specker, Mickael Cormier, Jürgen Beyerer. 981-990 [doi]
- Segmentation-free Direct Iris Localization NetworksTakahiro Toizumi, Koichi Takahashi, Masato Tsukada. 991-1000 [doi]
- THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervisionAhmed Tawfik Aboukhadra, Jameel Malik, Ahmed Elhayek, Nadia Robertini, Didier Stricker. 1001-1010 [doi]
- Fashion Image Retrieval with Text Feedback by Additive Attention Compositional LearningYuxin Tian, Shawn D. Newsam, Kofi Boakye. 1011-1021 [doi]
- Cross-modal Semantic Enhanced Interaction for Image-Sentence RetrievalXuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Joemon M. Jose. 1022-1031 [doi]
- Text-Guided Object Detector for Multi-modal Video Question AnsweringRuoyue Shen, Nakamasa Inoue, Koichi Shinoda. 1032-1042 [doi]
- DRAMA: Joint Risk Localization and Captioning in DrivingSrikanth Malla, Chiho Choi, Isht Dwivedi, Joon Hee Choi, Jiachen Li 0001. 1043-1052 [doi]
- Interactive Image Manipulation with Complex Text InstructionsRyugo Morita, Zhiqiang Zhang, Man M. Ho, Jinjia Zhou. 1053-1062 [doi]
- InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for ImagesKonstantin Kobs, Michael Steininger, Andreas Hotho. 1063-1072 [doi]
- Learning by Hallucinating: Vision-Language Pre-training with Weak SupervisionTzu-Jui Julius Wang, Jorma Laaksonen, Tomas Langer, Heikki Arponen, Tom E. Bishop. 1073-1083 [doi]
- Barlow constrained optimization for Visual Question AnsweringAbhishek Jha, Badri N. Patro, Luc Van Gool, Tinne Tuytelaars. 1084-1093 [doi]
- A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location CuesJason Armitage, Leonardo Impett, Rico Sennrich. 1094-1103 [doi]
- Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language NavigationChia-Wen Kuo, Chih-Yao Ma, Judy Hoffman, Zsolt Kira. 1104-1113 [doi]
- Dense but Efficient VideoQA for Intricate Compositional ReasoningJihyeon Lee, Woo-Young Kang, Eun-Sol Kim. 1114-1123 [doi]
- Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement LearningUkyo Honda, Taro Watanabe, Yuji Matsumoto 0001. 1124-1134 [doi]
- NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal EmbeddingsBhavin Jawade, Deen Dayal Mohan, Naji Mohamed Ali, Srirangaraj Setlur, Venu Govindaraju. 1135-1144 [doi]
- Image-Text Pre-Training for Logo RecognitionMark Hubenthal, Suren Kumar. 1145-1154 [doi]
- VLC-BERT: Visual Question Answering with Contextualized Commonsense KnowledgeSahithya Ravi, Aditya Chinchure, Leonid Sigal, Renjie Liao, Vered Shwartz. 1155-1165 [doi]
- TVCalib: Camera Calibration for Sports Field Registration in SoccerJonas Theiner, Ralph Ewerth. 1166-1175 [doi]
- 3D Change Localization and Captioning from Dynamic Scans of Indoor ScenesYue Qiu 0001, Shintaro Yamamoto, Ryosuke Yamada, Ryota Suzuki 0006, Hirokatsu Kataoka, Kenji Iwata, Yutaka Satoh. 1176-1185 [doi]
- Adaptive Feature Fusion for Cooperative Perception using LiDAR Point CloudsDonghao Qiao, Farhana H. Zulkernine. 1186-1195 [doi]
- Centroid Distance Keypoint Detector for Colored Point CloudsHanzhe Teng, Dimitrios Chatziparaschis, Xinyue Kan, Amit K. Roy Chowdhury, Konstantinos Karydis. 1196-1205 [doi]
- PP4AV: A benchmarking Dataset for Privacy-preserving Autonomous DrivingLinh Trinh, Phuong Pham, Hoang Trinh, Nguyen Bach, Dung Nguyen, Giang Nguyen, Huy Nguyen. 1206-1215 [doi]
- Self-supervised Correspondence Estimation via Multiview RegistrationMohamed El Banani, Ignacio Rocco, David Novotný, Andrea Vedaldi, Natalia Neverova, Justin Johnson 0001, Benjamin Graham. 1216-1225 [doi]
- Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape ParsingJeonghyun Kim, Kaichun Mo, Minhyuk Sung, Woontack Woo. 1226-1235 [doi]
- Compressing Explicit Voxel Grid Representations: fast NeRFs become also smallChenxi Lola Deng, Enzo Tartaglione. 1236-1245 [doi]
- Nearest Neighbors Meet Deep Neural Networks for Point Cloud AnalysisRenrui Zhang, Liuhui Wang, Ziyu Guo, Jianbo Shi. 1246-1255 [doi]
- Generative Range Imaging for Learning Scene Priors of 3D LiDAR DataKazuto Nakashima, Yumi Iwashita, Ryo Kurazume. 1256-1266 [doi]
- Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictionsMarwane Hariat, Antoine Manzanera, David Filliat. 1267-1276 [doi]
- RSF: Optimizing Rigid Scene Flow From 3D Point Clouds Without LabelsDavid Deng, Avideh Zakhor. 1277-1286 [doi]
- Improving the Robustness of Point Convolution on k-Nearest Neighbor Neighborhoods with a Viewpoint-Invariant Coordinate TransformXingyi Li, Wenxuan Wu, Xiaoli Z. Fern, Fuxin Li. 1287-1297 [doi]
- : Joint Point Interaction-Dimension Search for 3D Point CloudTunhou Zhang, Mingyuan Ma, Feng Yan 0001, Hai Li 0001, Yiran Chen 0001. 1298-1307 [doi]
- Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial AttacksAbhishek Aich, Shasha Li, Chengyu Song, M. Salman Asif, Srikanth V. Krishnamurthy, Amit K. Roy Chowdhury. 1308-1318 [doi]
- Motif Mining: Finding and Summarizing Remixed Image ContentWilliam Theisen, Daniel Gonzalez Cedre, Zachariah Carmichael, Daniel Moreira, Tim Weninger, Walter J. Scheirer. 1319-1328 [doi]
- DeepPrivacy2: Towards Realistic Full-Body AnonymizationHåkon Hukkelås, Frank Lindseth. 1329-1338 [doi]
- A Continual Deepfake Detection Benchmark: Dataset, Methods, and EssentialsChuqiao Li, Zhiwu Huang, Danda Pani Paudel, Yabin Wang, Mohamad Shahbazi, Xiaopeng Hong, Luc Van Gool. 1339-1349 [doi]
- Task Agnostic and Post-hoc Unseen Distribution DetectionRadhika Dua, Seongjun Yang, Yixuan Li, Edward Choi. 1350-1359 [doi]
- Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models DifferentlyFuta Waseda, Sosuke Nishikawa, Trung-Nghia Le, Huy H. Nguyen, Isao Echizen. 1360-1368 [doi]
- My Face My Choice: Privacy Enhancing Deepfakes for Social Media AnonymizationUmur A. Ciftci, Gokturk Yuksek, Ilke Demir. 1369-1379 [doi]
- Do Adaptive Active Attacks Pose Greater Risk Than Static Attacks?Nathan Drenkow, Max Lennon, I-Jeng Wang, Philippe Burlina. 1380-1389 [doi]
- Neural Weight Search for Scalable Task Incremental LearningJian Jiang, Oya Çeliktutan. 1390-1399 [doi]
- Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture SearchThanh Vu, Yanqi Zhou, Chunfeng Wen, Yueqi Li, Jan-Michael Frahm. 1400-1410 [doi]
- Addressing Feature Suppression in Unsupervised Visual RepresentationsTianhong Li, Lijie Fan, Yuan Yuan 0002, Hao He 0011, Yonglong Tian, Rogério Feris, Piotr Indyk, Dina Katabi. 1411-1420 [doi]
- A Protocol for Evaluating Model Interpretation Methods from Visual ExplanationsHamed Behzadi Khormuji, José Oramas. 1421-1429 [doi]
- Realistic Full-Body Anonymization with Surface-Guided GANsHåkon Hukkelås, Morten Smebye, Rudolf Mester, Frank Lindseth. 1430-1440 [doi]
- Global-Local Self-Distillation for Visual Representation LearningTim Lebailly, Tinne Tuytelaars. 1441-1450 [doi]
- Large-to-small Image Resolution Asymmetry in Deep Metric LearningPavel Suma, Giorgos Tolias. 1451-1460 [doi]
- Learning How to MIMIC: Using Model Explanations to Guide Deep Learning TrainingMatthew Watson 0001, Bashar Awwad Shiekh Hasan, Noura Al Moubayed. 1461-1470 [doi]
- The Box Size Confidence Bias Harms Your Object DetectorJohannes Gilg, Torben Teepe, Fabian Herzog, Gerhard Rigoll. 1471-1480 [doi]
- ProtoSeg: Interpretable Semantic Segmentation with Prototypical PartsMikolaj Sacha, Dawid Rymarczyk, Lukasz Struski, Jacek Tabor, Bartosz Zielinski 0001. 1481-1492 [doi]
- FreeREA: Training-Free Evolution-based Architecture SearchNiccolò Cavagnero, Luca Robbiano, Barbara Caputo, Giuseppe Averta. 1493-1502 [doi]
- SVD-NAS: Coupling Low-Rank Approximation and Neural Architecture SearchZhewen Yu, Christos-Savvas Bouganis. 1503-1512 [doi]
- Visually explaining 3D-CNN predictions for video classification with an adaptive occlusion sensitivity analysisTomoki Uchiyama, Naoya Sogi, Koichiro Niinuma, Kazuhiro Fukui. 1513-1522 [doi]
- Orthogonal Transforms For Learning Invariant Representations In Equivariant Neural NetworksJaspreet Singh, Chandan Singh, Ankur Rana. 1523-1530 [doi]
- Representation Disentanglement in Generative Models with Contrastive LearningShentong Mo, Zhun Sun, Chao Li 0013. 1531-1540 [doi]
- Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data PruningRishabh Patra, Ramya Hebbalaguppe, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig. 1541-1549 [doi]
- Patch-based Privacy Preserving Neural Network for Vision TasksMitsuhiro Mabuchi, Tetsuya Ishikawa. 1550-1559 [doi]
- PreViTS: Contrastive Pretraining with Video Tracking SupervisionBrian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik. 1560-1570 [doi]
- Efficient Visual Tracking with Exemplar TransformersPhilippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool. 1571-1581 [doi]
- Multi-view Tracking Using Weakly Supervised Human Motion PredictionMartin Engilberge, Weizhe Liu, Pascal Fua. 1582-1592 [doi]
- Planar Object Tracking via Weighted Optical FlowJonás Serých, Jirí Matas. 1593-1602 [doi]
- Feature Disentanglement Learning with Switching and Aggregation for Video-based Person Re-IdentificationMinjung Kim, MyeongAh Cho, Sangyoun Lee. 1603-1612 [doi]
- Body Part-Based Representation Learning for Occluded Person Re-IdentificationVladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi. 1613-1623 [doi]
- Camera Alignment and Weighted Contrastive Learning for Domain Adaptation in Video Person ReIDDjebril Mekhazni, Maximilien Dufau, Christian Desrosiers, Marco Pedersoli, Eric Granger. 1624-1633 [doi]
- MEVID: Multi-view Extended Videos with Identities for Video Person Re-IdentificationDaniel Davila, Dawei Du, Bryon Lewis, Christopher Funk, Joseph Van Pelt, Roderic Collins, Kellie Corona, Matt S. Brown, Scott McCloskey, Anthony Hoogs, Brian Clipp. 1634-1643 [doi]
- Unsupervised 4D LiDAR Moving Object Segmentation in Stationary Settings with Multivariate Occupancy Time SeriesThomas Kreutz, Max Mühlhäuser, Alejandro Sánchez Guinea. 1644-1653 [doi]
- AttTrack: Online Deep Attention Transfer for Multi-object TrackingKeivan Nalaie, Rong Zheng. 1654-1663 [doi]
- Multi-Frame Attention with Feature-Level Warping for Drone Crowd TrackingTakanori Asanomi, Kazuya Nishimura, Ryoma Bise. 1664-1673 [doi]
- BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in VideoAli Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan. 1674-1683 [doi]
- Gallery Filter Network for Person SearchLucas Jaffe, Avideh Zakhor. 1684-1693 [doi]
- HIME: Efficient Headshot Image Super-Resolution with Multiple ExemplarsXiaoyu Xiang, Jon Morton, Fitsum A. Reda, Lucas D. Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan P. Allebach. 1694-1704 [doi]
- Fine-Context Shadow Detection using Shadow RemovalJeya Maria Jose Valanarasu, Vishal M. Patel. 1705-1714 [doi]
- Robust Real-world Image Enhancement Based on Multi-Exposure LDR ImagesHaoyu Ren, Yi Fan, Stephen Huang. 1715-1723 [doi]
- Pik-Fix: Restoring and Colorizing Old PhotosRunsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, Jinlong Li, Zibo Meng, Jiaqi Ma, Alan C. Bovik, Hongkai Yu. 1724-1734 [doi]
- Fast Online Video Super-Resolution with Deformable Attention PyramidDario Fuoli, Martin Danelljan, Radu Timofte, Luc Van Gool. 1735-1744 [doi]
- Style-Guided Inference of Transformer for High-resolution Image SynthesisJonghwa Yim, Minjae Kim. 1745-1755 [doi]
- PSENet: Progressive Self-Enhancement Network for Unsupervised Extreme-Light Image EnhancementHue Nguyen, Diep Tran, Khoi Nguyen 0001, Rang Nguyen. 1756-1765 [doi]
- Cross-Resolution Flow Propagation for Foveated Video Super-ResolutionEugene Lee, Lien-Feng Hsu, Evan Chen, Chen-Yi Lee. 1766-1775 [doi]
- GeoFill: Reference-Based Image Inpainting with Better Geometric UnderstandingYunhan Zhao, Connelly Barnes, YuQian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless C. Fowlkes. 1776-1786 [doi]
- iColoriT: Towards Propagating Local Hints to the Right Region in Interactive Colorization by Leveraging Vision TransformerJooyeol Yun, Sanghyeon Lee, Minho Park, Jaegul Choo. 1787-1796 [doi]
- Deep Model-Based Super-Resolution with Non-uniform BlurCharles Laroche, Andrés Almansa, Matias Tassano. 1797-1808 [doi]
- SHARDS: Efficient SHAdow Removal using Dual Stage Network for High-Resolution ImagesMrinmoy Sen, Sai Pradyumna Chermala, Nazrinbanu Nurmohammad Nagori, Venkat Peddigari, Praful Mathur, B. H. Pawan Prasad, Moon-Hwan Jeong. 1809-1817 [doi]
- Guiding Users to Where to Give Color Hints for Efficient Interactive Sketch Colorization via Unsupervised Region PrioritizationYoungin Cho, Junsoo Lee, Soyoung Yang, Juntae Kim, Yeojeong Park, Haneol Lee, Mohammad Azam Khan, DaeSik Kim, Jaegul Choo. 1818-1827 [doi]
- Efficient Reference-based Video Super-Resolution (ERVSR): Single Reference Image Is All You NeedYoungrae Kim, Jinsu Lim, Hoonhee Cho, Minji Lee, Dongman Lee, Kuk-Jin Yoon, Ho-Jin Choi. 1828-1837 [doi]
- Efficient Flow-Guided Multi-frame De-fencingStavros Tsogkas, Fengjia Zhang, Allan D. Jepson, Alex Levinshtein. 1838-1847 [doi]
- Perceptual Image Enhancement for Smartphone Real-Time ApplicationsMarcos V. Conde, Florin-Alexandru Vasluianu, Javier Vazquez-Corral, Radu Timofte. 1848-1858 [doi]
- Expert-defined Keywords Improve Interpretability of Retinal Image CaptioningTing-Wei Wu, Jia-Hong Huang, Joseph Lin, Marcel Worring. 1859-1868 [doi]
- Diffeomorphic Image Registration with Neural Velocity FieldKun Han, Shanlin Sun, Xiangyi Yan, Chenyu You, Hao Tang, Junayed Naushad, Haoyu Ma, Deying Kong, Xiaohui Xie. 1869-1879 [doi]
- ATCON: Attention Consistency for Vision ModelsAli Mirzazadeh, Florian Dubost, Maxwell Pike, Krish Maniar, Max Zuo, Christopher Lee-Messer, Daniel Rubin. 1880-1889 [doi]
- Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video ProcessingFlorian Dubost, Erin Hong, Siyi Tang, Nandita Bhaskhar, Christopher Lee-Messer, Daniel L. Rubin. 1890-1899 [doi]
- Analysis of Master Vein Attacks on Finger Vein Recognition SystemsHuy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen. 1900-1908 [doi]
- Computer Vision to the Rescue: Infant Postural Symmetry Estimation from Incongruent AnnotationsXiaofei Huang, Michael Wan, Lingfei Luan, Bethany Tunik, Sarah Ostadabbas. 1909-1917 [doi]
- VSGD-Net: Virtual Staining Guided Melanocyte Detection on Histopathological ImagesKechun Liu, Beibin Li, Wenjun Wu, Caitlin J. May, Oliver Chang, Stevan Knezevich, Lisa M. Reisch, Joann G. Elmore, Linda G. Shapiro. 1918-1927 [doi]
- PINER: Prior-informed Implicit Neural Representation Learning for Test-time Adaptation in Sparse-view CT ReconstructionBowen Song, Liyue Shen, Lei Xing 0001. 1928-1937 [doi]
- Robust and Efficient Alignment of Calcium Imaging Data through Simultaneous Low Rank and Sparse DecompositionJunmo Cho, Seungjae Han, Eun-Seo Cho, Kijung Shin, Young-Gyu Yoon. 1938-1947 [doi]
- MRI Imputation based on Fused Index- and Intensity-RegistrationJiyoon Shin, Jungwoo Lee. 1948-1957 [doi]
- DBCE : A Saliency Method for Medical Deep Learning Through Anatomically-Consistent Free-Form DeformationsJoshua Peters, Léo Lebrat, Rodrigo Santa Cruz, Aaron Nicolson, Gregg Belous, Salamata Konate, Parnesh Raniga, Vincent Doré, Pierrick Bourgeat, Jurgen Mejan-Fripp, Clinton Fookes, Olivier Salvado. 1958-1968 [doi]
- Masked Image Modeling Advances 3D Medical Image AnalysisZekai Chen, Devansh Agarwal, Kshitij Aggarwal, Wiem Safta, Mariann Micsinai Balan, Kevin Brown. 1969-1979 [doi]
- EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development ClassificationTien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler, Minh-Triet Tran, Ngan Le. 1980-1989 [doi]
- Performer: A Novel PPG-to-ECG Reconstruction Transformer for a Digital Biomarker of Cardiovascular Disease DetectionElla Lan. 1990-1998 [doi]
- A Morphology Focused Diffusion Probabilistic Model for Synthesis of Histopathology ImagesPuria Azadi Moghadam, Sanne Van Dalen, Karina C. Martin, Jochen Lennerz, Stephen Yip, Hossein Farahani, Ali Bashashati. 1999-2008 [doi]
- Contrastive Losses Are Natural Criteria for Unsupervised Video SummarizationZongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara. 2009-2018 [doi]
- M-FUSE: Multi-frame Fusion for Scene Flow EstimationLukas Mehl, Azin Jahedi, Jenny Schmalfuss, Andrés Bruhn. 2019-2028 [doi]
- BoxMask: Revisiting Bounding Box Supervision for Video Object DetectionKhurram Azeem Hashmi, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal. 2029-2039 [doi]
- Lightweight Network For Video Motion MagnificationJasdeep Singh, Subrahmanyam Murala, G. Sankara Raju Kosuru. 2040-2049 [doi]
- TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps DistillationFeiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness. 2050-2059 [doi]
- BrightFlow: Brightness-Change-Aware Unsupervised Learning of Optical FlowRémi Marsal, Florian Chabot, Angélique Loesch, Hichem Sahbi. 2060-2069 [doi]
- FLAVR: Flow-Agnostic Video Representations for Fast Frame InterpolationTarun Kalluri, Deepak Pathak, Manmohan Chandraker, Du Tran. 2070-2081 [doi]
- MovieCLIP: Visual Scene Recognition in MoviesDigbalay Bose, Rajat Hebbar, Krishna Somandepalli, Haoyang Zhang, Yin Cui, Kree Cole-McLaughlin, Huisheng Wang, Shrikanth Narayanan. 2082-2091 [doi]
- Neural Implicit Representations for Physical Parameter Inference from a Single VideoFlorian Hofherr, Lukas Koestler, Florian Bernard, Daniel Cremers. 2092-2102 [doi]
- Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shiftsFlorian Kadner, Tobias Thomas, David Hoppe, Constantin A. Rothkopf. 2103-2113 [doi]
- Match Cutting: Finding Cuts with Smooth Visual TransitionsBoris Chen, Amir Ziai, Rebecca S. Tucker, Yuchen Xie. 2114-2124 [doi]
- TTTFlow: Unsupervised Test-Time Training with Normalizing FlowDavid Osowiechi, Gustavo Adolfo Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers. 2125-2126 [doi]
- Weakly-Supervised Optical Flow Estimation for Time-of-FlightMichael Schelling, Pedro Hermosilla, Timo Ropinski. 2134-2143 [doi]
- Meta-Learning for Adaptation of Deep Optical Flow NetworksChaerin Min, Taehyun Kim, Jongwoo Lim. 2144-2153 [doi]
- Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction VideosZecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Goutsu, Yoichi Sato. 2154-2162 [doi]
- Dissecting Deep Metric Learning Losses for Image-Text RetrievalHong Xuan, Xi Stephen Chen. 2163-2172 [doi]
- Content-Based Music-Image Retrieval Using Self- and Cross-Modal Feature Embedding MemoryTakayuki Nakatsuka, Masahiro Hamasaki, Masataka Goto. 2173-2183 [doi]
- Complementary Cues from Audio Help Combat Noise in Weakly-Supervised Object DetectionCagri Gungor, Adriana Kovashka. 2184-2193 [doi]
- Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-ResolutionMariana-Iuliana Georgescu, Radu-Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan. 2194-2204 [doi]
- AudioViewer: Learning to Visualize SoundsChunjin Song, Yuchi Zhang, Willis Peng, Parmis Mohaghegh, Bastian Wandt, Helge Rhodin. 2205-2215 [doi]
- Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at ScaleAditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar. 2216-2225 [doi]
- Relaxing Contrastiveness in Multimodal Representation LearningZudi Lin, Erhan Bas, Kunwar Yashraj Singh, Gurumurthy Swaminathan, Rahul Bhotika. 2226-2235 [doi]
- Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video UnderstandingArda Senocak, Junsik Kim 0001, Tae Hyun Oh, Dingzeyu Li, In-So Kweon. 2236-2246 [doi]
- BirdSoundsDenoising: Deep Visual Audio Denoising for Bird SoundsYoushan Zhang, Jialu Li. 2247-2256 [doi]
- Audio-Visual Efficient Conformer for Robust Speech RecognitionMaxime Burchi, Radu Timofte. 2257-2266 [doi]
- Recipe2Video: Synthesizing Personalized Videos from Recipe TextsPrateksha Udhayanan, Suryateja BV, Parth Laturia, Dev Chauhan, Darshan Khandelwal, Stefano Petrangeli, Balaji Vasan Srinivasan. 2267-2276 [doi]
- Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source LocalizationDennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraju. 2277-2286 [doi]
- Instance-Dependent Noisy Label Learning via Graphical ModellingArpit Garg, Cuong Nguyen 0006, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro. 2287-2297 [doi]
- Composite Learning for Robust and Effective Dense PredictionsMenelaos Kanakis, Thomas E. Huang, David Brüggemann, Fisher Yu, Luc Van Gool. 2298-2307 [doi]
- Improving Multi-fidelity Optimization with a Recurring Learning Rate for Hyperparameter TuningHyunjae Lee, GiHyeon Lee, Junhwan Kim, Sungjun Cho, Dohyun Kim, Donggeun Yoo. 2308-2317 [doi]
- Understanding the Role of Mixup in Knowledge Distillation: An Empirical StudyHongjun Choi, Eun Som Jeon, Ankita Shukla, Pavan Turaga. 2318-2327 [doi]
- Cross-task Attention Mechanism for Dense Multi-task LearningIvan Lopes, Tuan-Hung Vu, Raoul de Charette. 2328-2337 [doi]
- Continual Learning with Dependency Preserving HypernetworksDupati Srikar Chandra, Sakshi Varshney, P. K. Srijith, Sunil Gupta. 2338-2347 [doi]
- FLOAT: Fast Learnable Once-for-All Adversarial Training for Tunable Trade-off between Accuracy and RobustnessSouvik Kundu 0002, Sairam Sundaresan, Massoud Pedram, Peter A. Beerel. 2348-2357 [doi]
- Online Knowledge Distillation for Multi-task LearningGeethu Miriam Jacob, Vishal Agarwal, Björn Stenger. 2358-2367 [doi]
- HyperPosePDF Hypernetworks Predicting the Probability Distribution on SO(3)Timon Höfer, Benjamin Kiefer, Martin Messmer, Andreas Zell. 2368-2378 [doi]
- Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth InformationAhmed Ben Saad, Kristina Prokopetc, Josselin Kherroubi, Axel Davy, Adrien Courtois, Gabriele Facciolo. 2379-2388 [doi]
- What can we Learn by Predicting Accuracy?Olivier Risser-Maroix, Benjamin Chamand. 2389-2398 [doi]
- AdvisIL - A Class-Incremental Learning AdvisorEva Feillet, Grégoire Petit, Adrian Popescu 0001, Marina Reyboz, Céline Hudelot. 2399-2408 [doi]
- Searching for Robust Binary Neural Networks via Bimodal Parameter PerturbationDaehyun Ahn, HyungJun Kim, Taesu Kim, Eunhyeok Park, Jae-Joon Kim. 2409-2418 [doi]
- RIFT: Disentangled Unsupervised Image Translation via Restricted Information FlowBen Usman, Dina Bashkirova, Kate Saenko. 2419-2428 [doi]
- Enabling ISPless Low-Power Computer VisionGourav Datta, Zeyu Liu, Zihan Yin, Linyu Sun, Akhilesh R. Jaiswal, Peter A. Beerel. 2429-2438 [doi]
- On the Importance of Denoising when Learning to Compress ImagesBenoit Brummer, Christophe De Vleeschouwer. 2439-2447 [doi]
- End-to-End Single-Frame Image Signal Processing for High Dynamic Range ScenesKhanh Quoc Dinh, Kwang-Pyo Choi. 2448-2457 [doi]
- No Reference Opinion Unaware Quality Assessment of Authentically Distorted ImagesNithin C. Babu, Vignesh Kannan, Rajiv Soundararajan. 2458-2467 [doi]
- HyperShot: Few-Shot Learning by Kernel HyperNetworksMarcin Sendera, Marcin Przewiezlikowski, Konrad Karanowski, Maciej Zieba, Jacek Tabor, Przemyslaw Spurek. 2468-2477 [doi]
- Contrastive Knowledge-Augmented Meta-Learning for Few-Shot ClassificationRakshith Subramanyam, Mark Heimann, T. S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan. 2478-2486 [doi]
- Few-shot Medical Image Segmentation with Cycle-resemblance AttentionHao Ding, Changchang Sun, Hao Tang 0005, Dawen Cai, Yan Yan 0002. 2487-2496 [doi]
- Neural Distributed Image Compression with Cross-Attention Feature AlignmentNitish Mital, Ezgi Özyilkan, Ali Garjani, Deniz Gündüz. 2497-2506 [doi]
- MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video ClassificationHuanle Zhang, Hamed Pirsiavash, Xin Liu. 2507-2516 [doi]
- Pixel-Wise Prediction based Visual Odometry via Uncertainty EstimationHao-Wei Chen, Ting-Hsuan Liao, Hsuan-Kung Yang, Chun-Yi Lee. 2517-2527 [doi]
- Universal Deep Image Compression via Content-Adaptive Optimization with AdaptersKoki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa. 2528-2537 [doi]
- Language-free Training for Zero-shot Video GroundingDahye Kim, Jungin Park, Jiyoung Lee, Seongheon Park, Kwanghoon Sohn. 2538-2547 [doi]
- Separating Partially-Polarized Diffuse and Specular Reflection Components under Unpolarized Light SourcesSoma Kajiyama, Taihe Piao, Ryo Kawahara, Takahiro Okabe. 2548-2557 [doi]
- Elimination of Non-Novel Segments at Multi-Scale for Few-Shot SegmentationAlper Kayabasi, Gülin Tüfekci, Ilkay Ulusoy. 2558-2566 [doi]
- An Unified Framework for Language Guided Image CompletionJihyun Kim, Seong-Hun Jeong, Kyeongbo Kong, Suk-Ju Kang. 2567-2577 [doi]
- Cross-Domain Video Anomaly Detection without Target Domain AdaptationAbhishek Aich, Kuan-Chuan Peng, Amit K. Roy Chowdhury. 2578-2590 [doi]
- Asymmetric Student-Teacher Networks for Industrial Anomaly DetectionMarco Rudolph, Tom Wehrbein, Bodo Rosenhahn, Bastian Wandt. 2591-2601 [doi]
- Heatmap-based Out-of-Distribution DetectionJulia Hornauer, Vasileios Belagiannis. 2602-2611 [doi]
- Anomaly Detection in 3D Point Clouds using Deep Geometric DescriptorsPaul Bergmann, David Sattlegger. 2612-2622 [doi]
- Training Auxiliary Prototypical Classifiers for Explainable Anomaly Detection in Medical Image SegmentationWonwoo Cho, Jeonghoon Park, Jaegul Choo. 2623-2632 [doi]
- Bi-directional Frame Interpolation for Unsupervised Video Anomaly DetectionHanqiu Deng, Zhaoxiang Zhang, Shihao Zou, Xingyu Li. 2633-2642 [doi]
- Hyperdimensional Feature Fusion for Out-of-Distribution DetectionSamuel Wilson, Tobias Fischer 0001, Niko Sünderhauf, Feras Dayoub. 2643-2653 [doi]
- Towards Interpretable Video Anomaly DetectionKeval Doshi, Yasin Yilmaz. 2654-2663 [doi]
- Normality Guided Multiple Instance Learning for Weakly Supervised Video Anomaly DetectionSeongheon Park, Hanjae Kim, Minsu Kim, Dahye Kim, Kwanghoon Sohn. 2664-2673 [doi]
- Mutual Learning for Long-Tailed RecognitionChanghwa Park, Junho Yim, Eunji Jun. 2674-2683 [doi]
- Representation Recovering for Self-Supervised Pre-training on Medical ImagesXiangyi Yan, Junayed Naushad, Shanlin Sun, Kun Han, Hao Tang, Deying Kong, Haoyu Ma, Chenyu You, Xiaohui Xie. 2684-2694 [doi]
- Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and BeyondCheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang, Yu-Chiang Frank Wang. 2695-2704 [doi]
- Similarity Contrastive Estimation for Self-Supervised Soft Contrastive LearningJulien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault, Stéphane Canu. 2705-2715 [doi]
- Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological ImagesPrakash Chandra Chhipa, Richa Upadhyay, Gustav Grund Pihlgren, Rajkumar Saini, Seiichi Uchida, Marcus Liwicki. 2716-2726 [doi]
- Motion Aware Self-Supervision for Generic Event Boundary DetectionAyush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor. 2727-2738 [doi]
- Improving Predicate Representation in Scene Graph Generation by Self-Supervised LearningSo Hasegawa, Masayuki Hiromoto, Akira Nakagawa, Yuhei Umeda. 2739-2748 [doi]
- An Embedding-Dynamic Approach to Self-Supervised LearningSuhong Moon, Domas Buracas, Seunghyun Park, Jinkyu Kim, John F. Canny. 2749-2757 [doi]
- TeST: Test-time Self-Training under Distribution ShiftSamarth Sinha, Peter V. Gehler, Francesco Locatello, Bernt Schiele. 2758-2768 [doi]
- SSSD: Self-Supervised Self DistillationWei-Chi Chen, Wei-Ta Chu. 2769-2776 [doi]
- Multi-level Contrastive Learning for Self-Supervised Vision TransformersShentong Mo, Zhun Sun, Chao Li 0013. 2777-2786 [doi]
- Self-Supervised 2D/3D Registration for X-Ray to CT Image FusionSrikrishna Jaganathan, Maximilian Kukla, Jian Wang 0009, Karthik Shetty, Andreas K. Maier. 2787-2797 [doi]
- FUSSL: Fuzzy Uncertain Self Supervised LearningSalman Mohamadi, Gianfranco Doretto, Donald A. Adjeroh. 2798-2807 [doi]
- Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data AugmentationAtsuyuki Miyai, Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa. 2808-2817 [doi]
- Self-Supervised Distilled Learning for Multi-modal Misinformation IdentificationMichael Mu, Sreyasee Das Bhattacharjee, Junsong Yuan. 2818-2827 [doi]
- Self-Distilled Self-supervised Representation LearningJiho Jang, Seonhoon Kim, KiYoon Yoo, Chaerin Kong, Jangho Kim, Nojun Kwak. 2828-2838 [doi]
- TransVLAD: Multi-Scale Attention-Based Global Descriptors for Visual Geo-LocalizationYifan Xu, Pourya Shamsolmoali, Eric Granger, Claire Nicodeme, Laurent Gardes, Jie Yang 0002. 2839-2848 [doi]
- Learnable Human Mesh Triangulation for 3D Human Pose and Shape EstimationSungho Chun, Sungbum Park, Ju Yong Chang. 2849-2858 [doi]
- COPE: End-to-end trainable Constant Runtime Object Pose EstimationStefan Thalhammer, Timothy Patten, Markus Vincze. 2859-2869 [doi]
- ElliPose: Stereoscopic 3D Human Pose Estimation by Fitting EllipsoidsChristian Grund, Julian Tanke, Juergen Gall. 2870-2880 [doi]
- Partially calibrated semi-generalized pose from hybrid point correspondencesSnehal Bhayani, Torsten Sattler, Viktor Larsson, Janne Heikkilä, Zuzana Kukelova. 2881-2890 [doi]
- ImPosing: Implicit Pose Encoding for Efficient Visual LocalizationArthur Moreau, Thomas Gilles, Nathan Piasco, Dzmitry Tsishkou, Bogdan Stanciulescu, Arnaud de La Fortelle. 2891-2901 [doi]
- Uplift and Upsample: Efficient 3D Human Pose Estimation with Uplifting TransformersMoritz Einfalt, Katja Ludwig, Rainer Lienhart. 2902-2912 [doi]
- Cross-View Image Sequence Geo-localizationXiaohan Zhang, Waqas Sultani, Safwan Wshah. 2913-2922 [doi]
- CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D AnnotationsCheng-Yen Yang, Jiajia Luo, Lu Xia, Yuyin Sun, Nan Qiao 0009, Ke Zhang, Zhongyu Jiang, Jenq-Neng Hwang, Cheng-Hao Kuo. 2923-2932 [doi]
- Image-free Domain Generalization via CLIP for 3D Hand Pose EstimationSeongyeong Lee, Hansoo Park, Dong-Uk Kim, Jihyeon Kim, Muhammadjon Boboev, SeungRyul Baek. 2933-2943 [doi]
- Benchmarking Visual Localization for Autonomous NavigationLauri Suomela, Jussi Kalliola, Atakan Dag, Harry Edelman, Joni-Kristian Kämäräinen. 2944-2954 [doi]
- Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton FormatsIstván Sárándi, Alexander Hermans, Bastian Leibe. 2955-2965 [doi]
- 3D GAN Inversion with Pose OptimizationJaehoon Ko, Kyusun Cho, Daewon Choi, Kwangrok Ryoo, Seungryong Kim. 2966-2975 [doi]
- Marker-removal Networks to Collect Precise 3D Hand Data for RGB-based Estimation and its Application in PianoErwin Wu, Hayato Nishioka, Shinichi Furuya, Hideki Koike. 2976-2985 [doi]
- Vis2Rec: A Large-Scale Visual Dataset for Visit RecommendationMichaël Soumm, Adrian Popescu 0001, Bertrand Delezoide. 2986-2996 [doi]
- MixVPR: Feature Mixing for Visual Place RecognitionAmar Ali-bey, Brahim Chaib-draa, Philippe Giguère. 2997-3006 [doi]
- CountNet3D: A 3D Computer Vision Approach to Infer Counts of Occluded ObjectsPorter Jenkins, Kyle Armstrong, Stephen Nelson, Siddhesh Gotad, J. Stockton Jenkins, Wade Wilkey, Tanner Watts. 3007-3016 [doi]
- Temporally Consistent Online Depth Estimation in Dynamic ScenesZhaoshuo Li, Wei Ye, Dilin Wang, Francis X. Creighton, Russell H. Taylor, Ganesh Venkatesh, Mathias Unberath. 3017-3026 [doi]
- Uncertainty-Aware Interactive LiDAR Sampling for Deep Depth CompletionKensuke Taguchi, Shogo Morita, Yusuke Hayashi, Wataru Imaeda, Hironobu Fujiyoshi. 3027-3035 [doi]
- nLMVS-Net: Deep Non-Lambertian Multi-View StereoKohei Yamashita, Yuto Enyo, Shohei Nobuhara, Ko Nishino. 3036-3045 [doi]
- Wiener Guided DIP for Unsupervised Blind Image DeconvolutionGustav Bredell, Ertunc Erdil, Bruno Weber, Ender Konukoglu. 3046-3055 [doi]
- 360MVSNet: Deep Multi-view Stereo Network with 360° Images for Indoor Scene ReconstructionChing-Ya Chiu, Yu-Ting Wu, I-Chao Shen, Yung-Yu Chuang. 3056-3065 [doi]
- Fast Differentiable Transient Rendering for Non-Line-of-Sight ReconstructionMarkus Plack, Clara Callenberg, Monika Schneider, Matthias B. Hullin. 3066-3075 [doi]
- MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic SegmentationAndra Petrovai, Sergiu Nedevschi. 3076-3085 [doi]
- DELS-MVS: Deep Epipolar Line Search for Multi-View StereoChristian Sormann, Emanuele Santellani, Mattia Rossi, Andreas Kuhn 0005, Friedrich Fraundorfer. 3086-3095 [doi]
- Probabilistic Volumetric Fusion for Dense Monocular SLAMAntoni Rosinol, John J. Leonard, Luca Carlone. 3096-3104 [doi]
- High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDFLu Sang, Bjoern Haefner, Xingxing Zuo, Daniel Cremers. 3105-3114 [doi]
- High-Resolution Depth Estimation for 360° Panoramas through Perspective and Panoramic Depth Images RegistrationChi-Han Peng, Jiayao Zhang 0005. 3115-3124 [doi]
- Multi-View Photometric Stereo RevisitedBerk Kaya, Suryansh Kumar, Carlos Eduardo Porto de Oliveira, Vittorio Ferrari, Luc Van Gool. 3125-3134 [doi]
- Automated Line Labelling: Dataset for Contour Detection and 3D ReconstructionHari Santhanam, Nehal Doiphode, Jianbo Shi. 3135-3144 [doi]
- Anisotropic Multi-Scale Graph Convolutional Network for Dense Shape CorrespondenceMohammad Farazi, Wenhui Zhu, Zhangsihao Yang, Yalin Wang 0001. 3145-3154 [doi]
- Automatically Annotating Indoor Images with CAD Models via RGB-D ScansStefan Ainetter, Sinisa Stekovic, Friedrich Fraundorfer, Vincent Lepetit. 3155-3163 [doi]
- Intra-Batch Supervision for Panoptic Segmentation on High-Resolution ImagesDaan de Geus, Gijs Dubbelman. 3164-3172 [doi]
- Refign: Align and Refine for Adaptation of Semantic Segmentation to Adverse ConditionsDavid Brüggemann, Christos Sakaridis, Prune Truong, Luc Van Gool. 3173-3183 [doi]
- Weakly Supervised Cell-Instance Segmentation with Two Types of Weak Labels by Single Instance PastingKazuya Nishimura, Ryoma Bise. 3184-3193 [doi]
- Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic SegmentationDipam Goswami, René Schuster, Joost van de Weijer 0001, Didier Stricker. 3194-3203 [doi]
- Semantic Segmentation of Degraded Images Using Layer-Wise Feature AdjustorKazuki Endo, Masayuki Tanaka 0001, Masatoshi Okutomi. 3204-3212 [doi]
- Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty QuantificationMatthias Rottmann, Marco Reese. 3213-3222 [doi]
- Full Contextual Attention for Multi-resolution Transformers in Semantic SegmentationLoic Themyr, Clément Rambour, Nicolas Thome, Toby Collins, Alexandre Hostettler. 3223-3232 [doi]
- WSNet: Towards An Effective Method for Wound Image SegmentationSubba Reddy Oota, Vijay Rowtula, Shahid Saleem Mohammed, Minghsun Liu, Manish Gupta 0001. 3233-3242 [doi]
- Autoencoder-based background reconstruction and foreground segmentation with background noise estimationBruno Sauvalle, Arnaud de La Fortelle. 3243-3254 [doi]
- LoopDA: Constructing Self-loops to Adapt Nighttime Semantic SegmentationFengyi Shen, Zador Pataki, Akhil Gurram, Ziyuan Liu, He Wang, Alois C. Knoll. 3255-3265 [doi]
- Image Segmentation-based Unsupervised Multiple Objects DiscoverySandra Kara, Hejer Ammar, Florian Chabot, Quoc-Cuong Pham. 3276-3285 [doi]
- X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View SegmentationShubhankar Borse, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Kumar Yogamani, Fatih Porikli. 3286-3296 [doi]
- Modality Mixer for Multi-modal Action RecognitionSumin Lee, Sangmin Woo, Yeonju Park, Muhammad Adi Nugroho, Changick Kim. 3297-3306 [doi]
- Fine-grained Activities of People WorldwideJeffrey Byrne, Greg Castañón, Zhongheng Li, Gil J. Ettinger. 3307-3318 [doi]
- STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action RecognitionDasom Ahn, Sangwon Kim, HyunSu Hong, ByoungChul Ko. 3319-3328 [doi]
- Holistic Interaction Transformer Network for Action DetectionGueter Josmy Faure, Min-Hung Chen, Shang-Hong Lai. 3329-3339 [doi]
- VirtualHome Action Genome: A Simulated Spatio-Temporal Scene Graph Dataset with Consistent Relationship LabelsYue Qiu 0001, Yoshiki Nagasaki, Kensho Hara, Hirokatsu Kataoka, Ryota Suzuki 0006, Kenji Iwata, Yutaka Satoh. 3340-3349 [doi]
- Stop or Forward: Dynamic Layer Skipping for Efficient Action RecognitionJong-Hyeon Seon, Jaedong Hwang, Jonghwan Mun, Bohyung Han. 3350-3359 [doi]
- Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action RecognitionDawei Du, Ameya Shringi, Anthony Hoogs, Christopher Funk. 3360-3369 [doi]
- Multi-View Action Recognition using Contrastive LearningKetul Shah, Anshul Shah, Chun Pong Lau 0001, Celso M. de Melo, Rama Chellappa. 3370-3380 [doi]
- Multimodal Vision Transformers with Forced Attention for Behavior AnalysisTanay Agrawal, Michal Balazia, Philipp Müller, François Brémond. 3381-3391 [doi]
- Efficient Skeleton-Based Action Recognition via Joint-Mapping strategiesMin Seok Kang, Dongoh Kang, Hansaem Kim. 3392-3401 [doi]
- GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action PredictionSamrudhdhi B. Rangrej, Kevin J. Liang, Tal Hassner, James J. Clark. 3402-3412 [doi]
- Harnessing Unrecognizable Faces for Improving Face RecognitionSiqi Deng, Yuanjun Xiong, Meng Wang, Wei Xia, Stefano Soatto. 3413-3422 [doi]
- IFQA: Interpretable Face Quality AssessmentByungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong. 3433-3442 [doi]
- FaceDancer: Pose- and Occlusion-Aware High Fidelity Face SwappingFelix Rosberg, Eren Erdal Aksoy, Fernando Alonso-Fernandez, Cristofer Englund. 3443-3452 [doi]
- Fine Gaze Redirection Learning with Gaze Hardness-aware TransformationSangjin Park, Daeha Kim, Byung Cheol Song. 3453-3462 [doi]
- Scaling Neural Face Synthesis to High FPS and Low Latency by Neural CachingFrank Yu, Sid Fels, Helge Rhodin. 3463-3472 [doi]
- QMagFace: Simple and Accurate Quality-Aware Face RecognitionPhilipp Terhörst, Malte Ihlefeld, Marco Huber, Naser Damer, Florian Kirchbuchner, Kiran B. Raja, Arjan Kuijper. 3473-3483 [doi]
- FaceOff: A Video-to-Video Face Swapping SystemAditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar. 3484-3493 [doi]
- Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive LossTingyu Qu, Tinne Tuytelaars, Marie-Francine Moens. 3494-3503 [doi]
- Mesh-Tension Driven Expression-Based Wrinkles for Synthetic FacesChirag Raman, Charlie Hewitt, Erroll Wood, Tadas Baltrusaitis. 3504-3514 [doi]
- DigiFace-1M: 1 Million Digital Face Images for Face RecognitionGwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien P. C. Valentin, Roberto Cipolla, JingJing Shen. 3515-3524 [doi]
- 3DMM-RF: Convolutional Radiance Fields for 3D Face ModelingStathis Galanakis, Baris Gecer, Alexandros Lattas, Stefanos Zafeiriou. 3525-3536 [doi]
- Unifying Margin-Based Softmax Losses in Face RecognitionYang Zhang, Simao Herdade, Kapil Thadani, Eric Dodds, Jack Culpepper, Yueh-Ning Ku. 3537-3546 [doi]
- FastSwap: A Lightweight One-Stage Framework for Real-Time Face SwappingSahng-Min Yoo, Tae-Min Choi, Jae-Woo Choi, Jong-Hwan Kim. 3547-3556 [doi]
- Knowing What to Label for Few Shot Microscopy Image Cell SegmentationYoussef Dawoud, Arij Bouazizi, Katharina Ernst, Gustavo Carneiro, Vasileios Belagiannis. 3557-3566 [doi]
- A Deep Neural Framework to Detect Individual Advertisement (Ad) from VideosZongyi Liu. 3567-3576 [doi]
- Delving into Masked Autoencoders for Multi-Label Thorax Disease ClassificationJunfei Xiao, Yutong Bai, Alan L. Yuille, Zongwei Zhou. 3577-3589 [doi]
- OutfitTransformer: Learning Outfit Representations for Fashion RecommendationRohan Sarkar, Navaneeth Bodla, Mariya I. Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gerard Medioni. 3590-3598 [doi]
- LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich DocumentsPuneet Mathur, Rajiv Jain, Ashutosh Mehra 0002, Jiuxiang Gu, Franck Dernoncourt, Anandhavelu Natarajan, Quan Hung Tran, Verena Kaynig-Fittkau, Ani Nenkova, Dinesh Manocha, Vlad I. Morariu. 3599-3609 [doi]
- Color Recommendation for Vector Graphic Documents based on Multi-Palette RepresentationQianru Qiu, Xueting Wang, Mayu Otani, Yuki Iwazaki. 3610-3618 [doi]
- Probabilistic Integration of Object Level Annotations in Chest X-ray ClassificationTom van Sonsbeek, Xiantong Zhen, Dwarikanath Mahapatra, Marcel Worring. 3619-3629 [doi]
- D-Extract: Extracting Dimensional Attributes From Product ImagesPushpendu Ghosh, Nancy Wang, Promod Yenigalla. 3630-3638 [doi]
- Generative Colorization of Structured Mobile Web PagesKotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi. 3639-3648 [doi]
- The Fully Convolutional Transformer for Medical Image SegmentationAthanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, Dirk Husmeier. 3649-3658 [doi]
- Multi-scale Cell-based Layout Representation for Document UnderstandingYuzhi Shi, Mijung Kim, Yeongnam Chae. 3659-3668 [doi]
- Efficient few-shot learning for pixel-precise handwritten document layout analysisAxel De Nardin, Silvia Zottin, Matteo Paier, Gian Luca Foresti, Emanuela Colombi, Claudio Piciarelli. 3669-3677 [doi]
- OCR-VQGAN: Taming Text-within-Image GenerationJuan A. Rodriguez, David Vázquez 0001, Issam H. Laradji, Marco Pedersoli, Pau Rodríguez. 3678-3687 [doi]
- Tracking Growth and Decay of Plant Roots in Minirhizotron ImagesAlexander Gillert, Bo Peters, Uwe Freiherr von Lukas, Jürgen Kreyling, Gesche Blume-Werry. 3688-3697 [doi]
- Handling Image and Label Resolution Mismatch in Remote SensingScott Workman, Armin Hadzic, M. Usman Rafique. 3698-3707 [doi]
- ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object DetectionRebbapragada V. C. Sairam, Monish Keswani, Uttaran Sinha, Nishit Shah, Vineeth N. Balasubramanian. 3708-3717 [doi]
- The CropAndWeed Dataset: a Multi-Modal Learning Approach for Efficient Crop and Weed ManipulationDaniel Steininger, Andreas Trondl, Gerardus Croonen, Julia Simon, Verena Widhalm. 3718-3727 [doi]
- DSTrans: Dual-Stream Transformer for Hyperspectral Image RestorationDabing Yu, Qingwu Li, Xiaolin Wang, Zhiliang Zhang, Yixi Qian, Chang Xu. 3728-3738 [doi]
- Learning Few-shot Segmentation from Bounding Box AnnotationsByeolyi Han, Tae Hyun Oh. 3739-3748 [doi]
- Towards Discriminative and Transferable One-Stage Few-Shot Object DetectorsKarim Guirguis, Mohamed Abdelsamad, George Eskandar, Ahmed Hendawy, Matthias Kayser, Bin Yang 0009, Juergen Beyerer. 3749-3758 [doi]
- Learning incoherent light emission steering from metasurfaces using generative modelsPrasad P. Iyer, Saaketh Desai, Sadhvikas Addamane, Rémi Dingreville, Igal Brener. 3759-3766 [doi]
- Transformers For Recognition In Overhead Imagery: A Reality CheckFrancesco Luzi, Aneesh Gupta, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof. 3767-3776 [doi]
- Wavelength-aware 2D Convolutions for Hyperspectral ImagingLeon Amadeus Varga, Martin Messmer, Nuri Benbarka, Andreas Zell. 3777-3786 [doi]
- Semantic Segmentation in Aerial Imagery Using Multi-level Contrastive Learning with Local ConsistencyMaofeng Tang, Konstantinos Georgiou, Hairong Qi 0001, Cody Champion, Marc Bosch. 3787-3796 [doi]
- Are Straight-Through gradients and Soft-Thresholding all you need for Sparse Training?Antoine Vanderschueren, Christophe De Vleeschouwer. 3797-3806 [doi]
- Learning Attention Propagation for Compositional Zero-Shot LearningMuhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal. 3817-3826 [doi]
- Computer Vision for International Border LegibilityTrevor Ortega, Thomas Nelson, Skyler Crane, Josh Myers-Dean, Scott Wehrwein. 3827-3836 [doi]
- SONGs: Self-Organizing Neural GraphsLukasz Struski, Tomasz Danel, Marek Smieja, Jacek Tabor, Bartosz Zielinski 0001. 3837-3846 [doi]
- Learning Lightweight Neural Networks via Channel-Split Recurrent ConvolutionGuojun Wu, Xin Zhang, Ziming Zhang, Yanhua Li, Xun Zhou, Christopher G. Brinton, Zhenming Liu. 3847-3857 [doi]
- SPIQ: Data-Free Per-Channel Static Input QuantizationEdouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly. 3858-3867 [doi]
- Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label AnnotationsThomas Verelst, Paul K. Rubenstein, Marcin Eichner, Tinne Tuytelaars, Maxim Berman. 3868-3878 [doi]
- CORL: Compositional Representation Learning for Few-Shot ClassificationJu He, Adam Kortylewski, Alan L. Yuille. 3879-3888 [doi]
- Compact and Optimal Deep Learning with Recurrent Parameter GeneratorsJiayun Wang, Yubei Chen, Stella X. Yu, Brian Cheung, Yann LeCun. 3889-3899 [doi]
- FeTrIL: Feature Translation for Exemplar-Free Class-Incremental LearningGrégoire Petit, Adrian Popescu 0001, Hugo Schindler, David Picard, Bertrand Delezoide. 3900-3909 [doi]
- Gradient-Based Quantification of Epistemic Uncertainty for Deep Object DetectorsTobias Riedlinger, Matthias Rottmann, Marius Schubert, Hanno Gottschalk. 3910-3920 [doi]
- Adaptive Sample Selection for Robust Learning under Label NoiseDeep Patel, P. S. Sastry. 3921-3931 [doi]
- Randomness is the Root of All Evil: More Reliable Evaluation of Deep Active LearningYilin Ji, Daniel Kästner, Oliver Wirth, Christian Wressnegger. 3932-3941 [doi]
- PatchDropout: Economizing Vision Transformers Using Patch DropoutYue Liu, Christos Matsoukas, Fredrik Strand, Hossein Azizpour, Kevin Smith 0001. 3942-3951 [doi]
- PRN: Panoptic Refinement NetworkBo Sun, Jason Kuen, Zhe Lin 0001, Philippos Mordohai, Simon Chen. 3952-3962 [doi]
- Self-Attentive Pooling for Efficient Deep LearningFang Chen, Gourav Datta, Souvik Kundu 0002, Peter A. Beerel. 3963-3972 [doi]
- Accumulated Trivial Attention Matters in Vision Transformers on Small DatasetsXiangyu Chen, Qinghao Hu, Kaidong Li, Cuncong Zhong, Guanghui Wang 0001. 3973-3981 [doi]
- The Change You Want to SeeRagav Sachdeva, Andrew Zisserman. 3982-3991 [doi]
- Ancestor Search: Generalized Open Set Recognition via Hyperbolic Side Information LearningXiwen Dengxiong, Yu Kong. 3992-4001 [doi]
- Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision TransformersChang Chen, Jiaming Zhang 0001, Kailun Yang 0001, Kunyu Peng, Rainer Stiefelhagen. 4002-4011 [doi]
- More Knowledge, Less Bias: Unbiasing Scene Graph Generation with Explicit Ontological AdjustmentZhanwen Chen, Saed Rezayi, Sheng Li. 4012-4021 [doi]
- AFPSNet: Multi-Class Part Parsing based on Scaled Attention and Feature FusionNjuod Alsudays, Jing Wu, Yu-Kun Lai, Ze Ji. 4022-4031 [doi]
- Foreground Guidance and Multi-Layer Feature Fusion for Unsupervised Object Discovery with TransformersZhiwei Lin, Zengyu Yang, Yongtao Wang. 4032-4042 [doi]
- Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion ModelShin-I Cheng, Yu-Jie Chen, Wei-chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee. 4043-4051 [doi]
- Single-Image HDR Reconstruction by Multi-Exposure GenerationPhuoc-Hieu Le, Quynh Le, Rang Nguyen, Binh-Son Hua. 4052-4061 [doi]
- Dynamic Neural PortraitsMichail Christos Doukas, Stylianos Ploumpis, Stefanos Zafeiriou. 4062-4072 [doi]
- Is Bigger Always Better? An Empirical Study on Efficient Architectures for Style Transfer and BeyondJie An, Tao Li 0040, Hao-Zhi Huang 0001, Jinwen Ma, Jiebo Luo. 4073-4083 [doi]
- SLI-pSp: Injecting Multi-Scale Spatial Layout in pSpAradhya Neeraj Mathur, Anish Madan, Ojaswa Sharma. 4084-4093 [doi]
- Semi-Supervised Learning for Low-light Image Restoration through Quality Assisted Pseudo-LabelingSameer Malik, Rajiv Soundararajan. 4094-4103 [doi]
- Contrastive Learning of Semantic Concepts for Open-set Cross-domain RetrievalAishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan, Biplab Banerjee. 4104-4113 [doi]
- Generative Alignment of Posterior Probabilities for Source-free Domain AdaptationSachin Chhabra, Hemanth Venkateswara, Baoxin Li. 4114-4123 [doi]
- FFM: Injecting Out-of-Domain Knowledge via Factorized Frequency ModificationZijian Wang, Yadan Luo, Zi Huang, Mahsa Baktashmotlagh. 4124-4133 [doi]
- Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action DetectionYifan Lu, Gurkirt Singh, Suman Saha 0001, Luc Van Gool. 4134-4145 [doi]
- Center-aware Adversarial Augmentation for Single Domain GeneralizationTianle Chen, Mahsa Baktashmotlagh, Zijian Wang, Mathieu Salzmann. 4146-4154 [doi]
- Self-Distillation for Unsupervised 3D Domain AdaptationAdriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi di Stefano. 4155-4166 [doi]
- CoNMix for Source-free Single and Multi-target Domain AdaptationVikash Kumar, Rohit Lal, Himanshu Patil, Anirban Chakraborty. 4167-4177 [doi]
- Domain Adaptation using Self-Training with Mixup for One-Stage Object DetectionJitender Maurya, Keyur R. Ranipa, Osamu Yamaguchi, Tomoyuki Shibata, Daisuke Kobayashi. 4178-4187 [doi]
- Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action RecognitionSofia Broomé, Ernest Pokropek, Boyu Li, Hedvig Kjellström. 4188-4198 [doi]
- Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain AdaptationAadarsh Sahoo, Rameswar Panda, Rogério Feris, Kate Saenko, Abir Das. 4199-4208 [doi]
- Learning Style Subspaces for Controllable Unpaired Domain TranslationGaurav Bhatt, Vineeth N. Balasubramanian. 4209-4218 [doi]
- TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object DetectionZhipeng Luo, Gongjie Zhang, Changqing Zhou, Tianrui Liu, Shijian Lu, Liang Pan. 4219-4228 [doi]
- Li3DeTr: A LiDAR based 3D Detection TransformerGopi Krishna Erabati, Helder Araújo. 4239-4248 [doi]
- ImpDet: Exploring Implicit Fields for 3D Object DetectionXuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, Xiangyang Xue. 4249-4259 [doi]
- Weakly-supervised Point Cloud Instance Segmentation with Geometric PriorsHeming Du, Xin Yu 0002, Farookh Hussain, Mohammad Ali Armin, Lars Petersson, Weihao Li. 4260-4269 [doi]
- Multivariate Probabilistic Monocular 3D Object DetectionXuepeng Shi, Zhixiang Chen 0003, Tae-Kyun Kim. 4270-4279 [doi]
- Learning to Detect 3D Lanes by Shape Matching and EmbeddingRuixin Liu, Zhihao Guan, Zejian Yuan, Ao Liu, Tong Zhou, Tang Kun, Erlong Li, Chao Zheng, Shuqi Mei. 4280-4288 [doi]
- DSAG: A Scalable Deep Framework for Action-Conditioned Multi-Actor Full Body Motion SynthesisDebtanu Gupta, Shubh Maheshwari, Sai Shashank Kalakonda, Manasvi Vaidyula, Ravi Kiran Sarvadevabhatla. 4289-4297 [doi]
- Self-improving Multiplane-to-layer Images for Novel View SynthesisPavel Solovev, Taras Khakhulin, Denis Korzhenkov. 4298-4307 [doi]
- SketchInverter: Multi-Class Sketch-Based Image Generation via GAN InversionZirui An, Jingbo Yu, Runtao Liu, Chuang Wang, Qian Yu. 4308-4318 [doi]
- Recovering Fine Details for Neural Implicit Surface ReconstructionDecai Chen, Peng Zhang, Ingo Feldmann, Oliver Schreer, Peter Eisert. 4319-4328 [doi]
- Control-NeRF: Editable Feature Volumes for Scene Rendering and ManipulationVerica Lazova, Vladimir Guzov, Kyle Olszewski, Sergey Tulyakov, Gerard Pons-Moll. 4329-4339 [doi]
- Split to Learn: Gradient Split for Multi-Task Human Image AnalysisWeijian Deng, Yumin Suh, Xiang Yu 0002, Masoud Faraki, Liang Zheng 0001, Manmohan Chandraker. 4340-4349 [doi]
- A Suspect Identification Framework using Contrastive Relevance FeedbackDevansh Gupta, Aditya Saini, Sarthak Bhagat, Shagun Uppal, Rishi Raj Jain, Drishti Bhasin, Ponnurangam Kumaraguru, Rajiv Ratn Shah. 4350-4358 [doi]
- Towards A Framework for Privacy-Preserving Pedestrian AnalysisAnil Kunchala, Mélanie Bouroche, Bianca Schoen-Phelan. 4359-4369 [doi]
- Guiding Visual Question Answering with Attention PriorsThao Minh Le, Vuong Le, Sunil Gupta 0001, Svetha Venkatesh, Truyen Tran 0001. 4370-4379 [doi]
- Grounding Scene Graphs on Natural Images via Visio-Lingual Message PassingAditay Tripathi, Anand Mishra 0001, Anirban Chakraborty. 4380-4389 [doi]
- K-VQG: Knowledge-aware Visual Question Generation for Common-sense AcquisitionKohei Uehara, Tatsuya Harada. 4390-4398 [doi]
- PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent AttentionZineng Tang, Jaemin Cho 0001, Jie Lei 0003, Mohit Bansal. 4399-4409 [doi]
- Text and Image Guided 3D Avatar Generation and ManipulationZehranaz Canfes, M. Furkan Atasoy, Alara Dirik, Pinar Yanardag. 4410-4420 [doi]
- More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text MatchingYuxiao Chen 0002, Jianbo Yuan, Long Zhao 0003, Tianlang Chen, Rui Luo, Larry Davis 0001, Dimitris N. Metaxas. 4421-4429 [doi]
- Watching the News: Towards VideoQA Models that can ReadSoumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar. 4430-4439 [doi]
- How to Practice VQA on a Resource-limited Target DomainMingda Zhang, Rebecca Hwa, Adriana Kovashka. 4440-4449 [doi]
- Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image GenerationZhihong Pan 0001, Xin Zhou, Hao Tian. 4450-4460 [doi]
- GarSim: Particle Based Neural Garment SimulatorLokender Tiwari, Brojeshwar Bhowmick. 4461-4470 [doi]
- IDD-3D: Indian Driving Dataset for 3D Unstructured Road ScenesShubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar. 4471-4480 [doi]
- Fast and Accurate: Video Enhancement Using Sparse DepthYu Feng 0007, Patrick Hansen, Paul N. Whatmough, Guoyu Lu, Yuhao Zhu 0001. 4481-4489 [doi]
- Complementary Bi-directional Feature Compression for Indoor 360° Semantic Segmentation with Self-distillationZishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Zhijie Shen, Yao Zhao 0001. 4490-4499 [doi]
- Overlap-guided Gaussian Mixture Models for Point Cloud RegistrationGuofeng Mei, Fabio Poiesi, Cristiano Saltori, Jian Zhang, Elisa Ricci 0001, Nicu Sebe. 4500-4509 [doi]
- 3D Neural Sculpting (3DNS): Editing Neural Signed Distance FunctionsPetros Tzathas, Petros Maragos, Anastasios Roussos. 4510-4519 [doi]
- Sim2real Transfer Learning for Point Cloud Segmentation: An Industrial Application Case on Autonomous DisassemblyChengzhi Wu, Xuelei Bi, Julius Pfrommer, Alexander Cebulla, Simon Mangold, Jürgen Beyerer. 4520-4529 [doi]
- Robustness of Trajectory Prediction Models Under Map-Based AttacksZhihao Zheng, Xiaowen Ying, Zhen Yao, Mooi Choo Chuah. 4530-4539 [doi]
- Inducing Data Amplification Using Auxiliary Datasets in Adversarial TrainingSaehyung Lee, Hyungyu Lee. 4540-4549 [doi]
- Certified Defense for Content Based Image RetrievalKazuya Kakizaki, Kazuto Fukuchi, Jun Sakuma. 4550-4559 [doi]
- Phantom Sponges: Exploiting Non-Maximum Suppression to Attack Deep Object DetectorsAvishag Shapira, Alon Zolfi, Luca Demetrio, Battista Biggio, Asaf Shabtai. 4560-4569 [doi]
- Explainability-Aware One Point Attack for Point Cloud Neural NetworksHanxiao Tan, Helena Kotthaus. 4570-4579 [doi]
- Image Completion with Heterogeneously Filtered Spectral HintsXingqian Xu, Shant Navasardyan, Vahram Tadevosyan, Andranik Sargsyan, Yadong Mu, Humphrey Shi. 4580-4590 [doi]
- Proactive Deepfake Defence via Identity WatermarkingYuan Zhao, Bo Liu 0001, Ming Ding 0001, Baoping Liu, Tianqing Zhu, Xin Yu. 4591-4600 [doi]
- Avoiding Lingering in Learning Active Recognition by Adversarial DisturbanceLei Fan, Ying Wu 0001. 4601-4610 [doi]
- DE-CROP: Data-efficient Certified Robustness for Pretrained ClassifiersGaurav Kumar Nayak, Ruchit Rawal, Anirban Chakraborty. 4611-4620 [doi]
- PatchZero: Defending against Adversarial Patch Attacks by Detecting and Zeroing the PatchKe Xu, Yao Xiao, Zhaoheng Zheng, Kaijie Cai, Ram Nevatia. 4621-4630 [doi]
- CFL-Net: Image Forgery Localization Using Contrastive LearningFahim Faisal Niloy, Kishor Kumar Bhaumik, Simon S. Woo. 4631-4640 [doi]
- Indirect Adversarial Losses via an Intermediate Distribution for Training GANsRui Yang, Duc Minh Vo, Hideki Nakayama. 4641-4650 [doi]
- Adversarial robustness in discontinuous spaces via alternating sampling & descentRahul Venkatesh, Eric Wong 0001, Zico Kolter. 4651-4660 [doi]
- RANCER: Non-Axis Aligned Anisotropic Certification with Randomized SmoothingTaras Rumezhak, Francisco Girbal Eiras, Philip H. S. Torr, Adel Bibi. 4661-4669 [doi]
- Adversarial local distribution regularization for knowledge distillationThanh Nguyen-Duc, Trung Le, He Zhao 0001, Jianfei Cai 0001, Dinh Phung 0001. 4670-4679 [doi]
- 2Net: Temporal Identity Inconsistency Network for Deepfake DetectionBaoping Liu, Bo Liu 0001, Ming Ding 0001, Tianqing Zhu, Xin Yu. 4680-4689 [doi]
- Interpreting Disparate Privacy-Utility Tradeoff in Adversarial Learning via Attribute CorrelationLikun Zhang, Yahong Chen, Ang Li 0005, Binghui Wang, Yiran Chen 0001, FengHua Li, Jin Cao, Ben Niu 0001. 4690-4698 [doi]
- Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial MotionShruti Agarwal, Liwen Hu, Evonne Ng, Trevor Darrell, Hao Li 0015, Anna Rohrbach. 4699-4708 [doi]
- Augmentation by Counterfactual Explanation -Fixing an Overconfident ClassifierSumedha Singla, Nihal Murali, Forough Arabshahi, Sofia Triantafyllou, Kayhan Batmanghelich. 4709-4719 [doi]
- Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANsEnis Simsar, Umut Kocasari, Ezgi Gülperi Er, Pinar Yanardag. 4720-4729 [doi]
- Visualizing Global Explanations of Point Cloud DNNsHanxiao Tan. 4730-4739 [doi]
- Revisiting Training-free NAS Metrics: An Efficient Training-based MethodTaojiannan Yang, Linjie Yang, Xiaojie Jin, Chen Chen. 4740-4749 [doi]
- Encouraging Disentangled and Convex Representation with Controllable Interpolation RegularizationYunhao Ge, Zhi Xu, Yao Xiao, Gan Xin, Yunkui Pang, Laurent Itti. 4750-4758 [doi]
- RNAS-MER: A Refined Neural Architecture Search with Hybrid Spatiotemporal Operations for Micro-Expression RecognitionMonu Verma, Priyanka Lubal, Santosh Kumar Vipparthi, Mohamed Abdel-Mottaleb. 4759-4768 [doi]
- Concept Correlation and Its Effects on Concept-Based ModelsLena Heidemann, Maureen Monnet, Karsten Roscher. 4769-4777 [doi]
- Graph-Based Self-Learning for Robust Person Re-identificationYuqiao Xian, Jinrui Yang, Fufu Yu, Jun Zhang, Xing Sun. 4778-4787 [doi]
- Hard to Track Objects with Irregular Motions and Similar Appearances? Make It Easier by Buffering the Matching SpaceFan Yang, Shigeyuki Odashima, Shoichi Masui, Shan Jiang 0006. 4788-4797 [doi]
- Back to MLP: A Simple Baseline for Human Motion PredictionWen Guo, Yuming Du, Xi Shen 0001, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer. 4798-4808 [doi]
- SAT: Scale-Augmented Transformer for Person SearchMustansar Fiaz, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan. 4809-4818 [doi]
- HOOT: Heavy Occlusions in Object Tracking BenchmarkGozde Sahin, Laurent Itti. 4819-4828 [doi]
- Relation Preserving Triplet Mining for Stabilising the Triplet Loss in Re-identification SystemsAdhiraj Ghosh, Kuruparan Shanmugalingam, Wen-Yan Lin. 4829-4838 [doi]
- Detection Recovery in Online Multi-Object Tracking with Sparse Graph TrackerJeongseok Hyun, Myunggu Kang, Dongyoon Wee, Dit-Yan Yeung. 4839-4848 [doi]
- MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking BenchmarkXiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang 0004, Peng Chu, Houdong Hu, Jiang Wang, Zicheng Liu 0001. 4849-4858 [doi]
- TransMOT: Spatial-Temporal Graph Transformer for Multiple Object TrackingPeng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu 0001. 4859-4869 [doi]
- Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identificationLuca Piano, Filippo Gabriele Pratticò, Alessandro Sebastian Russo, Lorenzo Lanari, Lia Morra, Fabrizio Lamberti. 4870-4880 [doi]
- Modeling the Lighting in Scenes as Style for Auto White-Balance CorrectionFurkan Kinli, Doga Yilmaz, Baris Özcan, Furkan Kiraç. 4892-4902 [doi]
- Real-Time Restoration of Dark Stereo ImagesMohit Lamba, M. V. A. Suhas Kumar, Kaushik Mitra. 4903-4913 [doi]
- LRA&LDRA: Rethinking Residual Predictions for Efficient Shadow Detection and RemovalMehmet Kerim Yücel, Valia Dimaridou, Bruno Manganelli, Mete Ozay, Anastasios Drosou, Albert Saà-Garriga. 4914-4924 [doi]
- Single Image Super-Resolution via a Dual Interactive Implicit Neural NetworkQuan H. Nguyen, William J. Beksi. 4925-4934 [doi]
- Joint Video Rolling Shutter Correction and Super-ResolutionAkash Gupta, Sudhir Kumar Singh, Amit K. Roy Chowdhury. 4935-4944 [doi]
- Enriched CNN-Transformer Feature Aggregation Networks for Super-ResolutionJinsu Yoo, Taehoon Kim, Sihaeng Lee, Seung Hwan Kim, Honglak Lee, Tae-Hyun Kim. 4945-4954 [doi]
- DSFormer: A Dual-domain Self-supervised Transformer for Accelerated Multi-contrast MRI ReconstructionBo Zhou 0009, Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Chi Liu, James S. Duncan, Michal Sofka. 4955-4964 [doi]
- RADIANT: Better rPPG estimation using signal embeddings and TransformerAnup Kumar Gupta, Rupesh Kumar, Lokendra Birla, Puneet Gupta 0002. 4965-4975 [doi]
- Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit ImbalancesAjay Jaiswal, Tianlong Chen, Justin F. Rousseau, Yifan Peng, Ying Ding 0001, Zhangyang Wang. 4976-4985 [doi]
- HoechstGAN: Virtual Lymphocyte Staining Using Generative Adversarial NetworksGeorg Wölflein, In Hwa Um, David J. Harrison, Ognjen Arandjelovic. 4986-4996 [doi]
- EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Cardiac MeasurementXin Liu 0061, Brian L. Hill, Ziheng Jiang, Shwetak N. Patel, Daniel McDuff. 4997-5006 [doi]
- Improving Deep Facial Phenotyping for Ultra-rare Disorder Verification Using Model EnsemblesAlexander Hustinx, Fabio Hellmann, Ömer Sümer, Behnam Javanmardi, Elisabeth André, Peter Krawitz, Tzung-Chien Hsieh. 5007-5017 [doi]
- ALPINE: Improving Remote Heart Rate Estimation using Contrastive LearningLokendra Birla, Sneha Shukla, Anup Kumar Gupta, Puneet Gupta 0002. 5018-5027 [doi]
- Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression PredictionYan Yang, Md. Zakir Hossain, Eric A. Stone, Shafin Rahman. 5028-5037 [doi]
- Enhanced Bi-directional Motion Estimation for Video Frame InterpolationXin Jin, Longhai Wu, Guotao Shen, Youxin Chen, Jie Chen, Jayoon Koo, Cheul-Hee Hahm. 5038-5046 [doi]
- Dance Style Transfer with Cross-modal TransformerWenjie Yin, Hang Yin, Kim Baraka, Danica Kragic, Mårten Björkman. 5047-5056 [doi]
- MFCFlow: A Motion Feature Compensated Multi-Frame Recurrent Network for Optical Flow EstimationYonghu Chen, Dongchen Zhu, Wenjun Shi, Guanghui Zhang, Tianyu Zhang, Xiaolin Zhang, Jiamao Li. 5057-5066 [doi]
- GlobalFlowNet: Video Stabilization using Deep Distilled Global Motion EstimatesJerin Geo James, Devansh Jain, Ajit Rajwade 0001. 5067-5076 [doi]
- Towards Equivariant Optical Flow Estimation with Deep LearningStefano Savian, Pietro Morerio, Alessio Del Bue, Andrea A. Janes, Tammam Tillo. 5077-5086 [doi]
- Skew-Robust Human-Object Interactions in VideosApoorva Agarwal, Rishabh Dabral, Arjun Jain, Ganesh Ramakrishnan. 5087-5096 [doi]
- Video joint denoising and demosaicing with recurrent CNNsValéry Dewil, Adrien Courtois, Mariano Rodríguez, Thibaud Ehret, Nicola Brandonisio, Denis Bujoreanu, Gabriele Facciolo, Pablo Arias 0001. 5097-5108 [doi]
- Video Object Matting via Hierarchical Space-Time Semantic GuidanceYumeng Wang, Bo Xu, Ziwen Li, Han Huang, Cheng Lu 0006, Yandong Guo. 5109-5118 [doi]
- Exploiting Long-Term Dependencies for Generating Dynamic Scene GraphsShengyu Feng, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar, Subarna Tripathi. 5119-5128 [doi]
- Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object SegmentationSuhwan Cho, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee. 5129-5138 [doi]
- DCVNet: Dilated Cost Volume Networks for Fast Optical FlowHuaizu Jiang, Erik G. Learned-Miller. 5139-5146 [doi]
- AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event LocalizationTanvir Mahmud, Diana Marculescu. 5147-5156 [doi]
- SeCo: Separating Unknown Musical Visual Sounds with Consistency GuidanceXinchi Zhou, Dongzhan Zhou, Wanli Ouyang, Hang Zhou, Di Hu 0001. 5157-5166 [doi]
- Audio-Visual Face ReenactmentMadhav Agarwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar. 5167-5176 [doi]
- LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream VideosJielin Qiu, Franck Dernoncourt, Trung Bui, Zhaowen Wang, Ding Zhao, Hailin Jin. 5177-5187 [doi]
- Exploiting Visual Context Semantics for Sound Source LocalizationXinchi Zhou, Dongzhan Zhou, Di Hu 0001, Hang Zhou, Wanli Ouyang. 5188-5197 [doi]
- Towards Generating Ultra-High Resolution Talking-Face Videos with Lip synchronizationAnchit Gupta, Rudrabha Mukhopadhyay, Sindhu Balachandra, Faizan Farooq Khan, Vinay P. Namboodiri, C. V. Jawahar. 5198-5207 [doi]
- Towards Disturbance-Free Visual Mobile ManipulationTianwei Ni, Kiana Ehsani, Luca Weihs, Jordi Salvador. 5208-5220 [doi]
- Unsupervised Audio-Visual Lecture SegmentationDarshan Singh S, Anchit Gupta, C. V. Jawahar, Makarand Tapaswi. 5221-5230 [doi]
- GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasksOnkar Susladkar, Gayatri Deshmukh, Dhruv Makwana, Sparsh Mittal, R. Sai Chandra Teja, Rekha Singhal. 5231-5240 [doi]
- MT-DETR: Robust End-to-end Multimodal Detection with Confidence FusionShih-Yun Chu, Ming-Sui Lee. 5241-5250 [doi]
- Hyperspherical Quantization: Toward Smaller and More Accurate ModelsDan Liu, Xi Chen, Chen Ma, Xue Liu. 5251-5261 [doi]
- Saliency Guided Experience Packing for Replay in Continual LearningGobinda Saha, Kaushik Roy 0001. 5262-5272 [doi]
- AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNsShiv Ram Dubey, Satish Kumar Singh, Bidyut Baran Chaudhuri. 5273-5282 [doi]
- Spike-Based Anytime PerceptionMatthew Dutson, Yin Li 0003, Mohit Gupta 0001. 5283-5293 [doi]
- Meta-OLE: Meta-learned Orthogonal Low-Rank EmbeddingZe Wang, Yue Lu, Qiang Qiu. 5294-5303 [doi]
- GEMS: Generating Efficient Meta-SubnetsVarad Pimpalkhute, Shruti Kunde, Rekha Singhal. 5304-5312 [doi]
- Serf: Towards better training of deep neural networks using log-Softplus ERror activation FunctionSayan Nag, Mayukh Bhattacharyya, Anuraag Mukherjee, Rohit Kundu. 5313-5322 [doi]
- Learning Latent Structural Relations with Message Passing PriorShaogang Ren, Hongliang Fei, Dingcheng Li, Ping Li 0001. 5323-5332 [doi]
- Bootstrapping the Relationship Between Images and Their Clean and Noisy LabelsBrandon Smart, Gustavo Carneiro. 5333-5343 [doi]
- Boosting neural video codecs by exploiting hierarchical redundancyReza Pourreza 0002, Hoang Le, Amir Said, Guillaume Sautière, Auke Wiggers. 5344-5353 [doi]
- A neural video codec with spatial rate-distortion controlNoor Fathima Ghouse, Jens Petersen, Guillaume Sautière, Auke Wiggers, Reza Pourreza 0002. 5354-5363 [doi]
- Burst Vision Using Single-Photon CamerasSizhuo Ma, Paul Mos, Edoardo Charbon, Mohit Gupta 0001. 5364-5374 [doi]
- Few-shot Object Detection via Improved Classification FeaturesXinyu Jiang, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Duoqian Miao. 5375-5384 [doi]
- EventPoint: Self-Supervised Interest Point Detection and Description for Event-based CameraZe Huang, Li Sun 0005, Cheng Zhao, Song Li, Songzhi Su. 5385-5394 [doi]
- Sim2RealVS: A New Benchmark for Video Stabilization with a Strong BaselineQi Rao, Xin Yu, Shant Navasardyan, Humphrey Shi. 5395-5404 [doi]
- Effective Invertible Arbitrary Image RescalingZhihong Pan 0001, Baopu Li, Dongliang He, Wenhao Wu, Errui Ding. 5405-5414 [doi]
- Self-Attention Message Passing for Contrastive Few-Shot LearningOjas Kishorkumar Shirekar, Anuj Singh, Hadi Jamali Rad. 5414-5425 [doi]
- One-Shot Doc Snippet Detection: Powering Search in Document Beyond TextAbhinav Java, Shripad Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy. 5426-5435 [doi]
- Semantic Guided Latent Parts Embedding for Few-Shot LearningFengyuan Yang, Ruiping Wang 0001, Xilin Chen 0001. 5436-5446 [doi]
- Event-based RGB sensing with structured lightSeyed Ehsan Marjani Bajestani, Giovanni Beltrame. 5447-5456 [doi]
- Discrete Cosin TransFormer: Image Modeling From Frequency DomainXinyu Li, Yanyi Zhang, Jianbo Yuan, Hanlin Lu, Yibo Zhu. 5457-5467 [doi]
- Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly TypesKihyuk Sohn, Jinsung Yoon, Chun-Liang Li, Chen-Yu Lee, Tomas Pfister. 5468-5479 [doi]
- Image-Consistent Detection of Road Anomalies as Unpredictable PatchesTomás Vojír, Jirí Matas. 5480-5489 [doi]
- GLAD: A Global-to-Local Anomaly DetectorAitor Artola, Yannis Kolodziej, Jean-Michel Morel, Thibaud Ehret. 5490-5499 [doi]
- No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly DetectionMohamed Yousef, Marcel Ackermann 0001, Unmesh Kurup, Tom E. Bishop. 5500-5509 [doi]
- Out-of-distribution Detection via Frequency-regularized Generative ModelsMu Cai, Yixuan Li. 5510-5519 [doi]
- Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained EnvironmentsJingyang Zhang, Nathan Inkawhich, Randolph Linderman, Yiran Chen 0001, Hai Li 0001. 5520-5529 [doi]
- DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection NetworkKamalakar Vijay Thakare, Yash Raghuwanshi, Debi Prosad Dogra, Heeseung Choi, Ig-Jae Kim. 5530-5539 [doi]
- Out-of-Distribution Detection with Reconstruction Error and Typicality-based PenaltyGenki Osada, Takahashi Tsubasa, Budrul Ahsan, Takashi Nishide. 5540-5552 [doi]
- Zero-shot versus Many-shot: Unsupervised Texture Anomaly DetectionToshimichi Aota, Lloyd Teh Tzer Tong, Takayuki Okatani. 5553-5561 [doi]
- ViewCLR: Learning Self-supervised Video Representation for Unseen ViewpointsSrijan Das, Michael S. Ryoo. 5562-5572 [doi]
- Progressive Video Summarization via Multimodal Self-supervised LearningHaopeng Li, Qiuhong Ke, Mingming Gong, Tom Drummond. 5573-5582 [doi]
- Self-Supervised Learning with Masked Image Modeling for Teeth Numbering, Detection of Dental Restorations, and Instance Segmentation in Dental Panoramic RadiographsAmani Almalki, Longin Jan Latecki. 5583-5592 [doi]
- Cooperative Self-Training for Multi-Target Adaptive Semantic SegmentationYangsong Zhang, Subhankar Roy, Hongtao Lu, Elisa Ricci 0001, Stéphane Lathuilière. 5593-5602 [doi]
- Self Supervised Low Dose Computed Tomography Image Denoising Using Invertible Network Exploiting Inter Slice CongruenceSutanu Bera, Prabir Kumar Biswas. 5603-5612 [doi]
- Self-supervised Learning with Local Contrastive Loss for Detection and Semantic SegmentationAshraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke. 5613-5622 [doi]
- Self-Supervised Clustering based on Manifold Learning and Graph Convolutional NetworksLeonardo Tadeu Lopes, Daniel Carlos Guimarães Pedronette. 5623-5632 [doi]
- Unifying Distribution Alignment as a Loss for Imbalanced Semi-supervised LearningJustin Lazarow, Kihyuk Sohn, Chen-Yu Lee, Chun-Liang Li, Zizhao Zhang, Tomas Pfister. 5633-5642 [doi]
- Accelerating Self-Supervised Learning via Efficient Training StrategiesMustafa Taha Koçyigit, Timothy M. Hospedales, Hakan Bilen. 5643-5653 [doi]
- ETR: An Efficient Transformer for Re-ranking in Visual Place RecognitionHao Zhang, Xin Chen, Heming Jing, Yingbin Zheng, Yuan Wu, Cheng Jin. 5654-5663 [doi]
- HandGCNFormer: A Novel Topology-Aware Transformer Network for 3D Hand Pose EstimationYintong Wang, Lili Chen, Jiamao Li, Xiaolin Zhang. 5664-5673 [doi]
- SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose EstimationGuowei Li, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Tianyu Zhang, Xiaolin Zhang, Jiamao Li. 5674-5683 [doi]
- Rethinking the Data Annotation Process for Multiview 3D Pose Estimation with Active Learning and Self-TrainingQi Feng, Kun He, He Wen, Cem Keskin, Yuting Ye. 5684-5693 [doi]
- Self-supervised Relative Pose with Homography Model-fitting in the LoopBruce R. Muller, William A. P. Smith. 5694-5703 [doi]
- HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave RadarShih-Po Lee, Niraj Prakash Kini, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang. 5704-5713 [doi]
- Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in VideosKyung-Min Jin, Byoung-Sung Lim, Gun Hee Lee,