- MVTN: Multi-View Transformation Network for 3D Shape Recognition. Abdullah Hamdi, Silvio Giancola, Bernard Ghanem. 1-11 [doi]
- GLiT: Neural Architecture Search for Global and Local Image Transformer. Boyu Chen, Peixia Li, Chuming Li, Baopu Li, Lei Bai 0001, Chen Lin 0003, Ming Sun 0008, Junjie Yan, Wanli Ouyang. 12-21 [doi]
- CvT: Introducing Convolutions to Vision Transformers. Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang 0001. 22-31 [doi]
- Going deeper with Image Transformers. Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou. 32-42 [doi]
- DTMNet: A Discrete Tchebichef Moments-based Deep Neural Network for Multi-focus Image Fusion. Bin Xiao 0002, Haifeng Wu, Xiuli Bi. 43-51 [doi]
- CrossNorm and SelfNorm for Generalization under Distribution Shifts. Zhiqiang Tang 0001, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li 0003, Dimitris N. Metaxas. 52-61 [doi]
- NGC: A Unified Framework for Learning with Open-World Noisy Data. Zhi-Fan Wu, Tong Wei 0001, Jianwen Jiang, Chaojie Mao, Mingqian Tang, Yu-Feng Li. 62-71 [doi]
- Learning with Noisy Labels via Sparse Regularization. Xiong Zhou, Xianming Liu, Chenyang Wang, Deming Zhai, Junjun Jiang, Xiangyang Ji. 72-81 [doi]
- Asymmetric Loss For Multi-Label Classification. Tal Ridnik, Emanuel Ben Baruch, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor. 82-91 [doi]
- Procrustean Training for Imbalanced Deep Learning. Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao. 92-102 [doi]
- Conditional Variational Capsule Network for Open Set Recognition. Yunrui Guo, Guglielmo Camporese, Wenjing Yang 0002, Alessandro Sperduti, Lamberto Ballan. 103-111 [doi]
- ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot. Jiarui Cai, Yizhou Wang 0005, Jenq-Neng Hwang. 112-121 [doi]
- FREE: Feature Refinement for Generalized Zero-Shot Learning. Shiming Chen 0002, Wenjie Wang, Beihao Xia, Qinmu Peng, Xinge You, Feng Zheng, Ling Shao 0001. 122-131 [doi]
- Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization. Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, LinLin Shen. 132-141 [doi]
- Z-Score Normalization, Hubness, and Few-Shot Learning. Nanyi Fei, Yizhao Gao, Zhiwu Lu 0001, Tao Xiang. 142-151 [doi]
- Spatio-Temporal Representation Factorization for Video-based Person Re-Identification. Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy Chowdhury, Ziyan Wu. 152-162 [doi]
- Transformer-based Dual Relation Graph for Multi-label Image Recognition. Jiawei Zhao, Ke Yan, Yifan Zhao, Xiaowei Guo, Feiyue Huang, Jia Li 0003. 163-172 [doi]
- Dance with Self-Attention: A New Look of Conditional Random Fields on Anomaly Detection in Videos. Didik Purwanto, Yie-Tarng Chen, Wen-Hsien Fang. 173-183 [doi]
- Residual Attention: A Simple but Effective Method for Multi-Label Recognition. Ke Zhu, Jianxin Wu. 184-193 [doi]
- Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond. Ming Li, Xinming Huang 0001, Ziming Zhang. 194-204 [doi]
- Heterogeneous Relational Complement for Vehicle Re-identification. Jiajian Zhao, Yifan Zhao, Jia Li 0003, Ke Yan, Yonghong Tian 0001. 205-214 [doi]
- Attack-Guided Perceptual Data Generation for Real-world Re-Identification. Yukun Huang, Xueyang Fu, Zheng-Jun Zha. 215-224 [doi]
- Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification. Ziyu Wei, Xi Yang 0011, Nannan Wang 0001, Xinbo Gao 0001. 225-234 [doi]
- Distilling Virtual Examples for Long-tailed Recognition. Yin-Yin He, Jianxin Wu, Xiu-Shen Wei. 235-244 [doi]
- Neural Photofit: Gaze-based Mental Image Reconstruction. Florian Strohm, Ekta Sood, Sven Mayer, Philipp Müller 0001, Mihai Bâce, Andreas Bulling. 245-254 [doi]
- When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes. Philipp Bomatter, Mengmi Zhang, Dimitar Karev, Spandan Madan, Claire Tseng, Gabriel Kreiman. 255-264 [doi]
- MAAS: Multi-modal Assignation for Active Speaker Detection. Juan León Alcázar, Fabian Caba Heilbron, Ali K. Thabet, Bernard Ghanem. 265-274 [doi]
- Move2Hear: Active Audio-Visual Source Separation. Sagnik Majumder, Ziad Al-Halah, Kristen Grauman. 275-285 [doi]
- Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis. Nikhil Singh 0003, Jeff Mentch, Jerry Ng, Matthew Beveridge, Iddo Drori. 286-295 [doi]
- Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro. 296-306 [doi]
- BN-NAS: Neural Architecture Search with Batch Normalization. Boyu Chen, Peixia Li, Baopu Li, Chen Lin 0003, Chuming Li, Ming Sun 0008, Junjie Yan, Wanli Ouyang. 307-316 [doi]
- Differentiable Dynamic Wirings for Neural Networks. Kun Yuan, Quanquan Li, Shaopeng Guo, Dapeng Chen, Aojun Zhou, Fengwei Yu, Ziwei Liu 0002. 317-326 [doi]
- AutoSpace: Neural Architecture Search with Less Human Interference. Daquan Zhou, Xiaojie Jin, Xiaochen Lian, Linjie Yang, Yujing Xue, Qibin Hou, Jiashi Feng. 327-336 [doi]
- Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition. Ming Lin, Pichao Wang, Zhenhong Sun, Hesen Chen, Xiuyu Sun, Qi Qian 0001, Hao Li 0030, Rong Jin 0001. 337-346 [doi]
- CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification. Chun-Fu (Richard) Chen, Quanfu Fan, Rameswar Panda. 347-356 [doi]
- Conformer: Local Features Coupling Global Representations for Visual Recognition. Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye. 357-366 [doi]
- Scalable Vision Transformers with Hierarchical Pooling. Zizheng Pan, Bohan Zhuang, Jing Liu, Haoyu He, Jianfei Cai 0001. 367-376 [doi]
- Vision Transformer with Progressive Sampling. Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip H. S. Torr, Wayne Zhang, Dahua Lin. 377-386 [doi]
- Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers. Hila Chefer, Shir Gur, Lior Wolf. 387-396 [doi]
- Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views. Xin Wei, Yifei Gong, Fudong Wang, Xing Sun, Jian Sun. 397-406 [doi]
- MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection. Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao. 407-417 [doi]
- Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. Xiaoshi Wu, Hadar Averbuch-Elor, Jin Sun, Noah Snavely. 418-427 [doi]
- Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction. Bo Xu, Han Huang, Cheng Lu, Ziwen Li, Yandong Guo. 428-437 [doi]
- An Asynchronous Kalman Filter for Hybrid Event Cameras. Ziwei Wang, Yonhon Ng, Cedric Scheerlinck, Robert E. Mahony. 438-447 [doi]
- Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain. Guangyao Chen, Peixi Peng, Li Ma, Jia Li 0003, Lin Du, Yonghong Tian 0001. 448-457 [doi]
- MicroNet: Improving Image Recognition with Extremely Low FLOPs. Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen 0001, Mengchen Liu, Lu Yuan, Zicheng Liu 0001, Lei Zhang 0001, Nuno Vasconcelos. 458-467 [doi]
- Group-wise Inhibition based Feature Regularization for Robust Classification. Haozhe Liu, Haoqian Wu, Weicheng Xie 0001, Feng Liu 0013, LinLin Shen. 468-476 [doi]
- Exploration and Estimation for Model Compression. Yanfu Zhang, Shangqian Gao, Heng Huang. 477-486 [doi]
- Learning to Resize Images for Computer Vision Tasks. Hossein Talebi, Peyman Milanfar. 487-496 [doi]
- Learning Meta-class Memory for Few-Shot Semantic Segmentation. Zhonghua Wu, Xiangxi Shi, Guosheng Lin, Jianfei Cai 0001. 497-506 [doi]
- Aggregation with Feature Detection. Shuyang Sun, Xiaoyu Yue, Xiaojuan Qi, Wanli Ouyang, Victor Prisacariu, Philip H. S. Torr. 507-516 [doi]
- Continual Learning on Noisy Data Streams via Self-Purified Replay. Chris Dongjoo Kim, Jinseo Jeong, Sangwoo Moon, Gunhee Kim. 517-527 [doi]
- Point Cloud Augmentation with Weighted Local Transformations. Sihyeon Kim, Sanghyeok Lee, Dasol Hwang, Jaewon Lee, Seong Jae Hwang, Hyunwoo J. Kim. 528-537 [doi]
- Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet. Li Yuan 0007, Yunpeng Chen, Tao Wang, Weihao Yu, Yujun Shi, Zihang Jiang, Francis E. H. Tay, Jiashi Feng, Shuicheng Yan. 538-547 [doi]
- Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. Wenhai Wang, Enze Xie, Xiang Li 0028, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo 0002, Ling Shao 0001. 548-558 [doi]
- Incorporating Convolution Designs into Visual Transformers. Kun Yuan, Shaopeng Guo, Ziwei Liu 0002, Aojun Zhou, Fengwei Yu, Wei Wu. 559-568 [doi]
- Visformer: The Vision-friendly Transformer. Zhengsu Chen, Lingxi Xie, Jianwei Niu 0002, Xuefeng Liu 0001, Longhui Wei, Qi Tian 0001. 569-578 [doi]
- Visual Transformers: Where Do Transformers Really Belong in Vision Models? Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph Gonzalez 0001, Kurt Keutzer, Peter Vajda. 579-589 [doi]
- Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data. Xuhui Jia, Kai Han, Yukun Zhu, Bradley Green. 590-599 [doi]
- Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-grained Recognition. Shaoli Huang, Xinchao Wang, Dacheng Tao. 600-609 [doi]
- Self Supervision to Distillation for Long-Tailed Visual Recognition. Tianhao Li, Limin Wang 0002, Gangshan Wu. 610-619 [doi]
- Semantic Diversity Learning for Zero-Shot Multi-label Classification. Avi Ben-Cohen, Nadav Zamir, Emanuel Ben Baruch, Itamar Friedman, Lihi Zelnik-Manor. 620-630 [doi]
- Shallow Bayesian Meta Learning for Real-World Few-Shot Recognition. Xueting Zhang, Debin Meng, Henry Gouk, Timothy M. Hospedales. 631-640 [doi]
- Adversarial Attacks are Reversible with Natural Supervision. Chengzhi Mao, Mia Chiquier, Hao Wang 0014, Junfeng Yang, Carl Vondrick. 641-651 [doi]
- Architecture Disentanglement for Deep Neural Networks. Jie Hu 0018, Liujuan Cao, Tong Tong, Qixiang Ye, Shengchuan Zhang, Ke Li, Feiyue Huang, Ling Shao 0001, Rongrong Ji. 652-661 [doi]
- Exploiting Explanations for Model Inversion Attacks. Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, Brian Y. Lim. 662-672 [doi]
- Explaining in Style: Training a GAN to explain a classifier in StyleSpace. Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri. 673-682 [doi]
- Ground-truth or DAER: Selective Re-query of Secondary Information. Stephan J. Lemmer, Jason J. Corso. 683-694 [doi]
- Parametric Contrastive Learning. Jiequan Cui, Zhisheng Zhong, Shu Liu 0005, Bei Yu 0001, Jiaya Jia. 695-704 [doi]
- Learning Fast Sample Re-weighting Without Reward Data. Zizhao Zhang, Tomas Pfister. 705-714 [doi]
- Influence-Balanced Loss for Imbalanced Visual Classification. Seulki Park, Jongin Lim, Younghan Jeon, Jin Young Choi 0002. 715-724 [doi]
- Statistically Consistent Saliency Estimation. Shunyan Luo, Emre Barut, Fang Jin. 725-733 [doi]
- Contrastive Multimodal Fusion with TupleInfoNCE. Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong 0003, Thomas A. Funkhouser, Li Yi. 734-743 [doi]
- Recursively Conditional Gaussian for Ordinal Unsupervised Domain Adaptation. Xiaofeng Liu 0001, Site Li, Yubin Ge, Pengyi Ye, Jane You, Jun Lu. 744-753 [doi]
- TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation. Samuel G. Müller, Frank Hutter. 754-762 [doi]
- FcaNet: Frequency Channel Attention Networks. Zequn Qin, Pengyi Zhang, Fei Wu, Xi Li 0001. 763-772 [doi]
- Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs. Md. Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce. 773-781 [doi]
- Neural Video Portrait Relighting in Real-time via Consistency Modeling. Longwen Zhang, Qixuan Zhang, Minye Wu, Jingyi Yu, Lan Xu. 782-792 [doi]
- OpenGAN: Open-Set Recognition via Open Data Generation. Shu Kong, Deva Ramanan. 793-802 [doi]
- MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks. Alexandre Ramé, Rémy Sun, Matthieu Cord. 803-813 [doi]
- Learning to Diversify for Single Domain Generalization. Zijian Wang, Yadan Luo, Ruihong Qiu, Zi Huang, Mahsa Baktashmotlagh. 814-823 [doi]
- SS-IL: Separated Softmax for Incremental Learning. Hongjoon Ahn, Jihwan Kwak, Subin Lim, Hyeonsu Bang, Hyojun Kim, Taesup Moon. 824-833 [doi]
- Multimodal Knowledge Expansion. Zihui Xue, Sucheng Ren, Zhengqi Gao, Hang Zhao. 834-843 [doi]
- FaPN: Feature-aligned Pyramid Network for Dense Image Prediction. Shihua Huang, Zhichao Lu, Ran Cheng, Cheng He 0001. 844-853 [doi]
- Grafit: Learning fine-grained image representations with coarse labels. Hugo Touvron, Alexandre Sablayrolles, Matthijs Douze, Matthieu Cord, Hervé Jégou. 854-864 [doi]
- Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition. Guohao Peng, Jun Zhang 0042, Heshan Li, Danwei Wang. 865-874 [doi]
- Interpretable Image Recognition by Constructing Transparent Embedding Space. Jiaqi Wang, Huafeng Liu 0001, Xinyue Wang, Liping Jing. 875-884 [doi]
- Generating Attribution Maps with Disentangled Masked Backpropagation. Adria Ruiz, Antonio Agudo, Francesc Moreno-Noguer. 885-894 [doi]
- Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis. Tiange Xiang, Chaoyi Zhang, Yang Song 0001, Jianhui Yu, Weidong Cai 0001. 895-904 [doi]
- End-to-End Trainable Trident Person Search Network Using Adaptive Gradient Propagation. Byeong-Ju Han, Kuhyeun Ko, Jae-Young Sim. 905-913 [doi]
- Graph-based Asynchronous Event Processing for Rapid Object Recognition. Yijin Li, Han Zhou, Bangbang Yang, Ye Zhang, Zhaopeng Cui, Hujun Bao, Guofeng Zhang 0001. 914-923 [doi]
- Parsing Table Structures in the Wild. Rujiao Long, Wen Wang, Nan Xue 0001, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia. 924-932 [doi]
- SketchLattice: Latticed Representation for Sketch Manipulation. Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song. 933-941 [doi]
- Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition. Jian Jia, Xiaotang Chen, Kaiqi Huang. 942-951 [doi]
- Detecting Persuasive Atypicality by Modeling Contextual Compatibility. Meiqi Guo, Rebecca Hwa, Adriana Kovashka. 952-962 [doi]
- Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation. Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Yi-Zhe Song. 963-972 [doi]
- DocFormer: End-to-End Transformer for Document Understanding. Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha. 973-983 [doi]
- LayoutTransformer: Layout Generation and Completion with Self-attention. Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry Davis 0001, Vijay Mahadevan, Abhinav Shrivastava. 984-994 [doi]
- Effectively Leveraging Attributes for Visual Similarity. Samarth Mishra, Zhongping Zhang, Yuan Shen, Ranjitha Kumar, Venkatesh Saligrama, Bryan A. Plummer. 995-1004 [doi]
- Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification. Yongming Rao, Guangyi Chen 0002, Jiwen Lu, Jie Zhou 0001. 1005-1014 [doi]
- Learning Canonical 3D Object Representation for Fine-Grained Recognition. Sunghun Joung, Seungryong Kim, Minsu Kim, Ig-Jae Kim, Kwanghoon Sohn. 1015-1025 [doi]
- SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition. LiangZhi Li, Bowen Wang, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara. 1026-1035 [doi]
- Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations. Pau Rodríguez, Massimo Caccia, Alexandre Lacoste, Lee Zamparo, Issam H. Laradji, Laurent Charlin, David Vázquez 0001. 1036-1045 [doi]
- From Culture to Clothing: Discovering the World Events Behind A Century of Fashion Images. Wei-Lin Hsiao, Kristen Grauman. 1046-1055 [doi]
- De-rendering Stylized Texts. Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi. 1056-1065 [doi]
- Handwriting Transformers. Ankan Kumar Bhunia, Salman H. Khan 0001, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Mubarak Shah. 1066-1074 [doi]
- Interpreting Attributions and Interactions of Adversarial Attacks. Xin Wang 0108, Shuyun Lin, Hao Zhang, Yufei Zhu, Quanshi Zhang. 1075-1084 [doi]
- The Right to Talk: An Audio-Visual Transformer Approach. Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang-Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu. 1085-1094 [doi]
- Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling? Yue Song, Nicu Sebe, Wei Wang 0108. 1095-1103 [doi]
- Striking a Balance between Stability and Plasticity for Class-Incremental Learning. Guile Wu, Shaogang Gong, Pan Li. 1104-1113 [doi]
- Predicting with Confidence on Unseen Distributions. Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell, Ludwig Schmidt. 1114-1124 [doi]
- Transforms based Tensor Robust PCA: Corrupted Low-Rank Tensors Recovery via Convex Optimization. Canyi Lu. 1125-1132 [doi]
- CODEs: Chamfer Out-of-Distribution Examples against Overconfidence Issue. Keke Tang, Dingruibo Miao, Weilong Peng, Jianpeng Wu, Yawen Shi, Zhaoquan Gu, Zhihong Tian, Wenping Wang. 1133-1142 [doi]
- IDARTS: Interactive Differentiable Architecture Search. Song Xue, Runqi Wang, Baochang Zhang 0001, Tian Wang, Guodong Guo, David S. Doermann. 1143-1152 [doi]
- MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement. Alexander Richard, Michael Zollhöfer, YanDong Wen, Fernando De la Torre, Yaser Sheikh. 1153-1162 [doi]
- Audio-Visual Floorplan Reconstruction. Senthil Purushwalkam, Sebastia Vicenc Amengual Gari, Vamsi Krishna Ithapu, Carl Schissler, Philip Robinson, Abhinav Gupta 0001, Kristen Grauman. 1163-1172 [doi]
- How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild. Okan Köpüklü, Maja Taseska, Gerhard Rigoll. 1173-1183 [doi]
- Visual Scene Graphs for Audio Source Separation. Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja, Anoop Cherian. 1184-1193 [doi]
- Better Aggregation in Test-Time Augmentation. Divya Shanmugam, Davis W. Blalock, Guha Balakrishnan, John V. Guttag. 1194-1203 [doi]
- Explaining Local, Global, And Higher-Order Interactions In Deep Learning. Samuel Lerman, Charles Venuto, Henry A. Kautz, Chenliang Xu. 1204-1213 [doi]
- Explanations for Occluded Images. Hana Chockler, Daniel Kroening, Youcheng Sun. 1214-1223 [doi]
- e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks. Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz. 1224-1234 [doi]
- Broaden Your Views for Self-Supervised Video Learning. Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Pătrăucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron Van Den Oord, Andrew Zisserman. 1235-1245 [doi]
- Hypergraph Neural Networks for Hypergraph Matching. Xiaowei Liao, Yong Xu, Haibin Ling. 1246-1255 [doi]
- Embed Me If You Can: A Geometric Perceptron. Pavlo Melnyk, Michael Felsberg, Mårten Wadenbäck. 1256-1264 [doi]
- Learning to Discover Reflection Symmetry via Polar Matching Convolution. Ahyun Seo, Woohyeon Shim, Minsu Cho. 1265-1274 [doi]
- TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition. Wenyuan Xue, Baosheng Yu, Wen Wang, Dacheng Tao, Qingyong Li. 1275-1284 [doi]
- Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection. Shi-Xue Zhang, Xiaobin Zhu 0001, Chun Yang, Hongfa Wang, Xu-Cheng Yin. 1285-1294 [doi]
- Shape-Biased Domain Generalization via Shock Graph Embeddings. Maruthi Narayanan, Vickram Rajendran, Benjamin B. Kimia. 1295-1305 [doi]
- Towards Learning Spatially Discriminative Feature Representations. Chaofei Wang, Jiayu Xiao, Yizeng Han, Qisen Yang, Shiji Song, Gao Huang. 1306-1315 [doi]
- Towards Better Explanations of Class Activation Mapping. Hyungsik Jung, Youngrock Oh. 1316-1324 [doi]
- Finding Representative Interpretations on Convolutional Neural Networks. Peter Cho-Ho Lam, Lingyang Chu, Maxim Torgonskiy, Jian Pei, Yong Zhang, Lanjun Wang. 1325-1334 [doi]
- LFI-CAM: Learning Feature Importance for Better Visual Explanation. Kwang Hee Lee, Chaewon Park, Junghyun Oh, Nojun Kwak. 1335-1343 [doi]
- Panoptic Narrative Grounding. Cristina González, Nicolás Ayobi, Isabela Hernández, José Hernández, Jordi Pont-Tuset, Pablo Arbeláez. 1344-1353 [doi]
- Who's Waldo? Linking People Across Text and Images. Claire Yuqing Cui, Apoorv Khandelwal 0001, Yoav Artzi, Noah Snavely, Hadar Averbuch-Elor. 1354-1364 [doi]
- YouRefIt: Embodied Reference Understanding with Language and Gesture. Yixin Chen, Qing Li 0003, Deqian Kong, Yik Lun Kei, Song Chun Zhu, Tao Gao, Yixin Zhu, Siyuan Huang. 1365-1375 [doi]
- Synthesis of Compositional Animations from Textual Descriptions. Anindita Ghosh, Noshaba Cheema, Cennet Oguz, Christian Theobalt, Philipp Slusallek. 1376-1386 [doi]
- In Defense of Scene Graphs for Image Captioning. Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen. 1387-1396 [doi]
- Unshuffling Data for Improved Generalization in Visual Question Answering. Damien Teney, Ehsan Abbasnejad, Anton van den Hengel. 1397-1407 [doi]
- Compressing Visual-linguistic Model via Knowledge Distillation. Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu 0006, Lijuan Wang, Yezhou Yang, Zicheng Liu 0001. 1408-1418 [doi]
- UniT: Multimodal Multitask Learning with a Unified Transformer. Ronghang Hu, Amanpreet Singh. 1419-1429 [doi]
- CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations. Mohammadreza Zolfaghari, Yi Zhu, Peter V. Gehler, Thomas Brox. 1430-1439 [doi]
- Graph Constrained Data Representation Learning for Human Motion Segmentation. Mariella Dimiccoli, Lluís Garrido, Guillem Rodríguez Corominas, Herwig Wendt. 1440-1449 [doi]
- Zero-shot Natural Language Video Localization. Jinwoo Nam, Daechul Ahn, Dongyeop Kang, Seong Jong Ha, Jonghyun Choi. 1450-1459 [doi]
- Learning Temporal Dynamics from Cycles in Narrated Video. Dave Epstein, Jiajun Wu 0001, Cordelia Schmid, Chen Sun 0002. 1460-1469 [doi]
- Dense Interaction Learning for Video-based Person Re-identification. Tianyu He, Xin Jin, Xu Shen, Jianqiang Huang, Zhibo Chen 0001, Xian-Sheng Hua 0001. 1470-1481 [doi]
- 2CLR: Video and Image for Visual Contrastive Learning of Representation. Ali Diba, Vivek Sharma 0001, Reza Safdari, Dariush Lotfi, M. Saquib Sarfraz, Rainer Stiefelhagen, Luc Van Gool. 1482-1492 [doi]
- MGSampler: An Explainable Sampling Strategy for Video Action Recognition. Yuan Zhi, Zhan Tong, Limin Wang 0002, Gangshan Wu. 1493-1502 [doi]
- Fast Video Moment Retrieval. Junyu Gao, Changsheng Xu. 1503-1512 [doi]
- STVGBert: A Visual-linguistic Transformer based Framework for Spatio-temporal Video Grounding. Rui Su, Qian Yu, Dong Xu 0001. 1513-1522 [doi]
- Motion Guided Region Message Passing for Video Captioning. Shaoxiang Chen 0001, Yu-Gang Jiang. 1523-1532 [doi]
- Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection. Miao Zhang, Jie Liu, Yifei Wang, Yongri Piao, Shunyu Yao, Wei Ji, Jingjing Li, Huchuan Lu, Zhongxuan Luo. 1533-1543 [doi]
- Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation. Shu Yang, Lu Zhang 0053, Jinqing Qi, Huchuan Lu, Shuo Wang, Xiaoxing Zhang. 1544-1553 [doi]
- Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering. Corentin Dancette, Rémi Cadène, Damien Teney, Matthieu Cord. 1554-1563 [doi]
- Greedy Gradient Ensemble for Robust Visual Question Answering. Xinzhe Han, Shuhui Wang, Chi Su, Qingming Huang, Qi Tian 0001. 1564-1573 [doi]
- Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation. Yi Zhu 0004, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao. 1574-1583 [doi]
- Contrast and Classify: Training Robust VQA Models. Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal. 1584-1593 [doi]
- Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering. Qingxing Cao, Wentao Wan, Keze Wang, Xiaodan Liang, Liang Lin. 1594-1603 [doi]
- LapsCore: Language-guided Person Search via Color Reasoning. Yushuang Wu, Zizheng Yan, Xiaoguang Han, Guanbin Li, Changqing Zou, Shuguang Cui. 1604-1613 [doi]
- Airbert: In-domain Pretraining for Vision-and-Language Navigation. Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid. 1614-1623 [doi]
- Vision-Language Navigation with Random Environmental Mixup. Chong Liu 0002, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, ZongYuan Ge, Yi-Dong Shen. 1624-1634 [doi]
- The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang 0001, Anton van den Hengel, Qi Wu 0001. 1635-1644 [doi]
- VLGrammar: Grounded Grammar Induction of Vision and Language. Yining Hong, Qing Li 0003, Song Chun Zhu, Siyuan Huang. 1645-1654 [doi]
- Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments. Difei Gao, Ruiping Wang 0001, Ziyi Bai, Xilin Chen 0001. 1655-1665 [doi]
- Just Ask: Learning to Answer Questions from Millions of Narrated Videos. Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid. 1666-1677 [doi]
- HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering. Fei Liu, Jing Liu 0001, Weining Wang, Hanqing Lu. 1678-1687 [doi]
- Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature. Nayoung Kim, Seong Jong Ha, Je-Won Kang. 1688-1697 [doi]
- Multiple Pairwise Ranking Networks for Personalized Video Summarization. Yassir Saquil, Da Chen, Yuan He, Chuan Li, Yong-Liang Yang. 1698-1707 [doi]
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman. 1708-1718 [doi]
- Video Instance Segmentation with a Propose-Reduce Paradigm. Huaijia Lin, Ruizheng Wu, Shu Liu 0005, Jiangbo Lu, Jiaya Jia. 1719-1728 [doi]
- Deep 3D Mask Volume for View Synthesis of Dynamic Scenes. Kai-En Lin, Lei Xiao, Feng Liu 0015, Guowei Yang, Ravi Ramamoorthi. 1729-1738 [doi]
- Unsupervised Deep Video Denoising. Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda. 1739-1748 [doi]
- TransVG: End-to-End Visual Grounding with Transformers. Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, Houqiang Li. 1749-1759 [doi]
- MDETR - Modulated Detection for End-to-End Multi-Modal Understanding. Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion. 1760-1770 [doi]
- InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring. Zhihao Yuan, Xu Yan 0014, Yinghong Liao, Ruimao Zhang, Sheng Wang 0001, Zhen Li 0026, Shuguang Cui. 1771-1780 [doi]
- Detector-Free Weakly Supervised Grounding by Separation. Assaf Arbelle, Sivan Doveh, Amit Alfassy, Joseph Shtok, Guy Lev, Eli Schwartz, Hilde Kuehne, Hila Barak Levi, Prasanna Sattigeri, Rameswar Panda, Chun-Fu Chen 0001, Alex M. Bronstein, Kate Saenko, Shimon Ullman, Raja Giryes, Rogério Feris, Leonid Karlinsky. 1781-1792 [doi]
- Wasserstein Coupled Graph Learning for Cross-Modal Retrieval. Yun Wang, Tong Zhang 0021, Xueya Zhang, Zhen Cui 0001, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang 0003. 1793-1802 [doi]
- Learning to Generate Scene Graph from Natural Language Supervision. Yiwu Zhong, Jing Shi 0005, Jianwei Yang, Chenliang Xu, Yin Li 0003. 1803-1814 [doi]
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query. Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun. 1815-1824 [doi]
- Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions. Shuang Li, Yilun Du, Antonio Torralba 0001, Josef Sivic, Bryan C. Russell. 1825-1835 [doi]
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding. Zhengyuan Yang, Songyang Zhang, Liwei Wang 0009, Jiebo Luo. 1836-1846 [doi]
- Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference. Juncheng Li 0006, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu 0001, Yi Yang 0001, Yueting Zhuang. 1847-1857 [doi]
- Interpretable Visual Reasoning via Induced Symbolic Space. Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong, Wen-mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi. 1858-1867 [doi]
- Factorizing Perception and Policy for Interactive Instruction Following. Kunal Pratap Singh, Suvaansh Bhambri, Byeonghwi Kim, Roozbeh Mottaghi, Jonghyun Choi. 1868-1877 [doi]
- Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue. Shoya Matsumori, Kosuke Shingyouchi, Yuki Abe 0002, Yosuke Fukuchi, Komei Sugiura, Michita Imai. 1878-1887 [doi]
- Weakly Supervised Relative Spatial Reasoning for Visual Question Answering. Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral. 1888-1898 [doi]
- Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives. Ben Saunders, Necati Cihan Camgöz, Richard Bowden. 1899-1909 [doi]
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization. Kranthi Kumar Rachavarapu, Aakanksha, Vignesh Sundaresha, A. N. Rajagopalan 0001. 1910-1919 [doi]
- Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting. Shijie Li, Yanying Zhou, Jinhui Yi, Juergen Gall. 1920-1929 [doi]
- T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning. Zhen Zhong, Guobao Xiao, Linxin Zheng, Yan Lu, Jiayi Ma 0001. 1930-1939 [doi]
- IntraTomo: Self-supervised Learning-based Tomography via Sinogram Synthesis and Prediction. Guangming Zang, Ramzi Idoughi, Rui Li 0054, Peter Wonka, Wolfgang Heidrich. 1940-1950 [doi]
- Describing and Localizing Multiple Changes with Transformers. Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki 0006, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh. 1951-1960 [doi]
- Cross-Camera Convolutional Color Constancy. Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, Francois Bleibel. 1961-1970 [doi]
- IICNet: A Generic Framework for Reversible Image Conversion. Ka Leong Cheng, Yueqi Xie, Qifeng Chen. 1971-1980 [doi]
- Dual-Camera Super-Resolution with Aligned Attention Modules. Tengfei Wang 0002, Jiaxin Xie, Wenxiu Sun, Qiong Yan, Qifeng Chen. 1981-1990 [doi]
- Let's See Clearly: Contaminant Artifact Removal for Moving Cameras. Xiaoyu Li, Bo Zhang 0025, Jing Liao 0001, Pedro V. Sander. 1991-2000 [doi]
- Explainable Video Entailment with Grounded Visual Evidence. Junwen Chen, Yu Kong. 2001-2010 [doi]
- Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos. Heeseung Yun, Youngjae Yu, Wonsuk Yang, Kangil Lee, Gunhee Kim. 2011-2021 [doi]
- Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models. Linjie Li, Jie Lei, Zhe Gan, Jingjing Liu 0001. 2022-2031 [doi]
- AESOP: Abstract Encoding of Stories, Objects, and Pictures. Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia. 2032-2043 [doi]
- On the hidden treasure of dialog in video question answering. Deniz Engin, François Schnitzler, Ngoc Q. K. Duong, Yannis Avrithis. 2044-2053 [doi]
- TRAR: Routing the Attention Spans in Transformer for Visual Question Answering. Yiyi Zhou, Tianhe Ren, Chaoyang Zhu, Xiaoshuai Sun, Jianzhuang Liu, Xinghao Ding, Mingliang Xu, Rongrong Ji. 2054-2064 [doi]
- StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery. Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski. 2065-2074 [doi]
- Viewpoint-Agnostic Change Captioning with Cycle Consistency. Hoeseong Kim, Jongseok Kim, Hyungseok Lee, Hyunsung Park, Gunhee Kim. 2075-2084 [doi]
- *Rui Li, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Dong Liu 0002, Tao Mei 0001. 2085-2094 [doi]
- Language-Guided Global Image Editing via Cross-Modal Cyclic MechanismWentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin 0002, Si Liu 0001. 2095-2104 [doi]
- Image Retrieval on Real-life Images with Pre-trained Vision-and-Language ModelsZheyuan Liu 0002, Cristian Rodriguez Opazo, Damien Teney, Stephen Gould. 2105-2114 [doi]
- Dual Transfer Learning for Event-based End-task Prediction via Pluggable Event to Image TranslationLin Wang 0025, Yujeong Chae, Kuk-Jin Yoon. 2115-2125 [doi]
- N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event CamerasJunho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim 0001. 2126-2136 [doi]
- Patch Craft: Video Denoising by Deep Modeling and Patch MatchingGregory Vaksman, Michael Elad, Peyman Milanfar. 2137-2146 [doi]
- LocTex: Learning Data-Efficient Visual Representations from Localized Textual SupervisionZhijian Liu, Simon Stent, Jie Li, John Gideon, Song Han 0003. 2147-2156 [doi]
- Hierarchical Graph Attention Network for Few-shot Visual-Semantic LearningChengxiang Yin 0001, Kun Wu, Zhengping Che, Bo Jiang, Zhiyuan Xu, Jian Tang 0008. 2157-2166 [doi]
- Partial Off-policy Learning: Balance Accuracy and Diversity for Human-Oriented Image CaptioningJiahe Shi, Yali Li, Shengjin Wang. 2167-2176 [doi]
- Auto-Parsing Network for Image Captioning and Visual Question AnsweringXu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai 0001. 2177-2187 [doi]
- COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language RepresentationKeyu Wen, Jin Xia, Yuanyuan Huang, Linyang Li, Jiayan Xu, Jie Shao. 2188-2197 [doi]
- Adversarial Attack on Deep Cross-Modal Hamming RetrievalChao Li, Shangqian Gao, Cheng Deng, Wei Liu 0005, Heng Huang. 2198-2207 [doi]
- Defocus Map Estimation and Deblurring from a Single Dual-Pixel ImageShumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, Rahul Garg. 2208-2218 [doi]
- How to Train Neural Networks for Flare RemovalYicheng Wu, Qiurui He 0001, Tianfan Xue, Rahul Garg, Jiawen Chen, Ashok Veeraraghavan, Jonathan T. Barron. 2219-2227 [doi]
- Hyperspectral Image Denoising with Realistic DataTao Zhang, Ying Fu 0001, Cheng Li. 2228-2237 [doi]
- Dynamic CT Reconstruction from Limited Views with Implicit Neural Representations and Parametric Motion FieldsAlbert W. Reed, HyoJin Kim, Rushil Anirudh, K. Aditya Mohan, Kyle Champley, Jingu Kang, Suren Jayasuriya. 2238-2248 [doi]
- High Quality Disparity Remapping with Two-Stage WarpingBing Li 0024, Chia-Wen Lin, Cheng Zheng 0003, Shan Liu 0001, Junsong Yuan, Bernard Ghanem, C. C. Jay Kuo. 2249-2258 [doi]
- Semantic-embedded Unsupervised Spectral Reconstruction from Single RGB Images in the WildZhiyu Zhu, Hui Liu 0032, Junhui Hou, Huanqiang Zeng, Qingfu Zhang 0001. 2259-2268 [doi]
- Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel DataAbdullah Abuolaim, Mauricio Delbracio, Damien Kelly, Michael S. Brown, Peyman Milanfar. 2269-2278 [doi]
- Hybrid Neural Fusion for Full-frame Video StabilizationYu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang 0001, Yung-Yu Chuang, Jia-Bin Huang. 2279-2288 [doi]
- Spatially-Adaptive Image Restoration using Distortion-Guided NetworksKuldeep Purohit, Maitreya Suin, A. N. Rajagopalan 0001, Vishnu Naresh Boddeti. 2289-2299 [doi]
- Anonymizing Egocentric VideosDaksh Thapar, Aditya Nigam, Chetan Arora 0001. 2300-2309 [doi]
- What You Can Learn by Staring at a Blank WallPrafull Sharma, Miika Aittala, Yoav Y. Schechner, Antonio Torralba 0001, Gregory W. Wornell, William T. Freeman, Frédo Durand. 2310-2319 [doi]
- Inference of Black Hole Fluid-Dynamics from Sparse Interferometric MeasurementsAviad Levis, Daeyoung Lee, Joel A. Tropp, Charles F. Gammie, Katherine L. Bouman. 2320-2329 [doi]
- C2N: Practical Generative Noise Modeling for Real-World DenoisingGeonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee. 2330-2339 [doi]
- Fourier Space Losses for Efficient Perceptual Image Super-ResolutionDario Fuoli, Luc Van Gool, Radu Timofte. 2340-2349 [doi]
- Lucas-Kanade Reloaded: End-to-End Super-Resolution from Raw Image BurstsBruno Lecouat, Jean Ponce, Julien Mairal. 2350-2359 [doi]
- Variable-Rate Deep Image Compression through Spatially-Adaptive Feature TransformMyungseo Song, Jinyoung Choi, Bohyung Han. 2360-2369 [doi]
- V-DESIRR: Very Fast Deep Embedded Single Image Reflection RemovalB. H. Pawan Prasad, Green Rosh K. S., R. B. Lokesh, Kaushik Mitra, Sanjoy Chowdhury. 2370-2379 [doi]
- NeuSpike-Net: High Speed Video Reconstruction via Bio-inspired Neuromorphic CamerasLin Zhu 0012, Jianing Li, Xiao Wang, Tiejun Huang 0001, Yonghong Tian 0001. 2380-2389 [doi]
- Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed IlluminationDongyoung Kim, Jinwoo Kim, Seonghyeon Nam, Dongwoo Lee, Yeonkyung Lee, Nahyup Kang, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han, Seon Joo Kim. 2390-2399 [doi]
- A Light Stage on Every DeskSoumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz. 2400-2409 [doi]
- A Dark Flash Normal CameraZhihao Xia, Jason Lawrence, Supreeth Achar. 2410-2419 [doi]
- Virtual light transport matrices for non-line-of-sight imagingJulio Marco, Adrián Jarabo, Ji Hyun Nam, Xiaochun Liu, Miguel Ángel Cosculluela, Andreas Velten, Diego Gutierrez. 2420-2429 [doi]
- Learning Dynamic Interpolation for Extremely Sparse Light Fields with Wide BaselinesMantang Guo, Jing Jin 0006, Hui Liu 0032, Junhui Hou. 2430-2439 [doi]
- Deep Reparametrization of Multi-Frame Super-Resolution and DenoisingGoutam Bhat, Martin Danelljan, Fisher Yu, Luc Van Gool, Radu Timofte. 2440-2450 [doi]
- Real-time Image Enhancer via Learnable Spatial-aware 3D Lookup TablesTao Wang, Yong Li 0008, Jingyang Peng, Yipeng Ma, Xian Wang, Fenglong Song, Youliang Yan. 2451-2460 [doi]
- Distillation-guided Image InpaintingMaitreya Suin, Kuldeep Purohit, A. N. Rajagopalan 0001. 2461-2470 [doi]
- SeLFVi: Self-supervised Light-Field Video Reconstruction from Stereo VideoPrasan Shedligeri, Florian Schiffers, Sushobhan Ghosh, Oliver Cossairt, Kaushik Mitra. 2471-2481 [doi]
- HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark DatasetGuanying Chen, Chaofeng Chen, Shi Guo, Zhetong Liang, Kwan-Yee K. Wong, Lei Zhang 0006. 2482-2491 [doi]
- Photon-Starved Scene Inference using Single Photon CamerasBhavya Goyal, Mohit Gupta. 2492-2501 [doi]
- Unsupervised Non-Rigid Image Distortion Removal via Grid DeformationNianyi Li, Simron Thapa, Cameron Whyte, Albert Reed, Suren Jayasuriya, Jinwei Ye. 2502-2512 [doi]
- Super Resolve Dynamic Scene from Continuous Spike StreamsJing Zhao 0011, Jiyu Xie, Ruiqin Xiong, Jian Zhang, Zhaofei Yu, Tiejun Huang 0001. 2513-2522 [doi]
- COMISR: Compression-Informed Video Super-ResolutionYinxiao Li, Pengchong Jin, Feng Yang, Ce Liu, Ming-Hsuan Yang 0001, Peyman Milanfar. 2523-2532 [doi]
- Multitask AET with Orthogonal Tangent Regularity for Dark Object DetectionZiteng Cui, Guo-Jun Qi, Lin Gu, Shaodi You, Zenghui Zhang, Tatsuya Harada. 2533-2542 [doi]
- Event-based Video Reconstruction Using TransformerWenming Weng, Yueyi Zhang, Zhiwei Xiong. 2543-2552 [doi]
- Learning Privacy-preserving Optics for Human Pose EstimationCarlos Hinojosa, Juan Carlos Niebles, Henry Arguello. 2553-2562 [doi]
- Motion Deblurring with Real EventsFang Xu, Lei Yu 0006, Bishan Wang, Wen Yang 0001, Gui-Song Xia, Xu Jia, Zhendong Qiao, Jianzhuang Liu. 2563-2572 [doi]
- Objects as Cameras: Estimating High-Frequency Illumination from ShadowsTristan Swedish, Connor Henley, Ramesh Raskar. 2573-2582 [doi]
- A Simple Framework for 3D Lensless Imaging with Programmable MasksYucheng Zheng, Yi-Hua, Aswin C. Sankaranarayanan, M. Salman Asif. 2583-2592 [doi]
- Universal and Flexible Optical Aberration Correction Using Deep-Prior Based DeconvolutionXiu Li, Jinli Suo, Weihang Zhang, Xin Yuan 0002, Qionghai Dai. 2593-2601 [doi]
- Self-supervised Neural Networks for Spectral Snapshot Compressive ImagingZiyi Meng, Zhenming Yu, Kun Xu, Xin Yuan 0002. 2602-2611 [doi]
- Extreme-Quality Computational Imaging via Degradation FrameworkShiqi Chen, Huajun Feng, Keming Gao, Zhihai Xu, Yueting Chen. 2612-2621 [doi]
- Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous ConvolutionsHyeongseok Son, Junyong Lee, Sunghyun Cho, Seungyong Lee 0001. 2622-2630 [doi]
- Single-shot Hyperspectral-Depth Imaging with Learned Diffractive OpticsSeung-Hwan Baek, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim 0001. 2631-2640 [doi]
- Incorporating Learnable Membrane Time Constant to Enhance Learning of Spiking Neural NetworksWei Fang, Zhaofei Yu, Yanqi Chen, Timothée Masquelier, Tiejun Huang 0001, Yonghong Tian 0001. 2641-2651 [doi]
- Multispectral illumination estimation using deep unrolling networkYuqi Li, Qiang Fu 0002, Wolfgang Heidrich. 2652-2661 [doi]
- A Hybrid Frequency-Spatial Domain Model for Sparse Image Reconstruction in Scanning Transmission Electron MicroscopyBintao He, Fa Zhang, Huanshui Zhang, Renmin Han. 2662-2671 [doi]
- Time-Multiplexed Coded Aperture Imaging: Learned Coded Aperture and Pixel Exposures for Compressive Imaging SystemsEdwin Vargas, Julien N. P. Martel, Gordon Wetzstein, Henry Arguello. 2672-2682 [doi]
- Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object DetectionChaoqi Chen, JiongCheng Li, Zebiao Zheng, Yue Huang 0001, Xinghao Ding, Yizhou Yu. 2683-2692 [doi]
- The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object DetectionZhikang Zou, Xiaoqing Ye, Liang Du, Xianhui Cheng, Xiao Tan 0001, Li Zhang 0040, Jianfeng Feng, Xiangyang Xue, Errui Ding. 2693-2702 [doi]
- Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object DetectionJiageng Mao, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu. 2703-2712 [doi]
- Learning Multi-Scene Absolute Pose Regression with TransformersYoli Shavit, Ron Ferens, Yosi Keller. 2713-2722 [doi]
- Improving 3D Object Detection with Channel-wise TransformerHualian Sheng, Sijia Cai, Yuan Liu, Bing Deng, Jianqiang Huang, Xian-Sheng Hua 0001, Min-Jian Zhao 0001. 2723-2732 [doi]
- HPNet: Deep Primitive Segmentation Using Hybrid RepresentationsSiming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, Qixing Huang. 2733-2742 [doi]
- GraphFPN: Graph Feature Pyramid Network for Object DetectionGangming Zhao, Weifeng Ge, Yizhou Yu. 2743-2752 [doi]
- SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose EstimationKai Chen, Qi Dou. 2753-2762 [doi]
- Instance Segmentation in 3D Scenes using Semantic Superpoint Tree NetworksZhihao Liang, Zhihao Li, Songcen Xu, Mingkui Tan, Kui Jia. 2763-2772 [doi]
- PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6D Pose EstimationGuangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang 0001. 2773-2782 [doi]
- Contrastive Attention Maps for Self-supervised Co-localizationMinsong Ki, Youngjung Uh, Junsuk Choe, Hyeran Byun. 2783-2792 [doi]
- Sparse-shot Learning with Exclusive Cross-Entropy for Extremely Many LocalisationsAndreas Panteli, Jonas Teuwen, Hugo M. Horlings, Efstratios Gavves. 2793-2803 [doi]
- Prior to Segment: Foreground Cues for Weakly Annotated Classes in Partially Supervised Instance SegmentationDavid Biertimpel, Sindi Shkodrani, Anil S. Baslamisli, Nóra Baka. 2804-2813 [doi]
- Weakly Supervised 3D Semantic Segmentation Using Cross-Image Consensus and Inter-Voxel Affinity RelationsXiaoyu Zhu, Jeffrey Chen, Xiangrui Zeng, Junwei Liang 0001, Chengqi Li, Sinuo Liu, Sima Behpour, Min Xu 0009. 2814-2824 [doi]
- Self-Supervised Image Prior Learning with GMM from a Single Noisy ImageHaosen Liu, Xuan Liu, Jiangbo Lu, Shan Tan. 2825-2834 [doi]
- Human Detection and Segmentation via Multi-view ConsensusIsinsu Katircioglu, Helge Rhodin, Jörg Spörri, Mathieu Salzmann, Pascal Fua. 2835-2844 [doi]
- PreDet: Large-scale weakly supervised pre-training for detectionVignesh Ramanathan, Rui Wang 0067, Dhruv Mahajan 0001. 2845-2855 [doi]
- Boosting Weakly Supervised Object Detection via Learning Bounding Box AdjustersBowen Dong, Zitong Huang, Yuelin Guo, Qilong Wang, Zhenxing Niu, Wangmeng Zuo. 2856-2865 [doi]
- TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object LocalizationWei Gao, Fang Wan, Xingjia Pan, Zhiliang Peng, Qi Tian 0001, Zhenjun Han, Bolei Zhou, Qixiang Ye. 2866-2875 [doi]
- Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual LabelsJiannan Guo 0003, Haochen Shi, Yangyang Kang, Kun Kuang, Siliang Tang, Zhuoren Jiang, Changlong Sun, Fei Wu 0001, Yueting Zhuang. 2876-2885 [doi]
- An End-to-End Transformer Model for 3D Object DetectionIshan Misra, Rohit Girdhar, Armand Joulin. 2886-2897 [doi]
- RangeDet: In Defense of Range View for LiDAR-based 3D Object DetectionLue Fan, Xuan Xiong, Feng Wang, Naiyan Wang, Zhaoxiang Zhang. 2898-2907 [doi]
- 3DVG-Transformer: Relation Modeling for Visual Grounding on Point CloudsLichen Zhao, Daigang Cai, Lu Sheng, Dong Xu 0001. 2908-2917 [doi]
- Gated3D: Monocular 3D Object Detection From Temporal Illumination CuesFrank D. Julca-Aguilar, Jason Taylor, Mario Bijelic, Fahim Mannan, Ethan Tseng, Felix Heide. 2918-2928 [doi]
- Group-Free 3D Object Detection via TransformersZe Liu, Zheng Zhang 0022, Yue Cao 0001, Han Hu 0004, Xin Tong 0001. 2929-2938 [doi]
- Body-Face Joint Detection via Embedding and Head HookJunfeng Wan, Jiangfan Deng, Xiaosong Qiu, Feng Zhou 0002. 2939-2948 [doi]
- ELSD: Efficient Line Segment Detector and DescriptorHaotian Zhang, Yicheng Luo, Fangbo Qin, Yijia He, Xiao Liu. 2949-2958 [doi]
- WB-DETR: Transformer-Based Detector without BackboneFanfan Liu, Haoran Wei, Wenzhe Zhao, Guozhen Li, Jingquan Peng, Zihao Li. 2959-2967 [doi]
- Dynamic DETR: End-to-End Object Detection with Dynamic AttentionXiyang Dai, Yinpeng Chen, Jianwei Yang, Pengchuan Zhang, Lu Yuan, Lei Zhang 0001. 2968-2977 [doi]
- Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image EncodingPengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang 0001, Jianfeng Gao. 2978-2988 [doi]
- Rank & Sort Loss for Object Detection and Instance SegmentationKemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan. 2989-2998 [doi]
- Switchable K-class Hyperplanes for Noise-Robust Representation LearningBoxiao Liu, Guanglu Song, Manyuan Zhang, Haihang You, Yu Liu. 2999-3008 [doi]
- DecentLaM: Decentralized Momentum SGD for Large-batch Deep TrainingKun Yuan, Yiming Chen 0003, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin. 3009-3019 [doi]
- Large-scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image ClassificationZhuoning Yuan, Yan Yan 0006, Milan Sonka, Tianbao Yang. 3020-3029 [doi]
- Robust Small-scale Pedestrian Detection with Cued Recall via Memory LearningJung-Uk Kim, Sungjune Park, Yong Man Ro. 3030-3039 [doi]
- End-to-End Semi-Supervised Object Detection with Soft TeacherMengde Xu, Zheng Zhang 0022, Han Hu 0004, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu 0001. 3040-3049 [doi]
- CaT: Weakly Supervised Object Detection with Category TransferTianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya Zhang 0002, Yan-Feng Wang. 3050-3059 [doi]
- ADNet: Leveraging Error-Bias Towards Normal Direction in Face AlignmentYangyu Huang, Hao Yang, Chong Li, Jongyoo Kim, Fangyun Wei. 3060-3070 [doi]
- Causal Attention for Unbiased Visual RecognitionTan Wang, Chang Zhou, Qianru Sun, Hanwang Zhang. 3071-3080 [doi]
- MLVSNet: Multi-level Voting Siamese Network for 3D Visual TrackingZhoutao Wang, Qian Xie, Yu-Kun Lai, Jing Wu, Kun Long, Jun Wang 0039. 3081-3090 [doi]
- Geometry Uncertainty Projection Network for Monocular 3D Object DetectionYan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu 0001, Junjie Yan, Wanli Ouyang. 3091-3101 [doi]
- Multi-Instance Pose Networks: Rethinking Top-Down Pose EstimationRawal Khirodkar, Visesh Chari, Amit Agrawal 0002, Ambrish Tyagi. 3102-3111 [doi]
- OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud RegistrationHao Xu, Shuaicheng Liu, Guangfu Wang, Guanghui Liu, Bing Zeng. 3112-3121 [doi]
- Is Pseudo-Lidar needed for Monocular 3D Object detection?Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li 0031, Adrien Gaidon. 3122-3132 [doi]
- LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D DetectorXiaoyang Guo, Shaoshuai Shi, Xiaogang Wang 0001, Hongsheng Li 0001. 3133-3143 [doi]
- Voxel Transformer for 3D Object DetectionJiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu. 3144-3153 [doi]
- Detecting Invisible PeopleTarasha Khurana, Achal Dave, Deva Ramanan. 3154-3164 [doi]
- CrossDet: Crossline Representation for Object DetectionHeqian Qiu, Hongliang Li 0001, Qingbo Wu 0001, Jianhua Cui, Zichen Song, Lanxiao Wang, Minjian Zhang. 3175-3184 [doi]
- Towards A Universal Model for Cross-Dataset Crowd CountingZhiheng Ma, Xiaopeng Hong, Xing Wei, Yunfeng Qiu, Yihong Gong. 3185-3194 [doi]
- Exploiting sample correlation for crowd counting with multi-expert networkXinyan Liu, Guorong Li, Zhenjun Han, Weigang Zhang, Yifan Yang, Qingming Huang, Nicu Sebe. 3195-3204 [doi]
- Are we Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection?Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder, Elisa Ricci 0001. 3205-3213 [doi]
- Uniformity in Heterogeneity: Diving Deep into Count Interval Partition for Crowd CountingChangan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma 0001, Yang Wu. 3214-3222 [doi]
- Efficient Large Scale Inlier Voting for Geometric Vision ProblemsDror Aiger, Simon Lynen, Jan Hosang, Bernhard Zeisl. 3223-3231 [doi]
- Continual Learning for Image-Based Camera LocalizationShuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala. 3232-3242 [doi]
- Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional NetworksGuangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma, Shih-Fu Chang. 3243-3252 [doi]
- Multi-Source Domain Adaptation for Object DetectionXingxu Yao, Sicheng Zhao, Pengfei Xu 0013, Jufeng Yang. 3253-3262 [doi]
- RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object DetectionYongming Rao, Benlin Liu, Yi Wei, Jiwen Lu, Cho-Jui Hsieh, Jie Zhou 0001. 3263-3272 [doi]
- You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and TrackingJiaming Sun, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang 0001, Hujun Bao, Xiaowei Zhou. 3165-3174 [doi]
- Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object DetectionHanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei Zhang 0196, Zhenguo Li, Luc Van Gool. 3273-3282 [doi]
- RePOSE: Fast 6D Object Pose Refinement via Deep Texture RenderingShun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani. 3283-3292 [doi]
- PICCOLO: Point Cloud-Centric Omnidirectional LocalizationJunho Kim, Changwoon Choi, Hojun Jang, Young Min Kim 0001. 3293-3303 [doi]
- GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape CompletionCheng Chi, Shuran Song. 3304-3313 [doi]
- Personalized and Invertible Face De-identification by Disentangled Identity Information ManipulationJingyi Cao, Bo Liu 0001, Yunqian Wen, Rong Xie, Li Song 0001. 3314-3322 [doi]
- Long-Term Temporally Consistent Unpaired Video Translation from Simulated Surgical 3D DataDominik Rivoir, Micha Pfeiffer, Reuben Docea, Fiona Kolbinger, Carina Riediger, Jürgen Weitz, Stefanie Speidel. 3323-3333 [doi]
- Multi-scale Matching Networks for Semantic CorrespondenceDongyang Zhao, Ziyang Song, Zhenghao Ji, Gangming Zhao, Weifeng Ge, Yizhou Yu. 3334-3344 [doi]
- Rethinking Counting and Localization in Crowds: A Purely Point-Based FrameworkQingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu. 3345-3354 [doi]
- Learning to Better Segment Objects from Unseen Classes with Unlabeled VideosYuming Du, Yang Xiao 0007, Vincent Lepetit. 3355-3364 [doi]
- Foreground Activation Maps for Weakly Supervised Object LocalizationMeng Meng, Tianzhu Zhang, Qi Tian 0001, Yongdong Zhang 0001, Feng Wu 0001. 3365-3375 [doi]
- ICON: Learning Regular Maps Through Inverse ConsistencyHastings Greer, Roland Kwitt, François-Xavier Vialard, Marc Niethammer. 3376-3385 [doi]
- DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box SupervisionShiyi Lan, Zhiding Yu, Christopher B. Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar. 3386-3396 [doi]
- Exploring Classification Equilibrium in Long-Tailed Object DetectionChengjian Feng, Yujie Zhong, Weilin Huang. 3397-3406 [doi]
- Normalization Matters in Weakly Supervised Object LocalizationJeesoo Kim, Junsuk Choe, Sangdoo Yun, Nojun Kwak. 3407-3416 [doi]
- Training Multi-Object Detector by Estimating Bounding Box Distribution for Input ImageJaeyoung Yoo, Hojun Lee, Inseop Chung, Geonseok Seo, Nojun Kwak. 3417-3426 [doi]
- Semi-Supervised Active Learning with Temporal Output DiscrepancySiyu Huang, Tianyang Wang, Haoyi Xiong, Jun Huan, Dejing Dou. 3427-3436 [doi]
- FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance SegmentationYuhang Zang, Chen Huang, Chen Change Loy. 3437-3446 [doi]
- Learning Hierarchical Graph Neural Networks for Image ClusteringYifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto. 3447-3457 [doi]
- Big Self-Supervised Models Advance Medical Image ClassificationShekoofeh Azizi, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi 0002. 3458-3468 [doi]
- Collaborative and Adversarial Learning of Focused and Dispersive Representations for Semi-supervised Polyp SegmentationHuisi Wu, Guilian Chen, Zhenkun Wen, Jing Qin 0001. 3469-3478 [doi]
- Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse ContextsHong-Yu Zhou, Chixiang Lu, Sibei Yang, Xiaoguang Han, Yizhou Yu. 3479-3489 [doi]
- TOOD: Task-aligned One-stage Object DetectionChengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang. 3490-3499 [doi]
- Oriented R-CNN for Object DetectionXingxing Xie, Gong Cheng 0003, Jiabao Wang, Xiwen Yao, Junwei Han. 3500-3509 [doi]
- Towards Rotation Invariance in Object DetectionAgastya Kalra, Guy Stoppi, Bradley Brown, Rishav Agarwal, Achuta Kadambi. 3510-3520 [doi]
- FMODetect: Robust Detection of Fast Moving ObjectsDenys Rozumnyi, Jirí Matas, Filip Sroubek, Marc Pollefeys, Martin R. Oswald. 3521-3529 [doi]
- Visual Relationship Detection Using Part-and-Sum Transformers with Composite QueriesQi Dong, Zhuowen Tu, Haofu Liao, Yuting Zhang, Vijay Mahadevan, Stefano Soatto. 3530-3539 [doi]
- DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose ConsistencyJiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li. 3540-3549 [doi]
- SimROD: A Simple Adaptation Method for Robust Object DetectionRindra Ramamonjison, Amin Banitalebi-Dehkordi, Xinyu Kang, Xiaolong Bai, Yong Zhang. 3550-3559 [doi]
- Disentangled High Quality Salient Object DetectionLv Tang, Bo Li, Yijie Zhong, Shouhong Ding, Mofei Song. 3560-3570 [doi]
- G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature ImitationLewei Yao, Renjie Pi, Hang Xu, Wei Zhang 0196, Zhenguo Li, Tong Zhang. 3571-3580 [doi]
- TransFER: Learning Relation-aware Facial Expression Representations with TransformersFanglei Xue, Qiangchang Wang, Guodong Guo. 3581-3590 [doi]
- Rethinking Transformer-based Set Prediction for Object DetectionZhiqing Sun, Shengcao Cao, Yiming Yang, Kris Kitani. 3591-3600 [doi]
- Fast Convergence of DETR with Spatially Modulated Co-AttentionPeng Gao 0007, Minghang Zheng, Xiaogang Wang 0001, Jifeng Dai, Hongsheng Li 0001. 3601-3610 [doi]
- Reconcile Prediction Consistency for Balanced Object DetectionKeyang Wang, Lei Zhang. 3611-3620 [doi]
- Mutual Supervision for Dense Object DetectionZiteng Gao, Limin Wang 0002, Gangshan Wu. 3621-3630 [doi]
- Conditional DETR for Fast Training ConvergenceDepu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang 0001. 3631-3640 [doi]
- Meta Pairwise Relationship Distillation for Unsupervised Person Re-identificationHaoxuanye Ji, Le Wang 0003, Sanping Zhou, Wei Tang, Nanning Zheng 0001, Gang Hua 0001. 3641-3650 [doi]
- Teacher-Student Adversarial Depth Hallucination to Improve Face RecognitionHardik Uppal, Alireza Sepas-Moghaddam, Michael A. Greenspan, Ali Etemad. 3651-3660 [doi]
- Fake it till you make it: face analysis in the wild using synthetic data aloneErroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Sebastian Dziadzio, Thomas J. Cashman 0001, Jamie Shotton. 3661-3671 [doi]
- Disentangled Representation for Age-Invariant Face Recognition: A Mutual Information Minimization PerspectiveXuege Hou, Yali Li, Shengjin Wang. 3672-3681 [doi]
- Cross-Encoder for Unsupervised Gaze Representation LearningYunjia Sun, Jiabei Zeng, Shiguang Shan, Xilin Chen 0001. 3682-3691 [doi]
- VENet: Voting Enhancement Network for 3D Object DetectionQian Xie, Yu-Kun Lai, Jing Wu, Zhoutao Wang, Dening Lu, Mingqiang Wei, Jun Wang 0039. 3692-3701 [doi]
- Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point CloudMingtao Feng, Zhen Li, Qi Li, Liang Zhang 0010, Xiangdong Zhang, Guangming Zhu 0001, Hui Zhang 0023, Yaonan Wang, Ajmal Mian. 3702-3711 [doi]
- Real-time Vanishing Point Detector Integrating Under-parameterized RANSAC and Hough TransformJianping Wu, Liang Zhang, Ye Liu, Ke Chen 0014. 3712-3721 [doi]
- Looking here or there? Gaze Following in 360-Degree ImagesYunhao Li, Wei Shen, Zhongpai Gao, Yucheng Zhu, Guangtao Zhai, Guodong Guo. 3722-3731 [doi]
- Towards Efficient Graph Convolutional Networks for Point Cloud HandlingYawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory S. Chirikjian, Luc Van Gool. 3732-3742 [doi]
- Multi-Echo LiDAR for 3D Object DetectionYunze Man, Xinshuo Weng, Prasanna Kumar Sivakumar, Matthew O'Toole, Kris Kitani. 3743-3752 [doi]
- CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional ConvolutionLizhe Liu, Xiaohao Chen, Siyu Zhu, Ping Tan. 3753-3762 [doi]
- CrackFormer: Transformer Network for Fine-Grained Crack DetectionHuajun Liu, Xiangyu Miao, Christoph Mertz, Chengzhong Xu 0001, Hui Kong. 3763-3772 [doi]
- DWKS: A Local Descriptor of Deformations Between Meshes and Point CloudsRobin Magnet, Maks Ovsjanikov. 3773-3782 [doi]
- Physics-Enhanced Machine Learning for Virtual Fluorescence MicroscopyColin L. V. Cooke, Fanjie Kong, Amey Chaware, Kevin C. Zhou, Kanghyun Kim, Rong Xu, D. Michael Ando, Samuel J. Yang, Pavan Chandra Konda, Roarke Horstmeyer. 3783-3793 [doi]
- DAM: Discrepancy Alignment Metric for Face RecognitionJiaheng Liu, Yudong Wu, Yichao Wu, Chuming Li, Xiaolin Hu, Ding Liang, Mengyu Wang. 3794-3803 [doi]
- Topologically Consistent Multi-View Face Inference Using Volumetric SamplingTianye Li, Shichen Liu, Timo Bolkart, Jiayi Liu, Hao Li 0015, Yajie Zhao. 3804-3814 [doi]
- Generalizing Gaze Estimation with Outlier-guided Collaborative AdaptationYunfei Liu, Ruicong Liu, Haofei Wang, Feng Lu. 3815-3824 [doi]
- Learn to Cluster Faces via Pairwise ClassificationJunfu Liu, Di Qiu, Pengfei Yan, Xiaolin Wei. 3825-3833 [doi]
- End-to-end robust joint unsupervised image alignment and clusteringXiangrui Zeng, Gregory Howe, Min Xu 0009. 3834-3846 [doi]
- FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute LearningChenxu Zhang, Yifan Zhao, Yifei Huang, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo. 3847-3856 [doi]
- Disentangled Lifespan Face SynthesisSen He, Wentong Liao, Michael Ying Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang. 3857-3866 [doi]
- Retrieve in Style: Unsupervised Facial Feature Transfer and RetrievalMin Jin Chong, Wen-Sheng Chu, Abhishek Kumar, David A. Forsyth. 3867-3876 [doi]
- Towards Face Encryption by Generating Adversarial Identity MasksXiao Yang, Yinpeng Dong, Tianyu Pang, Hang Su 0006, Jun Zhu 0001, Yuefeng Chen, Hui Xue 0001. 3877-3887 [doi]
- Re-Aging GAN: Toward Personalized Face Age TransformationFarkhod Makhmudkhujaev, Sungeun Hong, In Kyu Park. 3888-3897 [doi]
- Recurrent Mask Refinement for Few-Shot Medical Image SegmentationHao Tang, Xingwei Liu, Shanlin Sun, Xiangyi Yan, Xiaohui Xie. 3898-3908 [doi]
- Generative Adversarial Registration for Improved Conditional Deformable TemplatesNeel Dey, Mengwei Ren, Adrian V. Dalca, Guido Gerig. 3909-3921 [doi]
- GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image RecognitionShih-Cheng Huang, Liyue Shen, Matthew P. Lungren, Serena Yeung. 3922-3931 [doi]
- Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images with Artificial Neural NetworksAlireza Naghizadeh, Hongye Xu, Mohab Mohamed, Dimitris N. Metaxas, Dongfang Liu. 3932-3941 [doi]
- T-AutoML: Automated Machine Learning for Lesion Segmentation using Transformers in 3D Medical ImagingDong Yang 0005, Andriy Myronenko, Xiaosong Wang, Ziyue Xu, Holger R. Roth, Daguang Xu. 3942-3954 [doi]
- RFNet: Region-aware Fusion Network for Incomplete Multi-modal Brain Tumor SegmentationYuhang Ding, Xin Yu, Yi Yang 0001. 3955-3964 [doi]
- Visual-Textual Attentive Semantic Consistency for Medical Report GenerationYi Zhou 0007, Lei Huang 0015, Tao Zhou 0002, Huazhu Fu, Ling Shao 0001. 3965-3974 [doi]
- The Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled VideoJohn Gideon, Simon Stent. 3975-3984 [doi]
- Multi-Class Cell Detection Using Spatial Context RepresentationShahira Abousamra, David Belinsky, John S. Van Arnam, Felicia Allard, Eric Yee, Rajarsi Gupta, Tahsin M. Kurç, Dimitris Samaras, Joel H. Saltz, Chao Chen 0012. 3985-3994 [doi]
- Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide ImagesRichard J. Chen, Ming Y. Lu, Wei-Hung Weng, Tiffany Y. Chen, Drew F. K. Williamson, Trevor Manz, Maha Shady, Faisal Mahmood. 3995-4005 [doi]
- CDNet: Centripetal Direction Network for Nuclear Instance SegmentationHongliang He, Zhongyi Huang, Yao Ding 0006, Guoli Song, Lin Wang, Qian Ren, Pengxu Wei, Zhiqiang Gao, Jie Chen 0001. 4006-4015 [doi]
- Mutual-Complementing Framework for Nuclei Detection and Segmentation in Pathology ImageZunlei Feng, Zhonghua Wang, Xinchao Wang, Yining Mao, Thomas Li, Jie Lei, Yuexuan Wang, Mingli Song. 4016-4025 [doi]
- Deep survival analysis with longitudinal X-rays for COVID-19Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih. 4026-4035 [doi]
- Self-Supervised Cryo-Electron Tomography Volumetric Image Restoration from Single Noisy Volume with Sparsity ConstraintZhidong Yang, Fa Zhang 0001, Renmin Han. 4036-4045 [doi]
- CryoDRGN2: Ab initio neural reconstruction of 3D protein structures from real cryo-EM imagesEllen D. Zhong, Adam Lerer, Joseph H. Davis, Bonnie Berger. 4046-4055 [doi]
- Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image RescalingJingyun Liang, Andreas Lugmayr, Kai Zhang 0008, Martin Danelljan, Luc Van Gool, Radu Timofte. 4056-4065 [doi]
- Learning Dual Priors for JPEG Compression Artifacts RemovalXueyang Fu, Xi Wang, Aiping Liu, Junwei Han, Zheng-Jun Zha. 4066-4075 [doi]
- Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-ResolutionJingyun Liang, Guolei Sun, Kai Zhang 0008, Luc Van Gool, Radu Timofte. 4076-4085 [doi]
- STAR: A Structure-aware Lightweight Transformer for Real-time Image EnhancementZhaoyang Zhang, Yitong Jiang, Jun Jiang, Xiaogang Wang 0001, Ping Luo 0002, Jinwei Gu. 4086-4095 [doi]
- Perceptual Variousness Motion Deblurring with Light Global Context RefinementJichun Li, Weimin Tan, Bo Yan. 4096-4105 [doi]
- StarEnhancer: Learning Real-Time and Style-Aware Image EnhancementYuda Song, Hui Qian 0001, Xin Du. 4106-4115 [doi]
- MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object DetectionYongri Piao, Jian Wang, Miao Zhang, Huchuan Lu. 4116-4125 [doi]
- Uncertainty-Guided Transformer Reasoning for Camouflaged Object DetectionFan Yang 0054, Qiang Zhai, Xin Li 0079, Rui Huang 0008, Ao Luo, Hong Cheng 0002, Deng-Ping Fan. 4126-4135 [doi]
- Scene Context-Aware Salient Object DetectionAvishek Siris, Jianbo Jiao, Gary K. L. Tam, Xianghua Xie, Rynson W. H. Lau. 4136-4146 [doi]
- Summarize and Search: Learning Consensus-aware Dynamic Convolution for Co-Saliency DetectionNi Zhang, Junwei Han, Nian Liu, Ling Shao 0001. 4147-4156 [doi]
- Light Source Guided Single-Image Flare Removal from Unpaired DataXiaotian Qiao, Gerhard P. Hancke 0002, Rynson W. H. Lau. 4157-4165 [doi]
- PlaneTR: Structure-Guided Transformers for 3D Plane RecoveryBin Tan, Nan Xue 0001, Song Bai, Tianfu Wu 0001, Gui-Song Xia. 4166-4175 [doi]
- ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel LossWei-Ting Chen, Hao-Yu Fang, Cheng-Lin Hsieh, Cheng-Che Tsai, I-Hsiang Chen, Jian-Jiun Ding, Sy-Yen Kuo. 4176-4185 [doi]
- Exploring Visual Engagement Signals for Representation LearningMenglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge J. Belongie, Ser-Nam Lim. 4186-4197 [doi]
- TransView: Inside, Outside, and Across the Cropping View BoundariesZhiyu Pan, Zhiguo Cao 0001, Kewei Wang, Hao Lu 0003, Weicai Zhong. 4198-4207 [doi]
- Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter VideoBin Fan, Yuchao Dai. 4208-4217 [doi]
- Structure-Preserving Deraining with Residue Channel Prior GuidanceQiaosi Yi, Juncheng Li 0003, Qinyan Dai, Faming Fang, Guixu Zhang, Tieyong Zeng. 4218-4227 [doi]
- ReconfigISP: Reconfigurable Camera Image Processing PipelineKe Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu. 4228-4237 [doi]
- Event-Intensity Stereo: Estimating Depth by the Best of Both WorldsS. Mohammad Mostafavi I., Kuk-Jin Yoon, Jonghyun Choi. 4238-4247 [doi]
- End-to-end Piece-wise Unwarping of Document ImagesSagnik Das, Kunwar Yashraj Singh, Jon Wu, Erhan Bas, Vijay Mahadevan, Rahul Bhotika, Dimitris Samaras. 4248-4257 [doi]
- Context Reasoning Attention Network for Image Super-ResolutionYulun Zhang, Donglai Wei, Can Qin, Huan Wang 0014, Hanspeter Pfister, Yun Fu 0001. 4258-4267 [doi]
- Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-ResolutionSalma Abdel Magid, Yulun Zhang, Donglai Wei, Won-Dong Jang, Zudi Lin, Yun Fu 0001, Hanspeter Pfister. 4268-4277 [doi]
- Pyramid Architecture Search for Real-Time Image DeblurringXiaobin Hu, Wenqi Ren, Kaicheng Yu, Kaihao Zhang, Xiaochun Cao, Wei Liu 0005, Bjoern H. Menze. 4278-4287 [doi]
- Learning Frequency-aware Dynamic Network for Efficient Super-ResolutionWenbin Xie, Dehua Song, Chang Xu 0002, Chunjing Xu, Hui Zhang, Yunhe Wang 0001. 4288-4297 [doi]
- Unsupervised Real-World Super-Resolution: A Domain Adaptation PerspectiveWei Wang, Haochen Zhang, Zehuan Yuan, Changhu Wang. 4298-4307 [doi]
- Dynamic Attentive Graph Learning for Image RestorationChong Mou, Jian Zhang, Zhuoyuan Wu. 4308-4317 [doi]
- RGB-D Saliency Detection via Cascaded Mutual Information MinimizationJing Zhang 0052, Deng-Ping Fan, Yuchao Dai, Xin Yu 0002, Yiran Zhong, Nick Barnes, Ling Shao 0001. 4318-4327 [doi]
- Learning RAW-to-sRGB Mappings with Inaccurately Aligned SupervisionZhilu Zhang, Haolin Wang, Ming Liu 0018, Ruohao Wang, Jiawei Zhang 0002, Wangmeng Zuo. 4328-4338 [doi]
- Deep Structured Instance Graph for Distilling Object DetectorsYixin Chen, Pengguang Chen, Shu Liu 0005, Liwei Wang, Jiaya Jia. 4339-4348 [doi]
- Learning Unsupervised Metaformer for Anomaly DetectionJhih-Ciang Wu, Ding-Jie Chen, Chiou-Shann Fuh, Tyng-Luh Liu. 4349-4358 [doi]
- Equivariant Imaging: Learning Beyond the Range SpaceDongdong Chen 0004, Julián Tachella, Mike E. Davies. 4359-4368 [doi]
- Multi-Level Curriculum for Training A Distortion-Aware Barrel Distortion Rectification ModelKang Liao, Chunyu Lin, Lixin Liao, Yao Zhao 0001, Weiyao Lin. 4369-4378 [doi]
- Zero-Shot Day-Night Domain Adaptation with a Physics PriorAttila Lengyel, Sourav Garg, Michael Milford, Jan C. van Gemert. 4379-4389 [doi]
- MixMix: All You Need for Data-Free Compression Are Feature and Data MixingYuhang Li, Feng Zhu, Ruihao Gong, Mingzhu Shen, Xin Dong 0009, Fengwei Yu, Shaoqing Lu, Shi Gu. 4390-4399 [doi]
- Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective AlignmentLin Zhang, Yong Luo 0002, Yan Bai, Bo Du, Ling-Yu Duan. 4400-4408 [doi]
- Omniscient Video Super-ResolutionPeng Yi 0002, Zhongyuan Wang 0001, Kui Jiang, Junjun Jiang, Tao Lu 0001, Xin Tian 0006, Jiayi Ma 0001. 4409-4418 [doi]
- Adaptive Unfolding Total Variation Network for Low-Light Image EnhancementChuanjun Zheng, Daming Shi 0001, Wentian Shi. 4419-4428 [doi]
- Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral LearningZhuoran Zheng, Wenqi Ren, Xiaochun Cao, Tao Wang, Xiuyi Jia. 4429-4438 [doi]
- Representative Color Transform for Image EnhancementHanul Kim, Su-Min Choi, Chang-Su Kim 0001, Yeong Jun Koh. 4439-4448 [doi]
- Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot ExemplarPeike Li, Xin Yu, Yi Yang 0001. 4449-4459 [doi]
- Event Stream Super-Resolution via Spatiotemporal Constraint LearningSiqi Li, Yutong Feng, Yipeng Li, Yu Jiang, Changqing Zou, Yue Gao 0002. 4460-4469 [doi]
- Self-Conditioned Probabilistic Learning of Video RescalingYuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao. 4470-4479 [doi]
- A New Journey from SDRTV to HDRTVXiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao 0001, Chao Dong. 4480-4489 [doi]
- ResRep: Lossless CNN Pruning via Decoupling Remembering and ForgettingXiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu 0002, Jungong Han, Yuchen Guo, Guiguang Ding. 4490-4500 [doi]
- Efficient Video Compression via Content-Adaptive Super-ResolutionMehrdad Khani Shirkoohi, Vibhaalakshmi Sivaraman, Mohammad Alizadeh. 4501-4510 [doi]
- Bringing Events into Video Deblurring with Non-consecutively Blurry FramesWei Shang, Dongwei Ren, Dongqing Zou, Jimmy S. Ren, Ping Luo 0002, Wangmeng Zuo. 4511-4520 [doi]
- SUNet: Symmetric Undistortion Network for Rolling Shutter CorrectionBin Fan, Yuchao Dai, Mingyi He. 4521-4530 [doi]
- Robust Automatic Monocular Vehicle Speed Estimation for Traffic SurveillanceJérôme Revaud, Martin Humenberger. 4531-4541 [doi]
- Augmenting Depth Estimation with Geospatial ContextScott Workman, Hunter Blanton. 4542-4551 [doi]
- Real-Time Video Inference on Edge Devices via Adaptive Model StreamingMehrdad Khani Shirkoohi, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh. 4552-4562 [doi]
- Score-Based Point Cloud DenoisingShitong Luo, Wei Hu. 4563-4572 [doi]
- Rethinking Noise Synthesis and Modeling in Raw DenoisingYi Zhang, Hongwei Qin, Xiaogang Wang, Hongsheng Li. 4573-4581 [doi]
- Extensions of Karger's Algorithm: Why They Fail in Theory and How They Are Useful in PracticeErik Jenner, Enrique Fita Sanmartín, Fred A. Hamprecht. 4582-4591 [doi]
- Low-Rank Tensor Completion by Approximating the Tensor Average RankZhanliang Wang, Junyu Dong, Xinguo Liu, Xueying Zeng. 4592-4600 [doi]
- RDI-Net: Relational Dynamic Inference NetworksHuanyu Wang, Songyuan Li, Shihao Su, Zequn Qin, Xi Li 0001. 4601-4610 [doi]
- Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature ModulationJiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li 0009, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen 0001, Chuang Zhang, Ming Wu 0001. 4611-4620 [doi]
- Rethinking Coarse-to-Fine Approach in Single Image DeblurringSung-Jin Cho 0002, Seo-Won Ji, Jun-Pyo Hong, Seung-Won Jung, Sung Jea Ko. 4621-4630 [doi]
- Cross-Patch Graph Convolutional Network for Image DenoisingYao Li, Xueyang Fu, Zheng-Jun Zha. 4631-4640 [doi]
- PnP-DETR: Towards Efficient Visual Analysis with TransformersTao Wang 0053, Li Yuan 0007, Yunpeng Chen, Jiashi Feng, Shuicheng Yan. 4641-4650 [doi]
- DCT-SNN: Using DCT to Distribute Spatial Information over Time for Low-Latency Spiking Neural NetworksIsha Garg, Sayeed Shafayet Chowdhury, Kaushik Roy 0001. 4651-4660 [doi]
- Specificity-preserving RGB-D Saliency DetectionTao Zhou 0002, Huazhu Fu, Geng Chen 0001, Yi Zhou 0007, Deng-Ping Fan, Ling Shao 0001. 4661-4671 [doi]
- High-Fidelity Pluralistic Image Completion with TransformersZiyu Wan, Jingbo Zhang, Dongdong Chen 0001, Jing Liao 0001. 4672-4681 [doi]
- Mitigating Intensity Bias in Shadow Detection via Feature Decomposition and ReweightingLei Zhu, Ke Xu 0010, Zhanghan Ke, Rynson W. H. Lau. 4682-4691 [doi]
- Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative GuidanceNian Liu, Wangbo Zhao, Dingwen Zhang, Junwei Han, Ling Shao 0001. 4692-4701 [doi]
- Visual Saliency TransformerNian Liu, Ni Zhang, Kaiyuan Wan, Ling Shao 0001, Junwei Han. 4702-4712 [doi]
- HiNet: Deep Image Hiding by Invertible NetworkJunpeng Jing, Xin Deng 0002, Mai Xu, Jianyi Wang, Zhenyu Guan. 4713-4722 [doi]
- CANet: A Context-Aware Network for Shadow RemovalZipei Chen, Chengjiang Long, Ling Zhang, Chunxia Xiao. 4723-4732 [doi]
- Unpaired Learning for Deep Image Deraining with Rain Direction RegularizerYang Liu 0119, Ziyu Yue, Jinshan Pan, Zhixun Su. 4733-4741 [doi]
- DivAug: Plug-in Automated Data Augmentation with Explicit Diversity MaximizationZirui Liu, Haifeng Jin, Ting-Hsiang Wang, Kaixiong Zhou, Xia Hu. 4742-4750 [doi]
- Morphable Detector for Object Detection on DemandXiangyun Zhao, Xu Zou, Ying Wu. 4751-4760 [doi]
- Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning SchemeXi Yang, Wangmeng Xiang, Hui Zeng, Lei Zhang 0006. 4761-4770 [doi]
- Designing a Practical Degradation Model for Deep Blind Image Super-ResolutionKai Zhang 0008, Jingyun Liang, Luc Van Gool, Radu Timofte. 4771-4780 [doi]
- Learning A Single Network for Scale-Arbitrary Super-ResolutionLongguang Wang, Yingqian Wang 0002, Zaiping Lin, Jungang Yang 0001, Wei An, Yulan Guo. 4781-4790 [doi]
- Deep Blind Video Super-resolutionJinshan Pan, Haoran Bai, Jiangxin Dong, Jiawei Zhang 0002, Jinhui Tang. 4791-4800 [doi]
- Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning SearchZheng Zhan 0001, Yifan Gong 0004, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David R. Kaeli, Bin Ren, Xue Lin, Yanzhi Wang. 4801-4811 [doi]
- SSH: A Self-Supervised Framework for Image HarmonizationYifan Jiang, He Zhang, Jianming Zhang 0001, Yilin Wang, Zhe L. Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang. 4812-4821 [doi]
- Out-of-boundary View Synthesis Towards Full-Frame Video StabilizationYufei Xu, Jing Zhang, Dacheng Tao. 4822-4831 [doi]
- R-SLAM: Optimizing Eye Tracking from Rolling Shutter Video of the RetinaJay Shenoy, James Fong, Jeffrey Tan, Austin Roorda, Ren Ng. 4832-4841 [doi]
- Attentive and Contrastive Learning for Joint Depth and Motion Field EstimationSeokJu Lee, François Rameau, Fei Pan, In-So Kweon. 4842-4851 [doi]
- Panoptic Segmentation of Satellite Image Time Series with Convolutional Temporal Attention NetworksVivien Sainte Fare Garnot, Loïc Landrieu. 4852-4861 [doi]
- EvIntSR-Net: Event Guided Multiple Latent Frames Reconstruction and Super-resolutionJin Han, Yixin Yang, Chu Zhou, Chao Xu 0006, Boxin Shi. 4862-4871 [doi]
- Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive ImagingZhuoyuan Wu, Jian Zhang, Chong Mou. 4872-4881 [doi]
- Video Matting via Consistency-Regularized Graph Neural NetworksTiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang 0001. 4882-4891 [doi]
- Collaborative Unsupervised Visual Representation Learning from Decentralized DataWeiming Zhuang, Xin Gan, Yonggang Wen 0001, Shuai Zhang, Shuai Yi. 4892-4901 [doi]
- Full-Duplex Strategy for Video Object SegmentationGe-Peng Ji, Keren Fu, Zhe Wu, Deng-Ping Fan, Jianbing Shen, Ling Shao 0001. 4902-4913 [doi]
- iNAS: Integral NAS for Device-Aware Salient Object DetectionYuchao Gu, Shang-hua Gao, Xu-Sheng Cao, Peng Du, Shao-Ping Lu, Ming-Ming Cheng. 4914-4924 [doi]
- A Machine Teaching Framework for Scalable RecognitionPei Wang, Nuno Vasconcelos. 4925-4934 [doi]
- The Benefit of Distraction: Denoising Camera-Based Physiological Measurements using Inverse AttentionEwa Magdalena Nowara, Daniel McDuff, Ashok Veeraraghavan. 4935-4944 [doi]
- Adaptive Graph Convolution for Point Cloud AnalysisHaoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin 0001, Tong Lu. 4945-4954 [doi]
- Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude LearningYu Tian, Guansong Pang, Yuanhong Chen, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro. 4955-4966 [doi]
- Improving De-raining Generalization via Neural ReorganizationJie Xiao, Man Zhou, Xueyang Fu, Aiping Liu, Zheng-Jun Zha. 4967-4976 [doi]
- Towards Flexible Blind JPEG Artifacts RemovalJiaxi Jiang, Kai Zhang 0008, Radu Timofte. 4977-4986 [doi]
- Learning to Remove Refractive Distortions from Underwater ImagesSimron Thapa, Nianyi Li, Jinwei Ye. 4987-4996 [doi]
- Location-aware Single Image Reflection RemovalZheng Dong, Ke Xu 0010, Yin Yang 0002, Hujun Bao, Weiwei Xu, Rynson W. H. Lau. 4997-5006 [doi]
- DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided NetworkYeying Jin, Aashish Sharma, Robby T. Tan. 5007-5016 [doi]
- Polarimetric Helmholtz StereopsisYuqi Ding, Yu Ji 0001, Mingyuan Zhou, Sing Bing Kang, Jinwei Ye. 5017-5026 [doi]
- Self-born Wiring for Neural TreesYing Chen, Feng Mao, Jie Song, Xinchao Wang, Huiqiong Wang, Mingli Song. 5027-5036 [doi]
- Student Customized Knowledge Distillation: Bridging the Gap Between Student and TeacherYichen Zhu, Yi Wang. 5037-5046 [doi]
- Adaptive Curriculum LearningYajing Kong, Liu Liu, Jun Wang, Dacheng Tao. 5047-5056 [doi]
- BlockPlanner: City Block Generation with Vectorized Graph RepresentationLinning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin. 5057-5066 [doi]
- Rethinking Deep Image Prior for DenoisingYeonsik Jo, Se Young Chun, Jonghyun Choi. 5067-5076 [doi]
- NASOA: Towards Faster Task-oriented Online Fine-tuning with a Zoo of ModelsHang Xu, Ning Kang 0001, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li. 5077-5086 [doi]
- Learning Multiple Pixelwise Tasks Based on Loss Scale BalancingJae-Han Lee, Chul Lee, Chang-Su Kim 0001. 5087-5096 [doi]
- Pixel Difference Networks for Efficient Edge DetectionZhuo Su 0002, Wenzhe Liu, Zitong Yu, Dewen Hu, Qing Liao 0001, Qi Tian 0001, Matti Pietikäinen, Li Liu 0002. 5097-5107 [doi]
- Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic SegmentationRobin Chan, Matthias Rottmann, Hanno Gottschalk. 5108-5117 [doi]
- Spectral Leakage and Rethinking the Kernel Size in CNNsNergis Tomen, Jan C. van Gemert. 5118-5127 [doi]
- MUSIQ: Multi-scale Image Quality TransformerJunjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang. 5128-5137 [doi]
- BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online PoliciesThomas Verelst, Tinne Tuytelaars. 5138-5147 [doi]
- SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCamYonggan Fu, Yang Zhang, Yue Wang 0036, Zhihan Lu, Vivek Boominathan, Ashok Veeraraghavan, Yingyan Lin. 5148-5157 [doi]
- Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality AssessmentPengfei Chen, Leida Li, Jinjian Wu, Weisheng Dong, Guangming Shi. 5158-5167 [doi]
- Bit-Mixer: Mixed-precision networks with runtime bit-width selectionAdrian Bulat, Georgios Tzimiropoulos. 5168-5177 [doi]
- ReCU: Reviving the Dead Weights in Binary Neural NetworksZihan Xu, Mingbao Lin, Jianzhuang Liu, Jie Chen 0001, Ling Shao 0001, Yue Gao 0002, Yonghong Tian 0001, Rongrong Ji. 5178-5188 [doi]
- HIRE-SNN: Harnessing the Inherent Robustness of Energy-Efficient Deep Spiking Neural Networks by Training with Crafted Input NoiseSouvik Kundu 0002, Massoud Pedram, Peter A. Beerel. 5189-5198 [doi]
- FATNN: Fast and Accurate Ternary Neural NetworksPeng Chen, Bohan Zhuang, Chunhua Shen. 5199-5208 [doi]
- Towards Memory-Efficient Neural Networks via Multi-Level in situ GenerationJiaqi Gu, Hanqing Zhu, Chenghao Feng, Mingjie Liu, Zixuan Jiang, Ray T. Chen, David Z. Pan.