Abstract is missing.
- Salient Object Detection for Point CloudsSonglin Fan, Wei Gao 0003, Ge Li 0002. 1-19 [doi]
- Learning Semantic Segmentation from Multiple Datasets with Label ShiftsDongwan Kim, Yi-Hsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg, Manmohan Chandraker, Bohyung Han. 20-36 [doi]
- Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance DiscriminationKangcheng Liu, Yuzhi Zhao, Qiang Nie, Zhi Gao, Ben M. Chen. 37-55 [doi]
- Towards Open-Vocabulary Scene Graph Generation with Prompt-Based FinetuningTao He 0007, Lianli Gao, Jingkuan Song, Yuan-Fang Li. 56-73 [doi]
- Variance-Aware Weight Initialization for Point Convolutional Neural NetworksPedro Hermosilla, Michael Schelling, Tobias Ritschel 0001, Timo Ropinski. 74-89 [doi]
- Break and Make: Interactive Structural Understanding Using LEGO BricksAaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox. 90-107 [doi]
- Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow EstimationWencan Cheng, Jong Hwan Ko. 108-124 [doi]
- 3DG-STFM: 3D Geometric Guided Student-Teacher Feature MatchingRunyu Mao, Chen Bai, Yatong An, Fengqing Zhu, Cheng Lu. 125-142 [doi]
- Video Restoration Framework and Its Meta-adaptations to Data-Poor ConditionsPrashant W. Patil, Sunil Gupta 0001, Santu Rana, Svetha Venkatesh. 143-160 [doi]
- MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point CloudMichaël Ramamonjisoa, Sinisa Stekovic, Vincent Lepetit. 161-177 [doi]
- Scene Text Recognition with Permuted Autoregressive Sequence ModelsDarwin Bautista, Rowel Atienza. 178-196 [doi]
- When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression RecognitionBohan Li, Ye Yuan, Dingkang Liang, Xiao Liu, Zhilong Ji, Jinfeng Bai, Wenyu Liu 0001, Xiang Bai. 197-214 [doi]
- Detecting Tampered Scene Text in the WildYuxin Wang, Hongtao Xie, Mengting Xing, Jing Wang, Shenggao Zhu, Yongdong Zhang 0001. 215-232 [doi]
- Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement LearningJingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai. 233-248 [doi]
- GLASS: Global to Local Attention for Scene-Text SpottingRoi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha. 249-266 [doi]
- COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated TextsJeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa. 267-283 [doi]
- Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and SpottingChuhui Xue, Wenqing Zhang, Yu Hao, Shijian Lu, Philip H. S. Torr, Song Bai. 284-302 [doi]
- Toward Understanding WordArt: Corner-Guided Transformer for Scene Text RecognitionXudong Xie, Ling Fu, Zhifei Zhang, Zhaowen Wang, Xiang Bai. 303-321 [doi]
- Levenshtein OCRCheng Da, Peng Wang, Cong Yao. 322-338 [doi]
- Multi-granularity Prediction for Scene Text RecognitionPeng Wang, Cheng Da, Cong Yao. 339-355 [doi]
- Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text SpottingYing Chen, Liang Qiao 0001, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Xi Li. 356-373 [doi]
- Contextual Text Block Detection Towards Scene Text UnderstandingChuhui Xue, Jiaxing Huang 0001, Wenqing Zhang, Shijian Lu, Changhu Wang, Song Bai. 374-391 [doi]
- CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression RecognitionWenqi Zhao, Liangcai Gao. 392-408 [doi]
- Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global ContextChongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding. 409-426 [doi]
- TextAdaIN: Paying Attention to Shortcut Learning in Text RecognizersOren Nuriel, Sharon Fogel, Ron Litman. 427-445 [doi]
- Multi-modal Text Recognition Networks: Interactive Enhancements Between Visual and Semantic FeaturesByeonghu Na, Yoonsik Kim, Sungrae Park. 446-463 [doi]
- SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text RecognitionDajian Zhong, Shujing Lyu, Palaiahnakote Shivakumara, Bing Yin, Jiajia Wu, Umapada Pal 0001, Yue Lu 0001. 464-480 [doi]
- Pure Transformer with Integrated Experts for Scene Text RecognitionYew Lee Tan, Adams Wai-Kin Kong, Jung-jae Kim. 481-497 [doi]
- OCR-Free Document Understanding TransformerGeewook Kim, Teakgyu Hong, Moonbin Yim, JeongYeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park. 498-517 [doi]
- CAR: Class-Aware Regularizations for Semantic SegmentationYe Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Linchao Bao, Xiangjian He. 518-534 [doi]
- Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic SegmentationYuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee. 535-552 [doi]
- SeqFormer: Sequential Transformer for Video Instance SegmentationJunfeng Wu, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai. 553-569 [doi]
- Saliency Hierarchy Modeling via Generative Kernels for Salient Object DetectionWenhu Zhang, Liangli Zheng, Huanyu Wang, Xintian Wu, Xi Li 0001. 570-587 [doi]
- In Defense of Online Models for Video Instance SegmentationJunfeng Wu, Qihao Liu, Yi Jiang, Song Bai, Alan L. Yuille, Xiang Bai. 588-605 [doi]
- Active Pointly-Supervised Instance SegmentationChufeng Tang, Lingxi Xie, Gang Zhang, Xiaopeng Zhang 0008, Qi Tian 0001, Xiaolin Hu 0001. 606-623 [doi]
- A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context MiningBowen Shi, Dongsheng Jiang, Xiaopeng Zhang 0008, Han Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian 0001. 624-639 [doi]
- XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory ModelHo Kei Cheng, Alexander G. Schwing. 640-658 [doi]
- Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous DrivingJiale Li, Hang Dai, Yong Ding 0003. 659-676 [doi]
- 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point CloudsYan Xu 0014, Jiantao Gao, Chaoda Zheng, Chao Zheng, Ruimao Zhang, Shuguang Cui, Zhen Li 0026. 677-695 [doi]
- Extract Free Dense Labels from CLIPChong Zhou, Chen Change Loy, Bo Dai. 696-712 [doi]
- 3D Compositional Zero-Shot Learning with DeCompositional ConsensusMuhammad Ferjad Naeem, Evin Pinar Örnek, Yongqin Xian, Luc Van Gool, Federico Tombari. 713-730 [doi]
- Video Mask Transfiner for High-Quality Video Instance SegmentationLei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu. 731-747 [doi]