Abstract is missing.
- Hierarchical Part-Attention Networks for 3D Human ReconstructionJinwei Li, Yongkang Cheng, Yonghe Zhang, Pengcheng Wang. [doi]
- Moving Object Tracking based on Kernel and Random-coupled Neural NetworkYiran Chen, Haoran Liu, Mingzhe Liu, Yanhua Liu, Ruili Wang, Peng Li. [doi]
- SS-FS CSA: Self-Supervised and Fully Supervised Integration for 3D Cerebrovascular SegmentationChenxi Niu, Ziyu Liu, Xiangjian He. [doi]
- A method for detecting hands off the steering wheelYujia Xu, Deyu Pan, Ling Ding. [doi]
- ScaMo: Towards Text to Video Storyboard Generation Using Scale and Movement of ShotsXu Gu, Xihua Wang, Chuhao Jin, Ruihua Song. [doi]
- FeedMatch: Evolves for Semi-Supervised Multimedia Classification from Student FeedbackJunjiang Liu, Dandan Sun, Hailun Xia, Jiangtao Bai, Xinyue Fan. [doi]
- RandommaskFormer: Light Weight Remote Sensing Scene Classification with Masked TransformerXianbin Hu, Wei Wu, Zhu Li. [doi]
- Low-Light Image Enhancement via FourierTMamba: A Hybrid Frequency-Spatial ApproachShuwei Peng, Xu Zhang, Aiwen Jiang, Changhong Liu, Jihua Ye. [doi]
- Watermarking Vision-Language ModelsShan Wan, Wu Liu, Yijun Liu, Feiniu Yuan, Chunli Meng. [doi]
- Dlpp-Net: Degradation Location Prior Prediction Network for Image RestorationYongjian Liu, Shunwei Zhang, Jinyu Xu, Jiachen Li 0002, Yanchun Ma, Qing Xie 0002. [doi]
- SITransformer: Shared Information-Guided Transformer for Extreme Multimodal SummarizationSicheng Liu, Lintao Wang, Xiaogang Zhu, Xuequan Lu, Zhiyong Wang 0001, Kun Hu. [doi]
- Dual-Stream Keyframe Enhancement for Video Question AnsweringZhenzhen Hu, Xin Guan, Jia Li 0013, Zijie Song, Richang Hong. [doi]
- MambaVesselNet: A Hybrid CNN-Mamba Architecture for 3D Cerebrovascular SegmentationYanming Chen, Ziyu Liu, Xiangjian He. [doi]
- QoS-Diff: Adaptive Auto-tuning Framework for Low-latency Diffusion Model InferencePingyi Huo, Ajay Narayanan Sridhar, Md Fahim Faysal Khan, Kiwan Maeng, Vijaykrishnan Narayanan. [doi]
- TCFusion: A Three-branch Cross-domain Fusion Network for Infrared and Visible ImagesWenyu Shao, Hongbo Liu. [doi]
- MicroMamba: State Space Model with Partitioned Window Scan for Micro-Expression RecognitionTianchen Zhou, Jiateng Liu, Yue Jin, Li Yao. [doi]
- On the Robustness of Deep Face Inpainting: An Adversarial PerspectiveWenhao Gao, Zhenbo Song, Zhenyuan Zhang, Jianfeng Lu 0003. [doi]
- Fast Online Adaptation of Visual SLAM via Variational Information Transfer and PreservationSangni Xu, Hao Xiong 0001, Qiuxia Wu, Zhihui Wang, Shlomo Berkovsky, Zhiyong Wang. [doi]
- MRGait: A Multi-range feature learning framework for Cross-View Gait RecognitionMuhammad Saad Shakeel, Kun Liu 0029, Xiaochuan Liao, Wenxiong Kang. [doi]
- Feature-weighted Multi-stage Bayesian Prototype for Few-shot ClassificationXiaocong Zhou, Fan Liu 0003, Chuanyi Zhang, Feifan Li, Wenwen Cai, Jun Zhou 0001. [doi]
- DiffuseST: Unleashing the Capability of the Diffusion Model for Style TransferYing Hu, Chenyi Zhuang, Pan Gao. [doi]
- CISampler: Correlated Information Guided Frame Sampling for Gesture Recognition in VideoYuanyuan Shi, Yunan Li 0001, Huizhou Chen, Siyu Liang, Qiguang Miao. [doi]
- CSUNet: Contour-Sensitive Underwater Salient Object DetectionYu Wei, Yi Wang, Shijun Yan, Tianzhu Wang, Zhihan Wang, Weirong Sun, Yu Zhao, Xinwei Xue. [doi]
- MFTAnet: Two-step Aggregation Net of Multiscale Features for Pneumoconiosis ScreeningQingjin Wei, Xiaozhuo Li, Dinglu Liu, Zhiwu Liao. [doi]
- Local Feature-Emphasizing Transformer for Cloth-Changing Person Re-identificationJieqiong Zhou, Guoqing Zhang, Yuhui Zheng, Fuguo Zhang. [doi]
- Fire and Smoke Detection with Burning Intensity RepresentationXiaoyi Han, Yanfei Wu, Nan Pu, Zunlei Feng, Qifei Zhang, Yijun Bei, Lechao Cheng. [doi]
- Dual-stream Multi-modal Interactive Vision-language TrackingZhiyi Mo, Guangtong Zhang, Jian Nong, Bineng Zhong, Zhi Li. [doi]
- Multi-domain Acoustic Feature Fusion for Speaker RecognitionShanshan Yao, Tian Li. [doi]
- Layout Relationship Decoupling Framework for Multi-target Domain Adaptative Semantic SegmentationYuhang Zhang 0011, Cuixin Yang, Muxin Liao, Shishun Tian, Wenbin Zou, Chen Xu 0004. [doi]
- Point Cloud Normal Estimation via Representation Learning on Height MapsYang Yi, Dasith de Silva Edirimuni, Ye Zhu, Shang Gao 0003, Zhiyong Wang 0001, Antonio Robles-Kelly, Xuequan Lu. [doi]
- Advancing Multimodal LLMs: A Focus on Geometry Problem Solving Reasoning and Sequential ScoringRaj Jaiswal, Avinash Anand, Rajiv Ratn Shah. [doi]
- SpikMamba: When SNN meets Mamba in Event-based Human Action RecognitionJiaqi Chen, Yan Yang, Shizhuo Deng, Da Teng, Liyuan Pan. [doi]
- Description-Driven Audiovisual Embedding Space Learning for Enhanced Movie UnderstandingWei-Lun Huang, Shao-Hung Wu, Hung-Chang Huang, Min-Chun Hu 0001, Tse-Yu Pan. [doi]
- Flexible Semantic Watermarking for Robust Diffusion Model Detection and TracingZhitong Zhu, Jing Yu 0007, Keke Gai, Jiamin Zhuang, Gaopeng Gou, Gang Xiong 0001. [doi]
- Where You See Is What You Know: A Visual-Semantic Conceptual ExplainerLuhao Zhu, Xiangwei Kong, Runsen Li, Guodong Guo. [doi]
- Multi-Modality Semantic-Shared Cross-View Ground-to-Aerial LocalizationKai Zhang, Xia Yuan, Shuntong Chen, Di Hu, Chunxia Zhao. [doi]
- DCEPNet: Dual-Channel Emotional Perception Network for Speech Emotion RecognitionFei Xiang, Hongbo Liu 0001, Ruili Wang, Junjie Hou, Xingang Wang. [doi]
- MS-GeodesicPSIM: Predicting the Quality of Static Mesh with Texture Map via multi-scale Geodesic Patch SimilarityBingyang Cui, Yujie Zhang, Qi Yang 0003, Yiling Xu. [doi]
- Exploring Annotation-free Image Captioning with Retrieval-augmented Pseudo Sentence GenerationZhiyuan Li, Dongnan Liu, Heng Wang 0007, Chaoyi Zhang, Weidong Cai 0002. [doi]
- Sketch-based 3D Model Retrieval with Cross-Modal RepresentationHairui Yang, Ning Wang, Zhihui Wang, Lei Wang. [doi]
- Prompting Industrial Anomaly Segment with Large Vision-Language ModelsJinheng Zhou, Wu Liu, Guang Yang, He Zhao, Feiniu Yuan. [doi]
- Mix-fine-tune: An Alternate Fine-tuning Strategy for Domain Adaptation and Generalization of Low-resource ASRChengxi Lei, Satwinder Singh, Feng Hou, Ruili Wang. [doi]
- Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the WildTianqi Wei, Zhi Chen 0010, Xin Yu 0002. [doi]
- Dual-Enhanced Disentangled Multi-View ClusteringZhiqian Dong, Sheng Yang, Peng Zhou 0006. [doi]
- HuBERT-CLAP: Contrastive Learning-Based Multimodal Emotion Recognition using Self-Alignment ApproachLong H. Nguyen, Nhat Truong Pham, Mustaqeem Khan 0001, Alice Othmani, Abdulmotaleb El-Saddik. [doi]
- Incorporating Pre-ordering Representations for Low-resource Neural Machine TranslationYuan Gao, Feng Hou, Ruili Wang. [doi]
- Multimodal Energy Prompting for Video Salient Object DetectionTao Jiang, Feng Hou, Yi Wang. [doi]
- A Robust Few-shot Learning Framework via Dual-branch Adversarial Noise PretrainingJiale Wang, Xueliang Liu, Yuling Su. [doi]
- Adaptive Feature Inheritance and Thresholding for Ingredient Recognition in Multimedia Cooking InstructionsYixin Zhang, Yoko Yamakata, Keishi Tajima. [doi]
- Multimodal Sign Language Knowledge Graph and Representation: Text Video KeyFrames and Motion TrajectoriesZiqiang Liu, Gongwei Fang, Wentong Wang, Qiang Liu. [doi]
- CS-HOI: Human Object Interaction Detection Enhanced by Common SenseCheng-Kang Tan, Wei-Ta Chu. [doi]
- LMoW: A Latent Random Variable Model for Unconditional Human Motion GenerationFaisal Ahmed, Justin Rozeboom, Hanran Song, Chenqiu Zhao, Anup Basu. [doi]
- CFRL: Coarse-Fine Decoupled Representation Learning For Long-Tailed RecognitionYiran Song, Qianyu Zhou 0001, Kun Hu, Lizhuang Ma, Xuequan Lu. [doi]
- Personalized Sentiment Estimation Based on Recall and Resting Ratio of Frontal EEGShun Katada, Kazunori Komatani. [doi]
- LoopAnimate: Loopable Salient Object AnimationFanyi Wang, Peng Liu, Haotian Hu, Dan Meng, Jingwen Su, Jinjin Xu, Yanhao Zhang, Xiaoming Ren, Zhiwang Zhang. [doi]
- HSMnet: Hybrid Sampling and Matching Network for DETR-based Person SearchZhengjie Lu, Jinjia Peng, Huibing Wang, Qingxuan Shi, Bin Wang. [doi]
- MFNet: Mixed Feature Network for Enhancing Facial Emotion Recognition on the Small-Scale DatasetHuilin Chen. [doi]
- Active Object Segmentation: A New Modality for Egocentric Action RecognitionJian Ma, Bin Zhu 0006, Kun Li, Dima Damen. [doi]
- Development of a Chinese Synonym Library: Enhancing Clinical Terminology Standardization and InteroperabilityYani Chen, Jiaxiang E, Kaiyu Nie, Xiaoxia Nie, Ruili Wang. [doi]
- Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object DetectionXinhao Zhong, Siyu Jiao, Yao Zhao 0001, Yunchao Wei. [doi]
- Prompt-based Continual Learning for Extending Pretrained CLIP Models' KnowledgeLi Jiao, Lihong Cao, Tian Wang. [doi]
- Ultrasound Video Segmentation of Pubic Symphysis and Fetal Head for Angle of Progression MeasurementShuangping Chen, Huijin Wang, Shun Long, Jieyun Bai, Jianmei Jiang. [doi]
- SLIC: Secure Learned Image Codec through Compressed Domain Watermarking to Defend Image ManipulationChen-Hsiu Huang, Ja-Ling Wu. [doi]
- FA-UNext: A Feedback Attention-based MLP Network for Medical Image SegmentationQianyu Li, Bingcai Chen, Jiaxing Tian, Ruolan Liu. [doi]
- CA-OVS: Cluster and Adapt Mask Proposals for Open-Vocabulary Semantic SegmentationSon Duy Dao, Hengcan Shi, Dinh Q. Phung, Jianfei Cai 0001. [doi]
- RoboFormer: A Robust Multi-Modal Transformer for 3D Object Detection in Autonomous DrivingYuang Liu, Dacheng Liao, Mengshi Qi, Liang Liu 0001, Huadong Ma. [doi]
- BCS-NeRF: Bundle Cross-Sensing Neural Radiance FieldsMingwei Cao, Fengna Wang, Dengdi Sun, Haifeng Zhao 0001. [doi]
- Action Selection Learning for Multi-label Multi-view Action RecognitionTrung Thanh Nguyen, Yasutomo Kawanishi, Takahiro Komamizu, Ichiro Ide. [doi]
- Structured Bipartite Graph Ensemble ClusteringChen Wang, Feng Hou, Yi Wang, Ruili Wang. [doi]
- DocPointer: A parameter-efficient Pointer Network for Key Information ExtractionHaiPeng Li, Guangcun Wei, Haochen Xu, Boyan Guo. [doi]
- Emotionally Guided Symbolic Music Generation Using Diffusion Models: The AGE-DM ApproachMingzhe Zhang, Laura J. Ferris, Lin Yue, Miao Xu. [doi]
- ViCo: Engaging Video Comment Generation with Human Preference RewardsYuchong Sun, Bei Liu 0001, Xu Chen 0017, Ruihua Song, Jianlong Fu. [doi]
- A Benchmark for Gaussian Splatting Compression and Quality Assessment StudyQi Yang 0003, Kaifa Yang, Yuke Xing, Yiling Xu, Zhu Li 0001. [doi]
- Pitch-aware generative pretraining improves multi-pitch estimation with scarce dataMary Pilataki, Matthias Mauch, Simon Dixon. [doi]
- Highly Fault-Tolerant Discrete Lattice Information Coding Method for Screen-Shooting ScenariosDaidou Guo, Ching-Chun Chang, Cheng SenMao, Chuan Qin 0001. [doi]
- A Multi-scale Framework towards Human-Machine Friendly Remote Sensing Image CodingYingkai He, Zhen Zhang, Jing Xiao 0004. [doi]
- Transition in Focus of Prediction Tasks for Skeleton Graph Component Detection with TransformerZhiyuan Wang, Cong Yang, Yulu Zhang, Zeyd Boukhers, Wei Sui, Yi Ji 0001, Chunping Liu. [doi]
- Latent Variables Coding for Perceptual Image CompressionYingkai He, Zhen Zhang, Liang Liao, Jing Xiao. [doi]
- T2QRM: Text-Driven Quadruped Robot Motion GenerationMinghui Wang, Zixu Wang, Hongbin Xu, Kun Hu, Zhiyong Wang, Wenxiong Kang. [doi]
- Advancing Music Emotion Recognition: A Transformer Encoder-Based ApproachYangyuan Chen, Zhizhong Ma, Mingjing Wang, Mingzhe Liu. [doi]
- Adaptive Both homo- and hetero-Feature Integration for Multimodal Emotion RecognitionZe Kun Wang, Zhan-jun Si. [doi]
- OpenVideoWalls: an Open-Source System for Building Video Walls with Recycling Heterogeneous DisplaysZichen Zhu, Zhongze Tang, Amir Nassereldine, Jinjun Xiong, Sheng Wei 0001. [doi]
- Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the VideoTomoya Sugihara, Shuntaro Masuda, Ling Xiao 0001, Toshihiko Yamasaki. [doi]
- A Unified Editing Method for Co-Speech Gesture Generation via Diffusion InversionZeyu Zhao, Nan Gao, Zhi Zeng, Guixuan Zhang, Jie Liu, Shuwu Zhang. [doi]
- Point-Supervised Temporal Action Detection with Label Supplementation Based on TransformerCui Xu, Laiyun Qing. [doi]
- Accelerating Inference of Networks in the Frequency DomainChenqiu Zhao, Guanfang Dong, Anup Basu. [doi]
- Efficient Low-Dimensional Representation Via Manifold Learning-Based Model for Multimodal Sentiment AnalysisXingang Wang, Mengyi Wang, Hai Cui, Yijia Zhang 0001. [doi]
- StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-SpeechHaowei Lou, Hye-Young Paik, Wen Hu 0001, Lina Yao 0001. [doi]
- Fine-grained Video Semantic Distillation for Video-Text RetrievalZuyi Pei, Baoli Sun, Zhihui Wang 0001, Haojie Li. [doi]
- Emotion-Aware and Efficient Meme Sticker Dialogue GenerationZhaojun Guo, Junqiang Huang, Guobiao Li, Wanli Peng, Xinpeng Zhang 0001, Zhenxing Qian, Sheng Li 0006. [doi]
- Underwater Image Enhancement via Domain Adaptive Transfer Learning and Hybrid Reinforcement ModelTingting Yao, Yuan Gao, Zihao Feng, Qing Hu 0001, Zhiyong Wang 0001. [doi]
- Improving Sequential DeepFake Detection with Local information enhancementLongyun Dong, Yuanrong Xu, Jianping Zhong, Zhaobo Qi, Weigang Zhang. [doi]
- A Unified Contrastive Framework with Multi-Granularity Fusion for Text-to-Image GenerationYachao He, Li Liu, Huaxiang Zhang, Dongmei Liu, Hongzhen Li. [doi]
- Bivariate Mixup for 2D Contact Point Localization with Piezoelectric Microphone ArrayShogo Yonezawa, Yukinobu Taniguchi, Go Irie. [doi]
- S2FB IoU: Improving Boundary-based Object-Centric Image Segmentation Quality EvaluationRim El Filali, Soufiane Jdaba, Ronghui Xie, Ran Shi, Tong Qiao, Pan Qiaodong, Ting Wu 0001. [doi]
- KBY-Net: A Dual Learning Framework for Improving Object Detection in Rainy Weather ConditionsZheng-Xian Keh, Lai-Kuan Wong, Yuen Peng Loh, Ke Gu 0001, Weisi Lin. [doi]
- Joint Frame-Level and Block-Level Rate-Perception Optimized Preprocessing for Video CodingHuajie Tan, Guoqing Xiang, Xiaodong Xie, Huizhu Jia. [doi]
- Following in the Footsteps: Predicting Human Trajectories Using Motion Pattern MemoryYuxin Yang, Pengfei Zhu, Mengshi Qi, Huadong Ma. [doi]
- Focal Diffusion Process for Object-Aware 3D LiDAR GenerationHuijie Zhang, Xiaobai Liu. [doi]
- LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze DatasetRuikun Zhang, Hao Yang, Yan Yang, Ying Fu, Liyuan Pan. [doi]
- GGAvatar: Reconstructing Garment-Separated 3D Gaussian Splatting Avatars from Monocular VideoJingxuan Chen. [doi]
- CSCCap: Plugging Sparse Coding in Zero-Shot Image CaptioningYu Song, Xiaohui Yang, Rongping Huang, Haifeng Bai, Lili Yang. [doi]
- Incremental Few-Shot Object Detection by Leveraging External Information from Large Multimodal ModelsGuan Yu Wu, Wei-Ta Chu. [doi]
- An Information Cascade Prediction Algorithm Based on Time SeriesDongming Chen, Mingshuo Nie, Zhengping Sun, Huilin Chen, Dongqi Wang. [doi]
- A Multi-angle Text Recognition AlgorithmJie Wang, Huilin Chen, Wandong Xue, Dongming Chen, Dongqi Wang. [doi]
- TMM-CLIP: Task-guided Multi-Modal Alignment for Rehearsal-Free Class Incremental LearningYuankang Pan, Zhaoquan Yuan, Xiao Wu 0001, Zechao Li, Changsheng Xu. [doi]
- Policy-driven Auto-Augmentation with Distillment Rewards for Scene Text RecognitionPu Li, Yibiao Zhao, Xiaobai Liu. [doi]
- Variational Stochastic Multiple Auto-Encoder For Multimodal RecommendationYing Qiao, Aoxuan Chen, Xiang Li, Jinfei Gao. [doi]
- MSTMENet: Multi-Scale Spatio-Temporal Mapping and Evolution Network for Video DerainingFengqi Li, Mengchao Guo, Renxuan Xiong, Donglei Yang, Yi Wang 0037, Fengqiang Xu. [doi]
- CoolColor: Text-guided COherent OLd film COLORizationZichuan Huang, Yifan Li, Shuai Yang 0001, Jiaying Liu 0001. [doi]
- Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in MangaTakara Taniguchi, Ryosuke Furuta. [doi]
- STODINE: Decompose video to Object-centric Spatial-Temporal Slots for physical reasoningHaoyuan Zhang, Xiangyu Zhu 0001, Qu Tang, Zhaoxiang Zhang 0001, Zhen Lei 0001. [doi]
- Enhancing Modality Representation and Alignment for Multimodal Cold-start Active LearningMeng Shen 0002, Yake Wei, Jianxiong (Terry) Yin, Deepu Rajan, Di Hu 0001, Simon See. [doi]
- The Quantification of Emotional Expressions and Perceptions of Vocal Vibrato in Basic Emotion: Commercial Operatic Singing RecordingsJieying Liu. [doi]
- Fibre Population-guided Pre-training for 3D Spatial Super-Resolution on Multimodal Brain Diffusion MR ImagingZihao Tang, Xinyi Wang, Mariano Cabezas, Arkiev D'Souza, Michael Barnett 0006, Fernando Calamante, Weidong Cai 0001, Chenyu Wang 0001. [doi]
- ADP3D: Adaptive Point Selection for Efficient Multi-frame 3D Object DetectionGuohuan Gao, Gang Zhang, Xiangyang Xu. [doi]
- PCMark-NAS: Lightweight Print-Camera Resilient Watermarking Networks via Neural Architecture SearchDaidou Guo, Chuan Qin 0001. [doi]
- Robust discriminative and modal-consistent feature learning for fine-grained sketch-based image retrievalJunchao Ge, Huafeng Li, Yafei Zhang. [doi]
- Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion RecognitionChao Tan, Sheng Li, Yang Cao, Zhao Ren, Tanja Schultz. [doi]
- Multi-Frame Sparse Convolutional Learning for Point Cloud Color DenoisingTailin Yang, Wei Wu, Zhu Li, Rui Zhou. [doi]
- IdentityKD: Identity-wise Cross-modal Knowledge Distillation for Person Recognition via mmWave Radar SensorsLiqun Shan, Rujun Zhang, Sai Venkatesh Chilukoti, Xingli Zhang 0004, Insup Lee 0001, Xiali Hei 0001. [doi]
- Unified Multi-view Clustering based on Joint Multi-Structure Representation LearningSong Huang, Ziming Zeng, Min Li, Jianping Wang. [doi]
- Repetitive Action Counting with Feature Interaction Enhancement and Adaptive Gate FusionJiazhen Zhang, Kun Li, Yanyan Wei, Fei Wang, Wei Qian, Jinxing Zhou, Dan Guo. [doi]
- An Efficient Multi-prior Hybrid Approach for Consistent 3D Generation from Single ImagesYichen Ouyang, Jiayi Ye, Wenhao Chai, Dapeng Tao, Yibing Zhan, Gaoang Wang. [doi]
- MBC-ATA: Maximum Binary Classification and Anchor-based Triplet Augmentation for Unbiased Scene Graph GenerationHao Zhang, Xingning Dong, Jinfei Gao, Liang Hao, Pei Shen, Tian Gan. [doi]
- FATO: Frequency Attention Transformer for Omnidirectional Image Super-ResolutionHongyu An, Xinfeng Zhang 0001, Shijie Zhao 0001, Li Zhang 0006. [doi]
- Multi-stage Image Deraining based on Pre-trained Diffusion ModelXiong Zeng, Min Jiang, Ronghua Huang. [doi]
- MAFS: Modality-Aware Federated Semi-Supervised Learning with Selective Data Sharing Specified by Individual ClientsYi-Chen Li 0005, Chih-Fan Hsu, Jian-Kai Wang, Chung-Chi Tsai, Cheng-Hsin Hsu. [doi]
- HFS-HNeRV: High-Frequency Spectrum Hybrid Neural Representation for VideosJianhua Zhao, Xue Jun Li, Peter Han Joo Chong. [doi]
- FreqFormer: A Frequency Transformer for Semantic Segmentation of Remote Sensing ImagesXin Li 0090, Feng Xu 0008, Yao Tong, Fan Liu 0003, Yiwei Fang, Xin Lyu 0001, Jun Zhou 0001. [doi]
- MoE-Polyp: Shifting More Attention to Small Polyp Segmentation via Mixture-of-ExpertsZihuang Wu, Xinyu Xiong, Ying Chen, Siying Li, Hua Chen. [doi]