447 | -- | 482 | Xiao Wang 0014, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian 0001, Wen Gao 0001. Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey |
483 | -- | 513 | Wei-Chien Wang, Euijoon Ahn, David Feng 0003, Jinman Kim. A Review of Predictive and Contrastive Self-supervised Learning for Medical Images |
514 | -- | 538 | Yang Zhao, Jiajun Zhang 0001, Chengqing Zong. Transformer: A General Framework from Machine Translation to Others |
539 | -- | 553 | Yi-Ming Lin, Yuan Gao, Maoguo Gong, Sijia Zhang, Yuan-Qiao Zhang, Zhi-yuan Li. Federated Learning on Multimodal Data: A Comprehensive Survey |
554 | -- | 568 | Mengya Han, Yibing Zhan, Baosheng Yu, Yong Luo 0002, Han Hu 0003, Bo Du 0001, Yonggang Wen 0001, Dacheng Tao. Region-adaptive Concept Aggregation for Few-shot Visual Recognition |
569 | -- | 582 | Haoyu Lu, Yuqi Huo, Mingyu Ding, Nanyi Fei, Zhiwu Lu 0001. Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval |
583 | -- | 594 | Xinyao Xu, De Xu, Fangbo Qin. A New Diagnosis Method with Few-shot Learning Based on a Class-rebalance Strategy for Scarce Faults in Industrial Processes |
595 | -- | 604 | Yang Liu, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao 0006. Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions |