column
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey
Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao
2023, 20(4): 447-482. DOI: 10.1007/s11633-022-1410-8
Abstract PDF SpringerLink Bib
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images
Wei-Chien Wang, Euijoon Ahn, Dagan Feng, Jinman Kim
2023, 20(4): 483-513. DOI: 10.1007/s11633-022-1406-4
Abstract PDF SpringerLink Bib
Transformer: A General Framework from Machine Translation to Others
Yang Zhao, Jiajun Zhang, Chengqing Zong
2023, 20(4): 514-538. DOI: 10.1007/s11633-022-1393-5
Abstract PDF SpringerLink Bib
Federated Learning on Multimodal Data: A Comprehensive Survey
Yi-Ming Lin, Yuan Gao, Mao-Guo Gong, Si-Jia Zhang, Yuan-Qiao Zhang, Zhi-Yuan Li
2023, 20(4): 539-553. DOI: 10.1007/s11633-022-1398-0
Abstract PDF SpringerLink Bib
Region-adaptive Concept Aggregation for Few-shot Visual Recognition
Mengya Han, Yibing Zhan, Baosheng Yu, Yong Luo, Han Hu, Bo Du, Yonggang Wen, Dacheng Tao
2023, 20(4): 554-568. DOI: 10.1007/s11633-022-1358-8
Abstract PDF SpringerLink Bib
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval
Haoyu Lu, Yuqi Huo, Mingyu Ding, Nanyi Fei, Zhiwu Lu
2023, 20(4): 569-582. DOI: 10.1007/s11633-022-1386-4
Abstract PDF SpringerLink Bib
A New Diagnosis Method with Few-shot Learning Based on a Class-rebalance Strategy for Scarce Faults in Industrial Processes
Xinyao Xu, De Xu, Fangbo Qin
2023, 20(4): 583-594. DOI: 10.1007/s11633-022-1363-y
Abstract PDF SpringerLink Bib
Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions
Yang Liu, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao
2023, 20(4): 595-604. DOI: 10.1007/s11633-022-1356-x
Abstract PDF SpringerLink Bib