Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey
Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao
2023, 20(4): 447-482. DOI: 10.1007/s11633-022-1410-8
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images
Wei-Chien Wang, Euijoon Ahn, Dagan Feng, Jinman Kim
2023, 20(4): 483-513. DOI: 10.1007/s11633-022-1406-4
Transformer: A General Framework from Machine Translation to Others
Yang Zhao, Jiajun Zhang, Chengqing Zong
2023, 20(4): 514-538. DOI: 10.1007/s11633-022-1393-5
Federated Learning on Multimodal Data: A Comprehensive Survey
Yi-Ming Lin, Yuan Gao, Mao-Guo Gong, Si-Jia Zhang, Yuan-Qiao Zhang, Zhi-Yuan Li
2023, 20(4): 539-553. DOI: 10.1007/s11633-022-1398-0
Region-adaptive Concept Aggregation for Few-shot Visual Recognition
Mengya Han, Yibing Zhan, Baosheng Yu, Yong Luo, Han Hu, Bo Du, Yonggang Wen, Dacheng Tao
2023, 20(4): 554-568. DOI: 10.1007/s11633-022-1358-8
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval
Haoyu Lu, Yuqi Huo, Mingyu Ding, Nanyi Fei, Zhiwu Lu
2023, 20(4): 569-582. DOI: 10.1007/s11633-022-1386-4
A New Diagnosis Method with Few-shot Learning Based on a Class-rebalance Strategy for Scarce Faults in Industrial Processes
Xinyao Xu, De Xu, Fangbo Qin
2023, 20(4): 583-594. DOI: 10.1007/s11633-022-1363-y
Speech Emotion Recognition Using Cascaded Attention Network with Joint Loss for Discrimination of Confusions
Yang Liu, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao
2023, 20(4): 595-604. DOI: 10.1007/s11633-022-1356-x