Journal of Geodesy and Geoinformation Science ›› 2023, Vol. 6 ›› Issue (4): 40-47.doi: 10.11947/j.JGGS.2023.0404

Previous Articles     Next Articles

CFM-UNet: A Joint CNN and Transformer Network via Cross Feature Modulation for Remote Sensing Images Segmentation

Min WANG1(), Peidong WANG2()   

  1. 1. School of Urban Construction Engineering, Guangzhou City Polytechnic, Guangzhou 510800, China
    2. Guangdong Mechanical & Electrical Polytechnic, Guangzhou 510515, China
  • Received:2023-09-18 Accepted:2023-11-23 Online:2023-12-20 Published:2024-02-06
  • Contact: Peidong WANG E-mail:wangmin@gcp.edu.cn;wpd_jxnc@163.com
  • Supported by:
    Young Innovative Talents Project of Guangdong Ordinary Universities(2022KQNCX225);School-level Teaching and Research Project of Guangzhou City Polytechnic(2022xky046)

Abstract:

The semantic segmentation methods based on CNN have made great progress, but there are still some shortcomings in the application of remote sensing images segmentation, such as the small receptive field can not effectively capture global context. In order to solve this problem, this paper proposes a hybrid model based on ResNet50 and swin transformer to directly capture long-range dependence, which fuses features through Cross Feature Modulation Module(CFMM). Experimental results on two publicly available datasets, Vaihingen and Potsdam, are mIoU of 70.27% and 76.63%, respectively. Thus, CFM-UNet can maintain a high segmentation performance compared with other competitive networks.

Key words: remote sensing images; semantic segmentation; swin transformer; feature modulation module