TransCloudSeg : ground-based cloud image segmentation with transformer

Liu, Shuang and Zhang, Jiafeng and Zhang, Zhong and Cao, Xiaozhong and Durrani, Tariq S. (2022) TransCloudSeg : ground-based cloud image segmentation with transformer. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 15. pp. 6121-6132. ISSN 1939-1404 (

[thumbnail of Liu-etal-IEEE-JSTAEORS-2022-TransCloudSeg-ground-based-cloud-image-segmentation]
Text. Filename: Liu_etal_IEEE_JSTAEORS_2022_TransCloudSeg_ground_based_cloud_image_segmentation.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (3MB)| Preview


Cloud image segmentation plays an important role in ground-based cloud observation. Recently, most existing methods for ground-based cloud image segmentation learn feature representations using the convolutional neural network (CNN), which results in the loss of global information because of the limited receptive field size of the filters in the CNN. In this article, we propose a novel deep model named TransCloudSeg, which makes full use of the advantages of the CNN and transformer to extract detailed information and global contextual information for ground-based cloud image segmentation. Specifically, TransCloudSeg hybridizes the CNN and transformer as the encoders to obtain different features. To recover and fuse the feature maps from the encoders, we design the CNN decoder and the transformer decoder for TransCloudSeg. After obtaining two sets of feature maps from two different decoders, we propose the heterogeneous fusion module to effectively fuse the heterogeneous feature maps by applying the self-attention mechanism. We conduct a series of experiments on Tianjin Normal University large-scale cloud detection database and Tianjin Normal University cloud detection database, and the results show that our method achieves a better performance than other state-of-the-art methods, thus proving the effectiveness of the proposed TransCloudSeg.