Small Sample Image Segmentation By Coupling Convolutions and Transformers


The proposed image segmentation method which contains an ICCT-UNet and a Residual Fusion Module (RFM).

Abstract
Compared with natural image segmentation, small sample image segmentation tasks, such as medical image segmentation and defect detection, have been less studied. Recent studies made efforts on bringing together Convolutional Neural Networks (CNNs) and Transformers in a serial or interleaved architecture in order to incorporate long-range dependencies into the features extracted using CNNs. In this study, we argue that these architectures limit the capability of the combination of CNNs and Transformers. To this end, we propose a dual-stream small sample image segmentation network, namely, the Interactive Coupling of Convolutions and Transformers Based UNet (ICCT-UNet), motivated by the success achieved using the UNet in the scenario of small sample image segmentation. Within this network, a CNN stream is paralleled with a Transformer stream while maintaining feature exchange inside each block through the proposed Window-Based Multi-head Cross-Attention (W-MHCA) mechanism. To derive an overall segmentation, the features learned by both the streams are further fused using a Residual Fusion Module (RFM).
Experimental Results

Quantitative Results on Medical Image Segmentation datasets Synapse, ACDC.


Qualitative Results on Medical Image Segmentation and Defect Image Segmentation of (a) UNet, (b) TransUnet, (c) SwinUnet and ours (d) ICCTUNet-T, (e) ICCTUNet-C, (f) ICCTUNet-F and (g) Ground-Truth