
Dataset Information


High-Resolution Swin Transformer for Automatic Medical Image Segmentation.


ABSTRACT: The resolution of feature maps is a critical factor for accurate medical image segmentation. Most existing Transformer-based networks for medical image segmentation adopt a U-Net-like architecture: an encoder converts the high-resolution input image into low-resolution feature maps through a sequence of Transformer blocks, and a decoder gradually generates high-resolution representations from those low-resolution feature maps. However, recovering high-resolution representations from low-resolution ones may harm the spatial precision of the generated segmentation masks. Unlike previous studies, we adopt the high-resolution network (HRNet) design style, replacing the convolutional layers with Transformer blocks and continuously exchanging information among the feature maps of different resolutions generated by those blocks. The proposed Transformer-based network is named the high-resolution Swin Transformer network (HRSTNet). Extensive experiments demonstrated that HRSTNet can achieve performance comparable with that of the state-of-the-art Transformer-based U-Net-like architecture on the 2021 Brain Tumor Segmentation dataset, the Medical Segmentation Decathlon's liver dataset, and the BTCV multi-organ segmentation dataset.
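
The idea of exchanging information between feature maps of different resolutions can be illustrated with a toy sketch. This is not the authors' implementation: the record does not describe the fusion operators HRSTNet uses, so average pooling, nearest-neighbour upsampling, and element-wise addition are assumptions chosen only to keep the example small, and the function names (`downsample`, `upsample`, `exchange`) are hypothetical.

```python
# Toy sketch of HRNet-style cross-resolution exchange between two branches.
# Assumptions (not from the source): average pooling for downsampling,
# nearest-neighbour upsampling, and element-wise addition as the fusion step.

def downsample(fmap, factor=2):
    """Average-pool a 2D feature map (list of rows) by `factor`."""
    h, w = len(fmap), len(fmap[0])
    return [
        [
            sum(fmap[i + di][j + dj]
                for di in range(factor)
                for dj in range(factor)) / (factor * factor)
            for j in range(0, w, factor)
        ]
        for i in range(0, h, factor)
    ]

def upsample(fmap, factor=2):
    """Nearest-neighbour upsample a 2D feature map by `factor`."""
    h, w = len(fmap), len(fmap[0])
    return [
        [fmap[i // factor][j // factor] for j in range(w * factor)]
        for i in range(h * factor)
    ]

def add(a, b):
    """Element-wise sum of two feature maps of equal shape."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

def exchange(high, low, factor=2):
    """Each branch keeps its own resolution and receives the other's features."""
    new_high = add(high, upsample(low, factor))
    new_low = add(low, downsample(high, factor))
    return new_high, new_low

# A 4x4 high-resolution map and a 2x2 low-resolution map.
high = [[1.0, 2.0, 3.0, 4.0],
        [5.0, 6.0, 7.0, 8.0],
        [9.0, 10.0, 11.0, 12.0],
        [13.0, 14.0, 15.0, 16.0]]
low = [[10.0, 20.0],
       [30.0, 40.0]]

new_high, new_low = exchange(high, low)
```

The point of the sketch is that the high-resolution branch is never discarded and rebuilt: it stays at full resolution throughout and is only enriched by the coarser branch, which is the contrast with the encoder-decoder recovery step described in the abstract.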

SUBMITTER: Wei C 

PROVIDER: S-EPMC10099222 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature


Publications

High-Resolution Swin Transformer for Automatic Medical Image Segmentation.

Wei Chen, Ren Shenghan, Guo Kaitai, Hu Haihong, Liang Jimin

Sensors (Basel, Switzerland), 2023-03-24, issue 7



Similar Datasets

S-EPMC10661918 | biostudies-literature
S-EPMC10909362 | biostudies-literature
S-EPMC10495965 | biostudies-literature
S-EPMC8501087 | biostudies-literature
S-EPMC11651933 | biostudies-literature
S-EPMC10773825 | biostudies-literature
S-EPMC10703013 | biostudies-literature
S-EPMC9221215 | biostudies-literature
S-EPMC9575930 | biostudies-literature
S-EPMC10994332 | biostudies-literature