Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

Publikation: Working paperPreprintForskning

Standard

Mamba-UNet : UNet-Like Pure Visual Mamba for Medical Image Segmentation. / Wang, Ziyang; Zheng, Jian-Qing; Zhang, Yichi; Cui, Ge; Li, Lei.

2024.

Publikation: Working paperPreprintForskning

Harvard

Wang, Z, Zheng, J-Q, Zhang, Y, Cui, G & Li, L 2024 'Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation'.

APA

Wang, Z., Zheng, J-Q., Zhang, Y., Cui, G., & Li, L. (2024). Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation.

Vancouver

Wang Z, Zheng J-Q, Zhang Y, Cui G, Li L. Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation. 2024 feb. 7.

Author

Wang, Ziyang ; Zheng, Jian-Qing ; Zhang, Yichi ; Cui, Ge ; Li, Lei. / Mamba-UNet : UNet-Like Pure Visual Mamba for Medical Image Segmentation. 2024.

Bibtex

@techreport{f11ef4b0da244b639c5992df0daec6ed,
title = "Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation",
abstract = " In recent advancements in medical image analysis, Convolutional Neural Networks (CNN) and Vision Transformers (ViT) have set significant benchmarks. While the former excels in capturing local features through its convolution operations, the latter achieves remarkable global context understanding by leveraging self-attention mechanisms. However, both architectures exhibit limitations in efficiently modeling long-range dependencies within medical images, which is a critical aspect for precise segmentation. Inspired by the Mamba architecture, known for its proficiency in handling long sequences and global contextual information with enhanced computational efficiency as a State Space Model (SSM), we propose Mamba-UNet, a novel architecture that synergizes the U-Net in medical image segmentation with Mamba's capability. Mamba-UNet adopts a pure Visual Mamba (VMamba)-based encoder-decoder structure, infused with skip connections to preserve spatial information across different scales of the network. This design facilitates a comprehensive feature learning process, capturing intricate details and broader semantic contexts within medical images. We introduce a novel integration mechanism within the VMamba blocks to ensure seamless connectivity and information flow between the encoder and decoder paths, enhancing the segmentation performance. We conducted experiments on publicly available ACDC MRI Cardiac segmentation dataset, and Synapse CT Abdomen segmentation dataset. The results show that Mamba-UNet outperforms several types of UNet in medical image segmentation under the same hyper-parameter setting. The source code and baseline implementations are available. ",
keywords = "eess.IV, cs.CV",
author = "Ziyang Wang and Jian-Qing Zheng and Yichi Zhang and Ge Cui and Lei Li",
year = "2024",
month = feb,
day = "7",
language = "Udefineret/Ukendt",
type = "WorkingPaper",

}

RIS

TY - UNPB

T1 - Mamba-UNet

T2 - UNet-Like Pure Visual Mamba for Medical Image Segmentation

AU - Wang, Ziyang

AU - Zheng, Jian-Qing

AU - Zhang, Yichi

AU - Cui, Ge

AU - Li, Lei

PY - 2024/2/7

Y1 - 2024/2/7

N2 - In recent advancements in medical image analysis, Convolutional Neural Networks (CNN) and Vision Transformers (ViT) have set significant benchmarks. While the former excels in capturing local features through its convolution operations, the latter achieves remarkable global context understanding by leveraging self-attention mechanisms. However, both architectures exhibit limitations in efficiently modeling long-range dependencies within medical images, which is a critical aspect for precise segmentation. Inspired by the Mamba architecture, known for its proficiency in handling long sequences and global contextual information with enhanced computational efficiency as a State Space Model (SSM), we propose Mamba-UNet, a novel architecture that synergizes the U-Net in medical image segmentation with Mamba's capability. Mamba-UNet adopts a pure Visual Mamba (VMamba)-based encoder-decoder structure, infused with skip connections to preserve spatial information across different scales of the network. This design facilitates a comprehensive feature learning process, capturing intricate details and broader semantic contexts within medical images. We introduce a novel integration mechanism within the VMamba blocks to ensure seamless connectivity and information flow between the encoder and decoder paths, enhancing the segmentation performance. We conducted experiments on publicly available ACDC MRI Cardiac segmentation dataset, and Synapse CT Abdomen segmentation dataset. The results show that Mamba-UNet outperforms several types of UNet in medical image segmentation under the same hyper-parameter setting. The source code and baseline implementations are available.

AB - In recent advancements in medical image analysis, Convolutional Neural Networks (CNN) and Vision Transformers (ViT) have set significant benchmarks. While the former excels in capturing local features through its convolution operations, the latter achieves remarkable global context understanding by leveraging self-attention mechanisms. However, both architectures exhibit limitations in efficiently modeling long-range dependencies within medical images, which is a critical aspect for precise segmentation. Inspired by the Mamba architecture, known for its proficiency in handling long sequences and global contextual information with enhanced computational efficiency as a State Space Model (SSM), we propose Mamba-UNet, a novel architecture that synergizes the U-Net in medical image segmentation with Mamba's capability. Mamba-UNet adopts a pure Visual Mamba (VMamba)-based encoder-decoder structure, infused with skip connections to preserve spatial information across different scales of the network. This design facilitates a comprehensive feature learning process, capturing intricate details and broader semantic contexts within medical images. We introduce a novel integration mechanism within the VMamba blocks to ensure seamless connectivity and information flow between the encoder and decoder paths, enhancing the segmentation performance. We conducted experiments on publicly available ACDC MRI Cardiac segmentation dataset, and Synapse CT Abdomen segmentation dataset. The results show that Mamba-UNet outperforms several types of UNet in medical image segmentation under the same hyper-parameter setting. The source code and baseline implementations are available.

KW - eess.IV

KW - cs.CV

M3 - Preprint

BT - Mamba-UNet

ER -

ID: 395360776