Enhancing pixel-level crack segmentation with visual mamba and convolutional networks

Chengjia Han, Handuo Yang*, Yaowen Yang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.

Original languageEnglish
Article number105770
JournalAutomation in Construction
Volume168
DOIs
Publication statusPublished - Dec 1 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

ASJC Scopus Subject Areas

  • Control and Systems Engineering
  • Civil and Structural Engineering
  • Building and Construction

Keywords

  • Artificial intelligence
  • Convolutional neural networks
  • Crack detection
  • Semantic segmentation
  • Vision Mamba

Fingerprint

Dive into the research topics of 'Enhancing pixel-level crack segmentation with visual mamba and convolutional networks'. Together they form a unique fingerprint.

Cite this