Enhancing pixel-level crack segmentation with visual mamba and convolutional networks

Chengjia Han; Handuo Yang; Yaowen Yang

doi:10.1016/j.autcon.2024.105770

Enhancing pixel-level crack segmentation with visual mamba and convolutional networks

Chengjia Han, Handuo Yang^*, Yaowen Yang

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

17 Citations (Scopus)

Abstract

Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.

Original language	English
Article number	105770
Journal	Automation in Construction
Volume	168
DOIs	https://doi.org/10.1016/j.autcon.2024.105770
Publication status	Published - Dec 1 2024
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 2024 Elsevier B.V.

ASJC Scopus Subject Areas

Control and Systems Engineering
Civil and Structural Engineering
Building and Construction

Keywords

Artificial intelligence
Convolutional neural networks
Crack detection
Semantic segmentation
Vision Mamba

Access to Document

10.1016/j.autcon.2024.105770

Cite this

@article{7dfa852e5ee24d1eb76d787d69cc8813,

title = "Enhancing pixel-level crack segmentation with visual mamba and convolutional networks",

abstract = "Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.",

keywords = "Artificial intelligence, Convolutional neural networks, Crack detection, Semantic segmentation, Vision Mamba",

author = "Chengjia Han and Handuo Yang and Yaowen Yang",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier B.V.",

year = "2024",

month = dec,

day = "1",

doi = "10.1016/j.autcon.2024.105770",

language = "English",

volume = "168",

journal = "Automation in Construction",

issn = "0926-5805",

publisher = "Elsevier",

}

TY - JOUR

T1 - Enhancing pixel-level crack segmentation with visual mamba and convolutional networks

AU - Han, Chengjia

AU - Yang, Handuo

AU - Yang, Yaowen

PY - 2024/12/1

Y1 - 2024/12/1

N2 - Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.

AB - Computer vision-based semantic segmentation methods are currently the most widely used for automated detection of structural cracks in buildings and pavements. However, these methods face persistent challenges in detecting fine cracks with small widths and in distinguishing cracks from background stains. This paper addresses these issues by introducing MambaCrackNet, a new network architecture for pixel-level crack segmentation. MambaCrackNet incorporates residual visual Mamba blocks and integrates visual Mamba and convolutional neural network-based segmentation techniques. This approach effectively enhances the detection of fine cracks, reduces misdetections of background stains, and remains robust to variations in patch size and training sample sizes, making it highly practical for engineering applications. On two open access crack datasets, MambaCrackNet outperformed mainstream crack segmentation models, achieving MIoU scores of 0.8939 and 0.8560 and F1-scores of 0.8817 and 0.8412.

KW - Artificial intelligence

KW - Convolutional neural networks

KW - Crack detection

KW - Semantic segmentation

KW - Vision Mamba

UR - http://www.scopus.com/inward/record.url?scp=85203428114&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85203428114&partnerID=8YFLogxK

U2 - 10.1016/j.autcon.2024.105770

DO - 10.1016/j.autcon.2024.105770

M3 - Article

AN - SCOPUS:85203428114

SN - 0926-5805

VL - 168

JO - Automation in Construction

JF - Automation in Construction

M1 - 105770

ER -

Enhancing pixel-level crack segmentation with visual mamba and convolutional networks

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Fingerprint

Cite this