Unsupervised anomaly segmentation model for rail damage based on image-inpainting and cold diffusion

Chengjia Han; Yiqing Dong; Maggie Y. Gao; Liwei Dong; Yaowen Yang

doi:10.1016/j.autcon.2025.106342

Chengjia Han, Yiqing Dong, Maggie Y. Gao, Liwei Dong^*, Yaowen Yang

^*Corresponding author for this work

Nanyang Technological University

Research output: Contribution to journal › Article › peer-review

Abstract

Ensuring structural health of rail tracks is critical for safe train operations. While deep learning-based vision models are widely used for rail damage detection, supervised methods suffer from limited generalization due to scarce and diverse annotated data. Unsupervised models often experience missed detections and false positives when handling complex and variable rail background textures, as well as rail damage with significant intra-class variability. To address these limitations, this paper proposes an unsupervised pixel-level rail damage segmentation model based on a cold diffusion framework, called InpRailDiffusion. It introduces inpainting-based noise and uses a Mamba-enhanced, time-conditioned U-Net for progressive noise removal. Damage segmentation is achieved by analyzing pixel-wise differences between generated and original images with adaptive thresholding. A multi-scale masking strategy fuses reconstruction features at various spatial resolutions, reducing false positives and missed detections. Evaluated on RSDDs-I and RSDDs-II, InpRailDiffusion outperformed state-of-the-art baselines with MIoU/F1-Scores of 0.864/0.844 and 0.845/0.814, respectively.

Original language	English
Article number	106342
Journal	Automation in Construction
Volume	177
DOIs	https://doi.org/10.1016/j.autcon.2025.106342
Publication status	Published - Sept 2025
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 2024

ASJC Scopus Subject Areas

Control and Systems Engineering
Civil and Structural Engineering
Building and Construction

Keywords

Diffusion model
Rail defects
Railway engineering
Semantic segmentation
Structural health monitoring
Unsupervised anomaly detection

Cite this

@article{632b246905c3409d84a56f8ee6abfeaf,
title = "Unsupervised anomaly segmentation model for rail damage based on image-inpainting and cold diffusion",
abstract = "Ensuring structural health of rail tracks is critical for safe train operations. While deep learning-based vision models are widely used for rail damage detection, supervised methods suffer from limited generalization due to scarce and diverse annotated data. Unsupervised models often experience missed detections and false positives when handling complex and variable rail background textures, as well as rail damage with significant intra-class variability. To address these limitations, this paper proposes an unsupervised pixel-level rail damage segmentation model based on a cold diffusion framework, called InpRailDiffusion. It introduces inpainting-based noise and uses a Mamba-enhanced, time-conditioned U-Net for progressive noise removal. Damage segmentation is achieved by analyzing pixel-wise differences between generated and original images with adaptive thresholding. A multi-scale masking strategy fuses reconstruction features at various spatial resolutions, reducing false positives and missed detections. Evaluated on RSDDs-I and RSDDs-II, InpRailDiffusion outperformed state-of-the-art baselines with MIoU/F1-Scores of 0.864/0.844 and 0.845/0.814, respectively.",
keywords = "Diffusion model, Rail defects, Railway engineering, Semantic segmentation, Structural health monitoring, Unsupervised anomaly detection",
author = "Chengjia Han and Yiqing Dong and Gao, {Maggie Y.} and Liwei Dong and Yaowen Yang",
note = "Publisher Copyright: {\textcopyright} 2024",
year = "2025",
month = sep,
doi = "10.1016/j.autcon.2025.106342",
language = "English",
volume = "177",
journal = "Automation in Construction",
issn = "0926-5805",
publisher = "Elsevier",
}

TY  - JOUR
T1  - Unsupervised anomaly segmentation model for rail damage based on image-inpainting and cold diffusion
AU  - Han, Chengjia
AU  - Dong, Yiqing
AU  - Gao, Maggie Y.
AU  - Dong, Liwei
AU  - Yang, Yaowen
PY  - 2025/9
Y1  - 2025/9
N2  - Ensuring structural health of rail tracks is critical for safe train operations. While deep learning-based vision models are widely used for rail damage detection, supervised methods suffer from limited generalization due to scarce and diverse annotated data. Unsupervised models often experience missed detections and false positives when handling complex and variable rail background textures, as well as rail damage with significant intra-class variability. To address these limitations, this paper proposes an unsupervised pixel-level rail damage segmentation model based on a cold diffusion framework, called InpRailDiffusion. It introduces inpainting-based noise and uses a Mamba-enhanced, time-conditioned U-Net for progressive noise removal. Damage segmentation is achieved by analyzing pixel-wise differences between generated and original images with adaptive thresholding. A multi-scale masking strategy fuses reconstruction features at various spatial resolutions, reducing false positives and missed detections. Evaluated on RSDDs-I and RSDDs-II, InpRailDiffusion outperformed state-of-the-art baselines with MIoU/F1-Scores of 0.864/0.844 and 0.845/0.814, respectively.
AB  - Ensuring structural health of rail tracks is critical for safe train operations. While deep learning-based vision models are widely used for rail damage detection, supervised methods suffer from limited generalization due to scarce and diverse annotated data. Unsupervised models often experience missed detections and false positives when handling complex and variable rail background textures, as well as rail damage with significant intra-class variability. To address these limitations, this paper proposes an unsupervised pixel-level rail damage segmentation model based on a cold diffusion framework, called InpRailDiffusion. It introduces inpainting-based noise and uses a Mamba-enhanced, time-conditioned U-Net for progressive noise removal. Damage segmentation is achieved by analyzing pixel-wise differences between generated and original images with adaptive thresholding. A multi-scale masking strategy fuses reconstruction features at various spatial resolutions, reducing false positives and missed detections. Evaluated on RSDDs-I and RSDDs-II, InpRailDiffusion outperformed state-of-the-art baselines with MIoU/F1-Scores of 0.864/0.844 and 0.845/0.814, respectively.
KW  - Diffusion model
KW  - Rail defects
KW  - Railway engineering
KW  - Semantic segmentation
KW  - Structural health monitoring
KW  - Unsupervised anomaly detection
UR  - http://www.scopus.com/inward/record.url?scp=105007985142&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=105007985142&partnerID=8YFLogxK
U2  - 10.1016/j.autcon.2025.106342
DO  - 10.1016/j.autcon.2025.106342
M3  - Article
AN  - SCOPUS:105007985142
SN  - 0926-5805
VL  - 177
JO  - Automation in Construction
JF  - Automation in Construction
M1  - 106342
ER  -
