A framework for real-time traffic risk prediction incorporating cost-sensitive learning and dynamic thresholds

Dan Wu, Lu Xing*, Ye Li, Yiik Diew Wong, Jaeyoung Jay Lee, Changyin Dong

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In recent years, researchers have explored an innovative approach that leverages real vehicle trajectory data to simultaneously derive traffic state and risk level for real-time risk prediction, which is crucial for traffic safety. However, existing studies largely overlook the costs associated with incorrect predictions and the varying consequences of different misclassifications, which undermines the reliability of the obtained prediction results. To address these gaps, this study refined traffic risk classification into four levels (i.e., no, low, medium, and high risks) and incorporated misclassification costs into the prediction process through cost-sensitive learning (CSL). Furthermore, considering that multi-class prediction tasks often face performance degradation and increased risk level granularity worsens class imbalance, further amplifying this degradation, this study introduced dynamic thresholds (DTs) to improve model performance. The aforementioned cost coefficients and thresholds were pinpointed using a genetic algorithm (GA). Furthermore, the employed data, comprising variables related to traffic state and associated risk data, were sourced from the HighD dataset. Subsequently, CSL-DTs-based models were built by integrating CSL and DTs with four distinct baseline machine/deep learning models, and the prediction performance (e.g., precision) and computation time of these models were compared. Results show that, compared to the corresponding baseline models, the proposed models perform better for multi-class prediction tasks. Additionally, the computation time of the CSL-DTs-based models is found to be acceptable for real-time prediction purposes. Finally, to ensure the reliability of the results obtained through the GA optimization (e.g., avoiding local optima), convergence curves were plotted, confirming the robustness of the optimization process. A robustness analysis also demonstrates that the models are highly stable under slight perturbations of cost coefficients and thresholds, with minimal impact on performance. Findings of this study are expected to enhance the reliability of real-time traffic risk prediction, holding the promise of significantly promoting proactive traffic safety management.

Original languageEnglish
Article number108087
JournalAccident Analysis and Prevention
Volume218
DOIs
Publication statusPublished - Aug 2025
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2025 Elsevier Ltd

ASJC Scopus Subject Areas

  • Human Factors and Ergonomics
  • Safety, Risk, Reliability and Quality
  • Public Health, Environmental and Occupational Health
  • Law

Keywords

  • Cost-sensitive learning
  • Dynamic thresholds
  • Genetic algorithm
  • Risk prediction
  • Traffic safety

Fingerprint

Dive into the research topics of 'A framework for real-time traffic risk prediction incorporating cost-sensitive learning and dynamic thresholds'. Together they form a unique fingerprint.

Cite this