Predicting juvenile offending: A comparison of data mining methods

Rebecca P. Ang*, Dion H. Goh

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Citations (Scopus)

Abstract

In this study, the authors compared logistic regression and predictive data mining techniques such as decision trees (DTs), artificial neural networks (ANNs), and support vector machines (SVMs), and examined these methods on whether they could discriminate between adolescents who were charged or not charged for initial juvenile offending in a large Asian sample. Results were validated and tested in independent samples with logistic regression and DT, ANN, and SVM classifiers achieving accuracy rates of 95% and above. Findings from receiver operating characteristic analyses also supported these results. In addition, the authors examined distinct patterns of occurrences within and across classifiers. Proactive aggression and teacher-rated conflict consistently emerged as risk factors across validation and testing data sets of DT and ANN classifiers, and logistic regression. Reactive aggression, narcissistic exploitativeness, being male, and coming from a nonintact family were risk factors that emerged in one or more of these data sets across classifiers, while anxiety and poor peer relationships failed to emerge as predictors.

Original languageEnglish
Pages (from-to)191-207
Number of pages17
JournalInternational Journal of Offender Therapy and Comparative Criminology
Volume57
Issue number2
DOIs
Publication statusPublished - Feb 2013
Externally publishedYes

ASJC Scopus Subject Areas

  • Pathology and Forensic Medicine
  • Arts and Humanities (miscellaneous)
  • Applied Psychology

Keywords

  • juvenile offending
  • predictive data mining

Fingerprint

Dive into the research topics of 'Predicting juvenile offending: A comparison of data mining methods'. Together they form a unique fingerprint.

Cite this