Abstract
In prescriptive analytics, unknown quantities are involved in practical decision-making problems, and these unknown quantities need to be predicted using auxiliary data. A classic approach is to develop machine learning (ML) models to generate point estimates, which are then input to the decision making problem in a deterministic manner to prescribe the optimal decision. However, the limited quantity and inevitable errors in the auxiliary data lead to inaccurate predictions and thus sub-optimal decisions. One viable approach to addressing the above issue is to consider the uncertainties in data by inputting the conditional distributions of the unknown quantities on the auxiliary data to the optimization problem on hand, and the distributions are predicted by regression ML models. Meanwhile, it is observed that the quantitative target in some problems are discrete, and these properties are analogous to categorical targets in classification problems. Considering the fact that describing and estimating the distribution of categorical variables are much easier than quantitative variables, this study innovatively develops random forest (RF) models with regression and classification features to generate the distribution of quantitative targets that are discrete. Especially, nodes splitting criteria in the RF models is in a regression manner, while the outputs of individual decision trees and the whole RF model is in a classification manner. Numerical experiments using real port state control (PSC) inspection records and settings at the Hong Kong port are conducted to validate and compare the above prescriptive analytics approaches. The superiority of applying the newly proposed RF model into the development of prescriptive analytics approaches is also demonstrated.
Original language | English |
---|---|
Article number | 106517 |
Journal | Computers and Operations Research |
Volume | 163 |
DOIs | |
Publication status | Published - Mar 2024 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2023 Elsevier Ltd
ASJC Scopus Subject Areas
- General Computer Science
- Modelling and Simulation
- Management Science and Operations Research
Keywords
- Data-driven decision making
- Machine learning
- Ordinal classification
- Prescriptive analytics