Bagging for robust non-linear multivariate calibration of spectroscopy

Ke Wang; Tao Chen; Raymond Lau

doi:10.1016/j.chemolab.2010.10.004

Bagging for robust non-linear multivariate calibration of spectroscopy

Ke Wang, Tao Chen^*, Raymond Lau

^*Corresponding author for this work

Nanyang Technological University

Research output: Contribution to journal › Article › peer-review

36 Citations (Scopus)

Abstract

This paper presents the application of the bagging technique for non-linear regression models to obtain more accurate and robust calibration of spectroscopy. Bagging refers to the combination of multiple models obtained by bootstrap re-sampling with replacement into an ensemble model to reduce prediction errors. It is well suited to "non-robust" models, such as the non-linear calibration methods of artificial neural network (ANN) and Gaussian process regression (GPR), in which small changes in data or model parameters can result in significant change in model predictions. A specific variant of bagging, based on sub-sampling without replacement and named subagging, is also investigated, since it has been reported to possess similar prediction capability to bagging but requires less computation. However, this work shows that the calibration performance of subagging is sensitive to the amount of sub-sampled data, which needs to be determined by computationally intensive cross-validation. Therefore, we suggest that bagging is preferred to subagging in practice. Application study on two near infrared datasets demonstrates the effectiveness of the presented approach.

Original language	English
Pages (from-to)	1-6
Number of pages	6
Journal	Chemometrics and Intelligent Laboratory Systems
Volume	105
Issue number	1
DOIs	https://doi.org/10.1016/j.chemolab.2010.10.004
Publication status	Published - Jan 15 2011
Externally published	Yes

ASJC Scopus Subject Areas

Analytical Chemistry
Software
Computer Science Applications
Process Chemistry and Technology
Spectroscopy

Keywords

Bootstrap aggregating
Ensemble modelling
Near infrared spectroscopy
Non-linear calibration
Robust model

Access to Document

10.1016/j.chemolab.2010.10.004

Cite this

@article{ede70240b49644ef906bcf8089299760,

title = "Bagging for robust non-linear multivariate calibration of spectroscopy",

abstract = "This paper presents the application of the bagging technique for non-linear regression models to obtain more accurate and robust calibration of spectroscopy. Bagging refers to the combination of multiple models obtained by bootstrap re-sampling with replacement into an ensemble model to reduce prediction errors. It is well suited to {"}non-robust{"} models, such as the non-linear calibration methods of artificial neural network (ANN) and Gaussian process regression (GPR), in which small changes in data or model parameters can result in significant change in model predictions. A specific variant of bagging, based on sub-sampling without replacement and named subagging, is also investigated, since it has been reported to possess similar prediction capability to bagging but requires less computation. However, this work shows that the calibration performance of subagging is sensitive to the amount of sub-sampled data, which needs to be determined by computationally intensive cross-validation. Therefore, we suggest that bagging is preferred to subagging in practice. Application study on two near infrared datasets demonstrates the effectiveness of the presented approach.",

keywords = "Bootstrap aggregating, Ensemble modelling, Near infrared spectroscopy, Non-linear calibration, Robust model",

author = "Ke Wang and Tao Chen and Raymond Lau",

year = "2011",

month = jan,

day = "15",

doi = "10.1016/j.chemolab.2010.10.004",

language = "English",

volume = "105",

pages = "1--6",

journal = "Chemometrics and Intelligent Laboratory Systems",

issn = "0169-7439",

publisher = "Elsevier",

number = "1",

}

TY - JOUR

T1 - Bagging for robust non-linear multivariate calibration of spectroscopy

AU - Wang, Ke

AU - Chen, Tao

AU - Lau, Raymond

PY - 2011/1/15

Y1 - 2011/1/15

N2 - This paper presents the application of the bagging technique for non-linear regression models to obtain more accurate and robust calibration of spectroscopy. Bagging refers to the combination of multiple models obtained by bootstrap re-sampling with replacement into an ensemble model to reduce prediction errors. It is well suited to "non-robust" models, such as the non-linear calibration methods of artificial neural network (ANN) and Gaussian process regression (GPR), in which small changes in data or model parameters can result in significant change in model predictions. A specific variant of bagging, based on sub-sampling without replacement and named subagging, is also investigated, since it has been reported to possess similar prediction capability to bagging but requires less computation. However, this work shows that the calibration performance of subagging is sensitive to the amount of sub-sampled data, which needs to be determined by computationally intensive cross-validation. Therefore, we suggest that bagging is preferred to subagging in practice. Application study on two near infrared datasets demonstrates the effectiveness of the presented approach.

AB - This paper presents the application of the bagging technique for non-linear regression models to obtain more accurate and robust calibration of spectroscopy. Bagging refers to the combination of multiple models obtained by bootstrap re-sampling with replacement into an ensemble model to reduce prediction errors. It is well suited to "non-robust" models, such as the non-linear calibration methods of artificial neural network (ANN) and Gaussian process regression (GPR), in which small changes in data or model parameters can result in significant change in model predictions. A specific variant of bagging, based on sub-sampling without replacement and named subagging, is also investigated, since it has been reported to possess similar prediction capability to bagging but requires less computation. However, this work shows that the calibration performance of subagging is sensitive to the amount of sub-sampled data, which needs to be determined by computationally intensive cross-validation. Therefore, we suggest that bagging is preferred to subagging in practice. Application study on two near infrared datasets demonstrates the effectiveness of the presented approach.

KW - Bootstrap aggregating

KW - Ensemble modelling

KW - Near infrared spectroscopy

KW - Non-linear calibration

KW - Robust model

UR - http://www.scopus.com/inward/record.url?scp=78650948925&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650948925&partnerID=8YFLogxK

U2 - 10.1016/j.chemolab.2010.10.004

DO - 10.1016/j.chemolab.2010.10.004

M3 - Article

AN - SCOPUS:78650948925

SN - 0169-7439

VL - 105

SP - 1

EP - 6

JO - Chemometrics and Intelligent Laboratory Systems

JF - Chemometrics and Intelligent Laboratory Systems

IS - 1

ER -

Bagging for robust non-linear multivariate calibration of spectroscopy

Abstract

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Fingerprint

Cite this