Discriminative sparse neighbor approximation for imbalanced learning

Chen Huang; Chen Change Loy; Xiaoou Tang

doi:10.1109/TNNLS.2017.2671845

Discriminative sparse neighbor approximation for imbalanced learning

Chen Huang, Chen Change Loy, Xiaoou Tang

Research output: Contribution to journal › Article › peer-review

24 Citations (Scopus)

Abstract

Data imbalance is common in many vision tasks where one or more classes are rare. Without addressing this issue, conventional methods tend to be biased toward the majority class with poor predictive accuracy for the minority class. These methods further deteriorate on small, imbalanced data that have a large degree of class overlap. In this paper, we propose a novel discriminative sparse neighbor approximation (DSNA) method to ameliorate the effect of class-imbalance during prediction. Specifically, given a test sample, we first traverse it through a cost-sensitive decision forest to collect a good subset of training examples in its local neighborhood. Then, we generate from this subset several class-discriminating but overlapping clusters and model each as an affine subspace. From these subspaces, the proposed DSNA iteratively seeks an optimal approximation of the test sample and outputs an unbiased prediction. We show that our method not only effectively mitigates the imbalance issue, but also allows the prediction to extrapolate to unseen data. The latter capability is crucial for achieving accurate prediction on small data set with limited samples. The proposed imbalanced learning method can be applied to both classification and regression tasks at a wide range of imbalance levels. It significantly outperforms the state-of-the-art methods that do not possess an imbalance handling mechanism, and is found to perform comparably or even better than recent deep learning methods by using hand-crafted features only.

Original language	English
Pages (from-to)	1503-1513
Number of pages	11
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	29
Issue number	5
DOIs	https://doi.org/10.1109/TNNLS.2017.2671845
Publication status	Published - May 2018
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 2012 IEEE.

ASJC Scopus Subject Areas

Software
Computer Science Applications
Computer Networks and Communications
Artificial Intelligence

Keywords

Data extrapolation
Decision forest
Discriminative sparse neighbor approximation (DSNA)
Imbalanced learning

Access to Document

10.1109/TNNLS.2017.2671845

Cite this

@article{25b548a5d281465d81d218788e49db95,

title = "Discriminative sparse neighbor approximation for imbalanced learning",

abstract = "Data imbalance is common in many vision tasks where one or more classes are rare. Without addressing this issue, conventional methods tend to be biased toward the majority class with poor predictive accuracy for the minority class. These methods further deteriorate on small, imbalanced data that have a large degree of class overlap. In this paper, we propose a novel discriminative sparse neighbor approximation (DSNA) method to ameliorate the effect of class-imbalance during prediction. Specifically, given a test sample, we first traverse it through a cost-sensitive decision forest to collect a good subset of training examples in its local neighborhood. Then, we generate from this subset several class-discriminating but overlapping clusters and model each as an affine subspace. From these subspaces, the proposed DSNA iteratively seeks an optimal approximation of the test sample and outputs an unbiased prediction. We show that our method not only effectively mitigates the imbalance issue, but also allows the prediction to extrapolate to unseen data. The latter capability is crucial for achieving accurate prediction on small data set with limited samples. The proposed imbalanced learning method can be applied to both classification and regression tasks at a wide range of imbalance levels. It significantly outperforms the state-of-the-art methods that do not possess an imbalance handling mechanism, and is found to perform comparably or even better than recent deep learning methods by using hand-crafted features only.",

keywords = "Data extrapolation, Decision forest, Discriminative sparse neighbor approximation (DSNA), Imbalanced learning",

author = "Chen Huang and Loy, \{Chen Change\} and Xiaoou Tang",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2018",

month = may,

doi = "10.1109/TNNLS.2017.2671845",

language = "English",

volume = "29",

pages = "1503--1513",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "5",

}

TY - JOUR

T1 - Discriminative sparse neighbor approximation for imbalanced learning

AU - Huang, Chen

AU - Loy, Chen Change

AU - Tang, Xiaoou

PY - 2018/5

Y1 - 2018/5

N2 - Data imbalance is common in many vision tasks where one or more classes are rare. Without addressing this issue, conventional methods tend to be biased toward the majority class with poor predictive accuracy for the minority class. These methods further deteriorate on small, imbalanced data that have a large degree of class overlap. In this paper, we propose a novel discriminative sparse neighbor approximation (DSNA) method to ameliorate the effect of class-imbalance during prediction. Specifically, given a test sample, we first traverse it through a cost-sensitive decision forest to collect a good subset of training examples in its local neighborhood. Then, we generate from this subset several class-discriminating but overlapping clusters and model each as an affine subspace. From these subspaces, the proposed DSNA iteratively seeks an optimal approximation of the test sample and outputs an unbiased prediction. We show that our method not only effectively mitigates the imbalance issue, but also allows the prediction to extrapolate to unseen data. The latter capability is crucial for achieving accurate prediction on small data set with limited samples. The proposed imbalanced learning method can be applied to both classification and regression tasks at a wide range of imbalance levels. It significantly outperforms the state-of-the-art methods that do not possess an imbalance handling mechanism, and is found to perform comparably or even better than recent deep learning methods by using hand-crafted features only.

AB - Data imbalance is common in many vision tasks where one or more classes are rare. Without addressing this issue, conventional methods tend to be biased toward the majority class with poor predictive accuracy for the minority class. These methods further deteriorate on small, imbalanced data that have a large degree of class overlap. In this paper, we propose a novel discriminative sparse neighbor approximation (DSNA) method to ameliorate the effect of class-imbalance during prediction. Specifically, given a test sample, we first traverse it through a cost-sensitive decision forest to collect a good subset of training examples in its local neighborhood. Then, we generate from this subset several class-discriminating but overlapping clusters and model each as an affine subspace. From these subspaces, the proposed DSNA iteratively seeks an optimal approximation of the test sample and outputs an unbiased prediction. We show that our method not only effectively mitigates the imbalance issue, but also allows the prediction to extrapolate to unseen data. The latter capability is crucial for achieving accurate prediction on small data set with limited samples. The proposed imbalanced learning method can be applied to both classification and regression tasks at a wide range of imbalance levels. It significantly outperforms the state-of-the-art methods that do not possess an imbalance handling mechanism, and is found to perform comparably or even better than recent deep learning methods by using hand-crafted features only.

KW - Data extrapolation

KW - Decision forest

KW - Discriminative sparse neighbor approximation (DSNA)

KW - Imbalanced learning

UR - http://www.scopus.com/inward/record.url?scp=85016470469&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85016470469&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2017.2671845

DO - 10.1109/TNNLS.2017.2671845

M3 - Article

C2 - 28362590

AN - SCOPUS:85016470469

SN - 2162-237X

VL - 29

SP - 1503

EP - 1513

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 5

ER -

Discriminative sparse neighbor approximation for imbalanced learning

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Cite this