Prime Sample Attention in Object Detection

Yuhang Cao; Kai Chen; Chen Change Loy; Dahua Lin

doi:10.1109/CVPR42600.2020.01160

Prime Sample Attention in Object Detection

Yuhang Cao, Kai Chen, Chen Change Loy, Dahua Lin

Research output: Contribution to journal › Conference article › peer-review

200 Citations (Scopus)

Abstract

It is a common paradigm in object detection frameworks to treat all samples equally and target at maximizing the performance on average. In this work, we revisit this paradigm through a careful study on how different samples contribute to the overall performance measured in terms of mAP. Our study suggests that the samples in each mini-batch are neither independent nor equally important, and therefore a better classifier on average does not necessarily result in higher mAP. Motivated by this study, we propose the notion of Prime Samples, those that play a key role in driving the detection performance. We further develop a simple yet effective sampling and learning strategy called PrIme Sample Attention (PISA) that directs the focus of the training process towards such samples. Our experiments demonstrate that it is often more effective to focus on prime samples than hard samples when training a detector. Particularly, on the MSCOCO dataset, PISA outperforms the random sampling baseline and hard mining schemes, eg~OHEM and Focal Loss, consistently by around 2% on both single-stage and two-stage detectors, even with a strong backbone ResNeXt-101. Code is available at: url{https://github.com/open-mmlab/mmdetection}.

Original language	English
Article number	9157482
Pages (from-to)	11580-11588
Number of pages	9
Journal	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOIs	https://doi.org/10.1109/CVPR42600.2020.01160
Publication status	Published - 2020
Externally published	Yes
Event	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020 - Virtual, Online, United States Duration: Jun 14 2020 → Jun 19 2020

Bibliographical note

Publisher Copyright:
© 2020 IEEE.

ASJC Scopus Subject Areas

Software
Computer Vision and Pattern Recognition

Access to Document

10.1109/CVPR42600.2020.01160

Cite this

@article{46d5392fad134cd7849100a870310f19,

title = "Prime Sample Attention in Object Detection",

abstract = "It is a common paradigm in object detection frameworks to treat all samples equally and target at maximizing the performance on average. In this work, we revisit this paradigm through a careful study on how different samples contribute to the overall performance measured in terms of mAP. Our study suggests that the samples in each mini-batch are neither independent nor equally important, and therefore a better classifier on average does not necessarily result in higher mAP. Motivated by this study, we propose the notion of Prime Samples, those that play a key role in driving the detection performance. We further develop a simple yet effective sampling and learning strategy called PrIme Sample Attention (PISA) that directs the focus of the training process towards such samples. Our experiments demonstrate that it is often more effective to focus on prime samples than hard samples when training a detector. Particularly, on the MSCOCO dataset, PISA outperforms the random sampling baseline and hard mining schemes, eg\textasciitilde{}OHEM and Focal Loss, consistently by around 2\% on both single-stage and two-stage detectors, even with a strong backbone ResNeXt-101. Code is available at: url\{https://github.com/open-mmlab/mmdetection\}.",

author = "Yuhang Cao and Kai Chen and Loy, \{Chen Change\} and Dahua Lin",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020 ; Conference date: 14-06-2020 Through 19-06-2020",

year = "2020",

doi = "10.1109/CVPR42600.2020.01160",

language = "English",

pages = "11580--11588",

journal = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

issn = "1063-6919",

publisher = "IEEE Computer Society",

}

TY - JOUR

T1 - Prime Sample Attention in Object Detection

AU - Cao, Yuhang

AU - Chen, Kai

AU - Loy, Chen Change

AU - Lin, Dahua

PY - 2020

Y1 - 2020

N2 - It is a common paradigm in object detection frameworks to treat all samples equally and target at maximizing the performance on average. In this work, we revisit this paradigm through a careful study on how different samples contribute to the overall performance measured in terms of mAP. Our study suggests that the samples in each mini-batch are neither independent nor equally important, and therefore a better classifier on average does not necessarily result in higher mAP. Motivated by this study, we propose the notion of Prime Samples, those that play a key role in driving the detection performance. We further develop a simple yet effective sampling and learning strategy called PrIme Sample Attention (PISA) that directs the focus of the training process towards such samples. Our experiments demonstrate that it is often more effective to focus on prime samples than hard samples when training a detector. Particularly, on the MSCOCO dataset, PISA outperforms the random sampling baseline and hard mining schemes, eg~OHEM and Focal Loss, consistently by around 2% on both single-stage and two-stage detectors, even with a strong backbone ResNeXt-101. Code is available at: url{https://github.com/open-mmlab/mmdetection}.

AB - It is a common paradigm in object detection frameworks to treat all samples equally and target at maximizing the performance on average. In this work, we revisit this paradigm through a careful study on how different samples contribute to the overall performance measured in terms of mAP. Our study suggests that the samples in each mini-batch are neither independent nor equally important, and therefore a better classifier on average does not necessarily result in higher mAP. Motivated by this study, we propose the notion of Prime Samples, those that play a key role in driving the detection performance. We further develop a simple yet effective sampling and learning strategy called PrIme Sample Attention (PISA) that directs the focus of the training process towards such samples. Our experiments demonstrate that it is often more effective to focus on prime samples than hard samples when training a detector. Particularly, on the MSCOCO dataset, PISA outperforms the random sampling baseline and hard mining schemes, eg~OHEM and Focal Loss, consistently by around 2% on both single-stage and two-stage detectors, even with a strong backbone ResNeXt-101. Code is available at: url{https://github.com/open-mmlab/mmdetection}.

UR - http://www.scopus.com/inward/record.url?scp=85094678747&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85094678747&partnerID=8YFLogxK

U2 - 10.1109/CVPR42600.2020.01160

DO - 10.1109/CVPR42600.2020.01160

M3 - Conference article

AN - SCOPUS:85094678747

SN - 1063-6919

SP - 11580

EP - 11588

JO - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

JF - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

M1 - 9157482

T2 - 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020

Y2 - 14 June 2020 through 19 June 2020

ER -

Prime Sample Attention in Object Detection

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Access to Document

Other files and links

Cite this