Facial landmark detection by deep multi-task learning

Zhanpeng Zhang; Ping Luo; Chen Change Loy; Xiaoou Tang

doi:10.1007/978-3-319-10599-4_7

Chinese University of Hong Kong

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1038 Citations (Scopus)

Abstract

Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on a cascaded deep model [21].
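
The idea of task-wise early stopping can be illustrated with a minimal sketch. This is not the authors' implementation or their stopping criterion: it assumes a simple patience rule (an auxiliary task is dropped once its validation loss has failed to improve for a set number of checks), and the names AuxTaskMonitor, combined_loss, patience, and weights are hypothetical, introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class AuxTaskMonitor:
    """Tracks validation loss for one auxiliary task (hypothetical helper)."""
    name: str
    patience: int = 3              # assumed: checks without improvement before stopping
    best_val: float = float("inf")
    bad_checks: int = 0
    active: bool = True

    def update(self, val_loss: float) -> bool:
        """Record a validation loss; return whether the task is still active."""
        if not self.active:
            return False
        if val_loss < self.best_val:
            self.best_val = val_loss
            self.bad_checks = 0
        else:
            self.bad_checks += 1
            if self.bad_checks >= self.patience:
                self.active = False  # stop this task only; the others continue
        return self.active


def combined_loss(main_loss, aux_losses, monitors, weights):
    """Main-task loss plus the weighted losses of auxiliary tasks still active."""
    total = main_loss
    for name, loss in aux_losses.items():
        if monitors[name].active:
            total += weights[name] * loss
    return total


if __name__ == "__main__":
    # Toy validation curves: "pose" stops improving, "gender" keeps improving.
    monitors = {
        "pose": AuxTaskMonitor("pose", patience=2),
        "gender": AuxTaskMonitor("gender", patience=2),
    }
    weights = {"pose": 1.0, "gender": 0.5}
    curves = {"pose": [1.0, 0.8, 0.9, 0.95], "gender": [0.9, 0.7, 0.6, 0.5]}
    for step in range(4):
        for name, monitor in monitors.items():
            monitor.update(curves[name][step])
        aux = {name: curves[name][step] for name in monitors}
        print(step, combined_loss(0.5, aux, monitors, weights))
```

In this sketch the main task (landmark detection) always contributes to the objective, while each auxiliary task contributes only until its own monitor switches it off, so a task that converges early stops influencing the shared parameters.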

Original language	English
Title of host publication	Computer Vision, ECCV 2014 - 13th European Conference, Proceedings
Publisher	Springer Verlag
Pages	94-108
Number of pages	15
Edition	PART 6
ISBN (Print)	9783319105987
DOIs	https://doi.org/10.1007/978-3-319-10599-4_7
Publication status	Published - 2014
Externally published	Yes
Event	13th European Conference on Computer Vision, ECCV 2014 - Zurich, Switzerland (6 Sept 2014 → 12 Sept 2014)

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number	PART 6
Volume	8694 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	13th European Conference on Computer Vision, ECCV 2014
Country/Territory	Switzerland
City	Zurich
Period	6 Sept 2014 → 12 Sept 2014

ASJC Scopus Subject Areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-319-10599-4_7

Cite this

Zhang, Z., Luo, P., Loy, C. C., & Tang, X. (2014). Facial landmark detection by deep multi-task learning. In Computer Vision, ECCV 2014 - 13th European Conference, Proceedings (PART 6 ed., pp. 94-108). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8694 LNCS, No. PART 6). Springer Verlag. https://doi.org/10.1007/978-3-319-10599-4_7

@inproceedings{6107f384929546039e965adaeb7073bd,
title = "Facial landmark detection by deep multi-task learning",
abstract = "Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model [21].",
author = "Zhanpeng Zhang and Ping Luo and Loy, {Chen Change} and Xiaoou Tang",
year = "2014",
doi = "10.1007/978-3-319-10599-4_7",
language = "English",
isbn = "9783319105987",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
number = "PART 6",
pages = "94--108",
booktitle = "Computer Vision, ECCV 2014 - 13th European Conference, Proceedings",
address = "Germany",
edition = "PART 6",
note = "13th European Conference on Computer Vision, ECCV 2014 ; Conference date: 06-09-2014 Through 12-09-2014",
}

Zhang, Z, Luo, P, Loy, CC & Tang, X 2014, Facial landmark detection by deep multi-task learning. in Computer Vision, ECCV 2014 - 13th European Conference, Proceedings. PART 6 edn, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 6, vol. 8694 LNCS, Springer Verlag, pp. 94-108, 13th European Conference on Computer Vision, ECCV 2014, Zurich, Switzerland, 9/6/14. https://doi.org/10.1007/978-3-319-10599-4_7

Facial landmark detection by deep multi-task learning. / Zhang, Zhanpeng; Luo, Ping; Loy, Chen Change et al.
Computer Vision, ECCV 2014 - 13th European Conference, Proceedings. PART 6. ed. Springer Verlag, 2014. p. 94-108 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8694 LNCS, No. PART 6).

TY - GEN

T1 - Facial landmark detection by deep multi-task learning

AU - Zhang, Zhanpeng

AU - Luo, Ping

AU - Loy, Chen Change

AU - Tang, Xiaoou

PY - 2014

Y1 - 2014

AB - Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model [21].

UR - http://www.scopus.com/inward/record.url?scp=84906348918&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84906348918&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-10599-4_7

DO - 10.1007/978-3-319-10599-4_7

M3 - Conference contribution

AN - SCOPUS:84906348918

SN - 9783319105987

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 94

EP - 108

BT - Computer Vision, ECCV 2014 - 13th European Conference, Proceedings

PB - Springer Verlag

T2 - 13th European Conference on Computer Vision, ECCV 2014

Y2 - 6 September 2014 through 12 September 2014

ER -

Zhang Z, Luo P, Loy CC, Tang X. Facial landmark detection by deep multi-task learning. In Computer Vision, ECCV 2014 - 13th European Conference, Proceedings. PART 6 ed. Springer Verlag. 2014. p. 94-108. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 6). doi: 10.1007/978-3-319-10599-4_7
