TY - GEN
T1 - Facial landmark detection by deep multi-task learning
AU - Zhang, Zhanpeng
AU - Luo, Ping
AU - Loy, Chen Change
AU - Tang, Xiaoou
PY - 2014
Y1 - 2014
N2 - Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model [21].
AB - Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model [21].
UR - http://www.scopus.com/inward/record.url?scp=84906348918&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84906348918&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-10599-4_7
DO - 10.1007/978-3-319-10599-4_7
M3 - Conference contribution
AN - SCOPUS:84906348918
SN - 9783319105987
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 94
EP - 108
BT - Computer Vision, ECCV 2014 - 13th European Conference, Proceedings
PB - Springer Verlag
T2 - 13th European Conference on Computer Vision, ECCV 2014
Y2 - 6 September 2014 through 12 September 2014
ER -