Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory

Yu Rong; Ziwei Liu; Chen Change Loy

doi:10.1109/TIP.2022.3154606

Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory

Yu Rong^*, Ziwei Liu, Chen Change Loy

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

9 Citations (Scopus)

Abstract

Deep neural networks have achieved remarkable progress in single-image 3D human reconstruction. However, existing methods still fall short in predicting rare poses. The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses. In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses. The core of our framework is a memory module that learns and stores a set of 3D human prototypes capturing local distributions for either common poses or rare poses. With this formulation, the regression starts from a better initialization, which is relatively easier to converge. Extensive experiments on several widely employed datasets demonstrate the proposed framework's effectiveness compared to other state-of-the-art methods. Notably, our approach significantly improves the models' performances on rare poses while generating comparable results on other samples.

Original language	English
Pages (from-to)	2907-2919
Number of pages	13
Journal	IEEE Transactions on Image Processing
Volume	31
DOIs	https://doi.org/10.1109/TIP.2022.3154606
Publication status	Published - 2022
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 1992-2012 IEEE.

ASJC Scopus Subject Areas

Software
Computer Graphics and Computer-Aided Design

Keywords

3D pose estimation
clustering
Motion capture

Access to Document

10.1109/TIP.2022.3154606

Cite this

@article{7c270ae96e7f48cc89c1a2e244be94bd,

title = "Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory",

abstract = "Deep neural networks have achieved remarkable progress in single-image 3D human reconstruction. However, existing methods still fall short in predicting rare poses. The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses. In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses. The core of our framework is a memory module that learns and stores a set of 3D human prototypes capturing local distributions for either common poses or rare poses. With this formulation, the regression starts from a better initialization, which is relatively easier to converge. Extensive experiments on several widely employed datasets demonstrate the proposed framework's effectiveness compared to other state-of-the-art methods. Notably, our approach significantly improves the models' performances on rare poses while generating comparable results on other samples.",

keywords = "3D pose estimation, clustering, Motion capture",

author = "Yu Rong and Ziwei Liu and Loy, \{Chen Change\}",

note = "Publisher Copyright: {\textcopyright} 1992-2012 IEEE.",

year = "2022",

doi = "10.1109/TIP.2022.3154606",

language = "English",

volume = "31",

pages = "2907--2919",

journal = "IEEE Transactions on Image Processing",

issn = "1057-7149",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory

AU - Rong, Yu

AU - Liu, Ziwei

AU - Loy, Chen Change

PY - 2022

Y1 - 2022

N2 - Deep neural networks have achieved remarkable progress in single-image 3D human reconstruction. However, existing methods still fall short in predicting rare poses. The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses. In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses. The core of our framework is a memory module that learns and stores a set of 3D human prototypes capturing local distributions for either common poses or rare poses. With this formulation, the regression starts from a better initialization, which is relatively easier to converge. Extensive experiments on several widely employed datasets demonstrate the proposed framework's effectiveness compared to other state-of-the-art methods. Notably, our approach significantly improves the models' performances on rare poses while generating comparable results on other samples.

AB - Deep neural networks have achieved remarkable progress in single-image 3D human reconstruction. However, existing methods still fall short in predicting rare poses. The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses. In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses. The core of our framework is a memory module that learns and stores a set of 3D human prototypes capturing local distributions for either common poses or rare poses. With this formulation, the regression starts from a better initialization, which is relatively easier to converge. Extensive experiments on several widely employed datasets demonstrate the proposed framework's effectiveness compared to other state-of-the-art methods. Notably, our approach significantly improves the models' performances on rare poses while generating comparable results on other samples.

KW - 3D pose estimation

KW - clustering

KW - Motion capture

UR - http://www.scopus.com/inward/record.url?scp=85127464240&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85127464240&partnerID=8YFLogxK

U2 - 10.1109/TIP.2022.3154606

DO - 10.1109/TIP.2022.3154606

M3 - Article

C2 - 35363614

AN - SCOPUS:85127464240

SN - 1057-7149

VL - 31

SP - 2907

EP - 2919

JO - IEEE Transactions on Image Processing

JF - IEEE Transactions on Image Processing

ER -

Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory

Abstract

Bibliographical note

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Cite this