Chasing the Tail in Monocular 3D Human Reconstruction With Prototype Memory

Yu Rong*, Ziwei Liu, Chen Change Loy

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

Deep neural networks have achieved remarkable progress in single-image 3D human reconstruction. However, existing methods still fall short in predicting rare poses. The reason is that most of the current models perform regression based on a single human prototype, which is similar to common poses while far from the rare poses. In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses. The core of our framework is a memory module that learns and stores a set of 3D human prototypes capturing local distributions for either common poses or rare poses. With this formulation, the regression starts from a better initialization, which is relatively easier to converge. Extensive experiments on several widely employed datasets demonstrate the proposed framework's effectiveness compared to other state-of-the-art methods. Notably, our approach significantly improves the models' performances on rare poses while generating comparable results on other samples.

Original languageEnglish
Pages (from-to)2907-2919
Number of pages13
JournalIEEE Transactions on Image Processing
Volume31
DOIs
Publication statusPublished - 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 1992-2012 IEEE.

ASJC Scopus Subject Areas

  • Software
  • Computer Graphics and Computer-Aided Design

Keywords

  • 3D pose estimation
  • clustering
  • Motion capture

Cite this