Abstract
While 3D content generation has advanced significantly, existing methods still face challenges with input formats, latent space design, and output representations. This paper introduces a novel 3D generation framework that addresses these challenges, offering scalable, high-quality 3D generation with an interactive Point Cloud-structured Latent space. Our framework employs a Variational Autoencoder (VAE) with multi-view posed RGB-D(epth)-N(ormal) renderings as input, using a unique latent space design that preserves 3D shape information, and incorporates a cascaded latent flow-based model for improved shape-texture disentanglement. The proposed method, GAUSSIANANYTHING, supports multi-modal conditional 3D generation, allowing for point cloud, caption, and single image inputs. Notably, the newly proposed latent space naturally enables geometry-texture disentanglement, thus allowing 3D-aware editing. Experimental results demonstrate the effectiveness of our approach on multiple datasets, outperforming existing native 3D methods in both text- and image-conditioned 3D generation.
Original language | English |
---|---|
Title of host publication | 13th International Conference on Learning Representations, ICLR 2025 |
Publisher | International Conference on Learning Representations, ICLR |
Pages | 66651-66675 |
Number of pages | 25 |
ISBN (Electronic) | 9798331320850 |
Publication status | Published - 2025 |
Externally published | Yes |
Event | 13th International Conference on Learning Representations, ICLR 2025 - Singapore, Singapore Duration: Apr 24 2025 → Apr 28 2025 |
Publication series
Name | 13th International Conference on Learning Representations, ICLR 2025 |
---|
Conference
Conference | 13th International Conference on Learning Representations, ICLR 2025 |
---|---|
Country/Territory | Singapore |
City | Singapore |
Period | 4/24/25 → 4/28/25 |
Bibliographical note
Publisher Copyright:© 2025 13th International Conference on Learning Representations, ICLR 2025. All rights reserved.
ASJC Scopus Subject Areas
- Language and Linguistics
- Computer Science Applications
- Education
- Linguistics and Language