Overview of ComboVerse. Given an input image that contains multiple objects, our method can generate high-quality 3D assets through a two-stage process. In the single-object reconstruction stage, we decompose every single object in the image with object inpainting, and perform single-image reconstruction to create individual 3D models. In the multi-object combination stage, we maintain the geometry and texture of each object while optimizing their scale, rotation, and translation parameters.
There are a lot of excellent works that inspire or relate to ours, ComboVerse is based on these outstanding works.
Wonder3D: Xiaoxiao Long*, Yuan-Chen Guo*, Cheng Lin, Yuan Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang. Wonder3D: Single Image to 3D using Cross-Domain Diffusion. CVPR 2024.
LRM: Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan. LRM: Large Reconstruction Model for Single Image to 3D. ICLR 2024.
OpenLRM: Zexin He, Tengfei Wang. OpenLRM: Open-Source Large Reconstruction Models.
SyncDreamer: Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang. SyncDreamer: Generating Multiview-consistent Images from a Single-view Image. ICLR 2024.
TripoSR: Dmitry Tochilkin, David Pankratz, Zexiang Liu, Zixuan Huang, Adam Letts, Yangguang Li, Ding Liang, Christian Laforte, Varun Jampani, Yan-Pei Cao. TripoSR: Fast 3D Object Reconstruction from a Single Image
@article{chen2024comboverse,
title={ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance},
author={Chen, Yongwei and Wang, Tengfei and Wu, Tong and Pan, Xingang and Jia, Kui and Liu, Ziwei},
journal={arXiv preprint arXiv:2403.12409},
year={2024}
}