Abstract
We present a unified learning framework for recovering both 3D mesh and camera pose of the object from a single image. Our approach learns to recover outer shape and surface geometric details of the mesh without relying on 3D supervision. We adopt multi-view normal maps as the 2D supervision so that the silhouette and geometric details information can be transferred to neural network. A normal mismatch based objective function is introduced to train the network, and the camera pose is parameterized into the objective, it integrates pose estimation with the mesh reconstruction in a same optimization procedure. We demonstrate the abilities of the proposed approach in generating 3D mesh and estimating camera pose with qualitative and quantitative experiments.
Original language | English |
---|---|
Title of host publication | Proceedings of the 32nd International Conference on Computer Animation and Social Agents |
Place of Publication | New York, NY, USA |
Publisher | Association for Computing Machinery, Inc |
Pages | 79–84 |
Number of pages | 6 |
ISBN (Print) | 9781450371599 |
DOIs | |
Publication status | Published - 1 Jul 2019 |
Event | The 32nd International Conference on Computer Animation and Social Agents 2019 - Paris, France Duration: 1 Jul 2019 → 3 Jul 2019 Conference number: 32 https://casa2019.sciencesconf.org/ |
Publication series
Name | CASA '19 |
---|---|
Publisher | Association for Computing Machinery |
Conference
Conference | The 32nd International Conference on Computer Animation and Social Agents 2019 |
---|---|
Abbreviated title | CASA 2019 |
Country/Territory | France |
City | Paris |
Period | 1/07/19 → 3/07/19 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- deep learning
- pose estimation
- mesh reconstruction