We present a method for jointly predicting a depth map and intrinsic images from single-image input. The two tasks are formulated in a synergistic manner through a joint conditional random field (CRF) that is solved using a novel convolutional neural network (CNN) architecture, called the joint convolutional neural field (JCNF) model. Tailored to our joint estimation problem, JCNF differs from previous CNNs in its sharing of convolutional activations and layers between networks for each task, its inference in the gradient domain where there exists greater correlation between depth and intrinsic images, and the incorporation of a gradient scale network that learns the confidence of estimated gradients in order to effectively balance them in the solution. This approach is shown to surpass state-of-the-art methods both on single-image depth estimation and on intrinsic image decomposition.
|Title of host publication||Computer Vision - 14th European Conference, ECCV 2016, Proceedings|
|Editors||Bastian Leibe, Jiri Matas, Nicu Sebe, Max Welling|
|Number of pages||17|
|Publication status||Published - 2016|
|Event||14th European Conference on Computer Vision, ECCV 2016 - Amsterdam, Netherlands|
Duration: 2016 Oct 8 → 2016 Oct 16
|Name||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|Conference||14th European Conference on Computer Vision, ECCV 2016|
|Period||16/10/8 → 16/10/16|
Bibliographical noteFunding Information:
This research was supported by the MSIP (The Ministry of Science, ICT and Future Planning), Korea and Microsoft Research, under ICT/SW Creative research program supervised by the IITP(Institute for Information & Communications Technology Promotion) (IITP-2015-R2212-15-0008).
© Springer International Publishing AG 2016.
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Computer Science(all)