Search results

chapter

End-to-End 3D Face Reconstruction with Deep Neural Networks

Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1503 - 1512

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Monocular 3D facial shape reconstruction from a single 2D facial image has been an active research area due to its wide applications. Inspired by the success of deep neural networks (DNN), we propose a DNN-based approach for End-to-End 3D FAce Reconstruction (UH-E2FAR) from a single 2D image. Different from recent works that reconstruct and refine the 3D face in an iterative manner using both an RGB...

chapter

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4598 - 4607

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Unlike the previous depth-based discriminative or data-driven methods that require sophisticated training or manual intervention, we propose a generative framework that unifies pose tracking and face model adaptation on-the-fly. Particularly,...

chapter

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2511 - 2519

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely used as the underlying representations to build complex 3D shapes, however, voxel-based representations suffer from high memory requirements, and parts-based models require a large collection of cached or richly parametrized parts. We take an alternative approach: learning a generative model over multi-view...

chapter

3D Menagerie: Modeling the 3D Shape and Pose of Animals

Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5524 - 5532

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

There has been significant work on learning realistic, articulated, 3D models of the human body. In contrast, there are few such models of animals, despite many applications. The main challenge is that animals are much less cooperative than humans. The best human body models are learned from thousands of 3D scans of people in specific poses, which is infeasible with live animals. Consequently, we...

chapter

Transformation-Grounded Image Generation Network for Novel 3D View Synthesis

Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 702 - 711

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a transformation-grounded image generation network for novel 3D view synthesis from a single image. Our approach first explicitly infers the parts of the geometry visible both in the input and novel views and then casts the remaining synthesis problem as image completion. Specifically, we both predict a flow to move the pixels from the input to the novel view along with a novel visibility...

chapter

Learning from Synthetic Humans

Gul Varol, Javier Romero, Xavier Martin, Naureen Mahmood, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4627 - 4635

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Estimating human pose, shape, and motion from images and videos are fundamental challenges with many applications. Recent advances in 2D human pose estimation use large amounts of manually-labeled training data for learning convolutional neural networks (CNNs). Such data is time consuming to acquire and difficult to extend. Moreover, manual labeling of 3D pose, depth and motion is impractical. In...

chapter

Learning Category-Specific 3D Shape Models from Weakly Labeled 2D Images

Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3587 - 3595

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, researchers have made great processes to build category-specific 3D shape models from 2D images with manual annotations consisting of class labels, keypoints, and ground truth figure-ground segmentations. However, the annotation of figure-ground segmentations is still labor-intensive and time-consuming. To further alleviate the burden of providing such manual annotations, we make the earliest...

chapter

Parametric T-Spline Face Morphable Model for Detailed Fitting in Shape Subspace

Weilong Peng, Zhiyong Feng, Chao Xu, Yong Su

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5515 - 5523

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Pre-learnt subspace methods, e.g., 3DMMs, are significant exploration for the synthesis of 3D faces by assuming that faces are in a linear class. However, the human face is in a nonlinear manifold, and a new test are always not in the pre-learnt subspace accurately because of the disparity brought by ethnicity, age, gender, etc. In the paper, we propose a parametric T-spline morphable model (T-splineMM)...

chapter

Using Locally Corresponding CAD Models for Dense 3D Reconstructions from a Single Image

Chen Kong, Chen-Hsuan Lin, Simon Lucey

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5603 - 5611

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We investigate the problem of estimating the dense 3D shape of an object, given a set of 2D landmarks and silhouette in a single image. An obvious prior to employ in such a problem is a dictionary of dense CAD models. Employing a sufficiently large enough dictionary of CAD models, however, is in general computationally infeasible. A common strategy in dictionary learning to encourage generalization...

chapter

SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks

Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 791 - 800

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

3D shape models are naturally parameterized using vertices and faces, i.e., composed of polygons forming a surface. However, current 3D learning paradigms for predictive and generative tasks using convolutional neural networks focus on a voxelized representation of the object. Lifting convolution operators from the traditional 2D to 3D results in high computational overhead with little additional...

chapter

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 388 - 397

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation. We present a deep convolutional neural network (CNN) architecture to localize semantic parts in 2D image and 3D space while inferring their visibility states, given a single RGB image. Our key insight is to exploit domain knowledge to regularize the network by deeply...

chapter

Semantic Scene Completion from a Single Depth Image

Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 190 - 198

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of...

chapter

Semantic Multi-view Stereo: Jointly Estimating Objects and Voxels

Ali Osman Ulusoy, Michael J. Black, Andreas Geiger

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4531 - 4540

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Dense 3D reconstruction from RGB images is a highly ill-posed problem due to occlusions, textureless or reflective surfaces, as well as other challenges. We propose object-level shape priors to address these ambiguities. Towards this goal, we formulate a probabilistic model that integrates multi-view image evidence with 3D shape information from multiple objects. Inference in this model yields a dense...

chapter

SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation

Li Yi, Hao Su, Xingwen Guo, Leonidas Guibas

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6584 - 6592

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we study the problem of semantic annotation on 3D models that are represented as shape graphs. A functional view is taken to represent localized information on graphs, so that annotations such as part segment or keypoint are nothing but 0-1 indicator vertex functions. Compared with images that are 2D grids, shape graphs are irregular and non-isomorphic data structures. To enable the...

chapter

3D Bounding Box Estimation Using Deep Learning and Geometry

Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Kosecka

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5632 - 5640

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a method for 3D object detection and pose estimation from a single image. In contrast to current techniques that only regress the 3D orientation of an object, our method first regresses relatively stable 3D object properties using a deep convolutional neural network and then combines these estimates with geometric constraints provided by a 2D object bounding box to produce a complete 3D...

chapter

Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image

Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Celine Teuliere, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1827 - 1836

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present a novel approach, called Deep MANTA (Deep Many-Tasks), for many-task vehicle analysis from a given image. A robust convolutional network is introduced for simultaneous vehicle detection, part localization, visibility characterization and 3D dimension estimation. Its architecture is based on a new coarse-to-fine object proposal that boosts the vehicle detection. Moreover,...

chapter

Fast 3D Reconstruction of Faces with Glasses

Fabio Maninchedda, Martin R. Oswald, Marc Pollefeys

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4608 - 4617

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a method for the fast 3D face reconstruction of people wearing glasses. Our method explicitly and robustly models the case in which a face to be reconstructed is partially occluded by glasses. We propose a simple and generic model for glasses that copes with a wide variety of different shapes, colors and styles, without the need for any database or learning. Our algorithm is simple, fast...

chapter

Unite the People: Closing the Loop Between 3D and 2D Human Representations

Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4704 - 4713

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

3D models provide a common ground for different representations of human bodies. In turn, robust 2D estimation has proven to be a powerful tool to obtain 3D fits in-the-wild. However, depending on the level of detail, it can be hard to impossible to acquire labeled data for training 2D estimators on large scale. We propose a hybrid approach to this problem: with an extended version of the recently...

chapter

Multi-view Supervision for Single-View Reconstruction via Differentiable Ray Consistency

Shubham Tulsiani, Tinghui Zhou, Alexei A. Efros, Jitendra Malik

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 209 - 217

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the notion of consistency between a 3D shape and a 2D observation and propose a differentiable formulation which allows computing gradients of the 3D shape given an observation from an arbitrary view. We do so by reformulating view consistency using a differentiable ray consistency (DRC) term. We show that this formulation can be incorporated in a learning framework to leverage different...

chapter

Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting

Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3681 - 3690

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a new Cascaded Shape Regression (CSR) architecture, namely Dynamic Attention-Controlled CSR (DAC-CSR), for robust facial landmark detection on unconstrained faces. Our DAC-CSR divides facial landmark detection into three cascaded sub-tasks: face bounding box refinement, general CSR and attention-controlled CSR. The first two stages refine initial face bounding boxes and output intermediate...

INFONA - science communication portal

Search results

End-to-End 3D Face Reconstruction with Deep Neural Networks

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

3D Menagerie: Modeling the 3D Shape and Pose of Animals

Transformation-Grounded Image Generation Network for Novel 3D View Synthesis

Learning from Synthetic Humans

Learning Category-Specific 3D Shape Models from Weakly Labeled 2D Images

Parametric T-Spline Face Morphable Model for Detailed Fitting in Shape Subspace

Using Locally Corresponding CAD Models for Dense 3D Reconstructions from a Single Image

SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Semantic Scene Completion from a Single Depth Image

Semantic Multi-view Stereo: Jointly Estimating Objects and Voxels

SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation

3D Bounding Box Estimation Using Deep Learning and Geometry

Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image

Fast 3D Reconstruction of Faces with Glasses

Unite the People: Closing the Loop Between 3D and 2D Human Representations

Multi-view Supervision for Single-View Reconstruction via Differentiable Ray Consistency

Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options