2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6175 - 6184

Accurate visual localization is a key technology for autonomous navigation. 3D structure-based methods employ 3D models of the scene to estimate the full 6DOF pose of a camera very accurately. However, constructing (and extending) large-scale 3D models is still a significant challenge. In contrast, 2D image retrieval-based methods only require a database of geo-tagged images, which is trivial to construct...

chapter

3D Face Morphable Models "In-the-Wild"

James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5464 - 5473

3D Morphable Models (3DMMs) are powerful statistical models of 3D facial shape and texture, and among the state-of-the-art methods for reconstructing facial shape from single images. With the advent of new 3D sensors, many 3D facial datasets have been collected containing both neutral as well as expressive faces. However, all datasets are captured under controlled conditions. Thus, even though powerful...

chapter

Human Shape from Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks

Endri Dibra, Himanshu Jain, Cengiz Oztireli, Remo Ziegler, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5504 - 5514

In this work, we present a novel method for capturing human body shape from a single scaled silhouette. We combine deep correlated features capturing different 2D views, and embedding spaces based on 3D cues in a novel convolutional neural network (CNN) based architecture. We first train a CNN to find a richer body shape representation space from pose invariant 3D human shape descriptors. Then, we...

chapter

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1942 - 1950

Spatial relationships between objects provide important information for text-based image retrieval. As users are more likely to describe a scene from a real world perspective, using 3D spatial relationships rather than 2D relationships that assume a particular viewing direction, one of the main challenges is to infer the 3D structure that bridges images with users' text descriptions. However, direct...

chapter

End-to-End 3D Face Reconstruction with Deep Neural Networks

Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1503 - 1512

Monocular 3D facial shape reconstruction from a single 2D facial image has been an active research area due to its wide applications. Inspired by the success of deep neural networks (DNN), we propose a DNN-based approach for End-to-End 3D FAce Reconstruction (UH-E2FAR) from a single 2D image. Different from recent works that reconstruct and refine the 3D face in an iterative manner using both an RGB...

chapter

A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5094 - 5102

We present a Reinforcement Learning (RL) solution to the view planning problem (VPP), which generates a sequence of view points that are capable of sensing all accessible area of a given object represented as a 3D model. In doing so, the goal is to minimize the number of view points, making the VPP a class of set covering optimization problem (SCOP). The SCOP is NP-hard, and the inapproximability...

chapter

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4598 - 4607

We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Unlike the previous depth-based discriminative or data-driven methods that require sophisticated training or manual intervention, we propose a generative framework that unifies pose tracking and face model adaptation on-the-fly. Particularly,...

chapter

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4714 - 4723

We propose a deep multitask architecture for fully automatic 2d and 3d human sensing (DMHS), including recognition and reconstruction, in monocular images. The system computes the figure-ground segmentation, semantically identifies the human body parts at pixel level, and estimates the 2d and 3d pose of the person. The model supports the joint training of all components by means of multi-task losses...

chapter

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

Shiyu Huang, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4664 - 4673

As autonomous vehicles become an every-day reality, high-accuracy pedestrian detection is of paramount practical importance. Pedestrian detection is a highly researched topic with mature methods, but most datasets (for both training and evaluation) focus on common scenes of people engaged in typical walking poses on sidewalks. But performance is most crucial for dangerous scenarios that are rarely...

chapter

Synthesizing Normalized Faces from Facial Identity Features

Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3386 - 3395

We present a method for synthesizing a frontal, neutral-expression image of a person's face, given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous generative approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this...

chapter

Adversarially Tuned Scene Generation

Vsr Veeravasarapu, Constantin Rothkopf, Ramesh Visvanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6441 - 6449

Generalization performance of trained computer vision (CV) systems that use computer graphics (CG) generated data is not yet effective due to the concept of domain-shift between virtual and real data. Although simulated data augmented with a few real-world samples has been shown to mitigate domain shift and improve transferability of trained models, guiding or bootstrapping the virtual data generation...

chapter

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2511 - 2519

We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely used as the underlying representations to build complex 3D shapes; however, voxel-based representations suffer from high memory requirements, and parts-based models require a large collection of cached or richly parametrized parts. We take an alternative approach: learning a generative model over multi-view...

chapter

3D Menagerie: Modeling the 3D Shape and Pose of Animals

Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5524 - 5532

There has been significant work on learning realistic, articulated, 3D models of the human body. In contrast, there are few such models of animals, despite many applications. The main challenge is that animals are much less cooperative than humans. The best human body models are learned from thousands of 3D scans of people in specific poses, which is infeasible with live animals. Consequently, we...

chapter

Recurrent 3D Pose Sequence Machines

Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5543 - 5552

Recovering 3D articulated human pose from monocular image sequences is very challenging because of diverse appearances, viewpoints, and occlusions, and because 3D human pose is inherently ambiguous in monocular imagery. It is thus critical to exploit rich spatial and temporal long-range dependencies among body joints for accurate 3D pose sequence prediction. Existing approaches usually manually design...

chapter

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2432 - 2443

A key requirement for leveraging supervised deep learning methods is the availability of large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very little data is available – current datasets cover a small range of scene views and have limited semantic annotations. To address this issue, we introduce ScanNet, an RGB-D video dataset containing 2.5M views in...

chapter

Transformation-Grounded Image Generation Network for Novel 3D View Synthesis

Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 702 - 711

We present a transformation-grounded image generation network for novel 3D view synthesis from a single image. Our approach first explicitly infers the parts of the geometry visible both in the input and novel views and then casts the remaining synthesis problem as image completion. Specifically, we both predict a flow to move the pixels from the input to the novel view along with a novel visibility...

chapter

Learning from Synthetic Humans

Gul Varol, Javier Romero, Xavier Martin, Naureen Mahmood, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4627 - 4635

Estimating human pose, shape, and motion from images and videos are fundamental challenges with many applications. Recent advances in 2D human pose estimation use large amounts of manually-labeled training data for learning convolutional neural networks (CNNs). Such data is time consuming to acquire and difficult to extend. Moreover, manual labeling of 3D pose, depth and motion is impractical. In...

chapter

3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions

Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 199 - 208

Matching local geometric features on real-world depth images is a challenging task due to the noisy, low-resolution, and incomplete nature of 3D scan data. These difficulties limit the performance of current state-of-the-art methods, which are typically based on histograms over geometric properties. In this paper, we present 3DMatch, a data-driven model that learns a local volumetric patch descriptor...

chapter

Learning Category-Specific 3D Shape Models from Weakly Labeled 2D Images

Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3587 - 3595

Recently, researchers have made great progress in building category-specific 3D shape models from 2D images with manual annotations consisting of class labels, keypoints, and ground truth figure-ground segmentations. However, the annotation of figure-ground segmentations is still labor-intensive and time-consuming. To further alleviate the burden of providing such manual annotations, we make the earliest...

chapter

Toroidal Constraints for Two-Point Localization Under High Outlier Ratios

Federico Camposeco, Torsten Sattler, Andrea Cohen, Andreas Geiger, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6700 - 6708

Localizing a query image against a 3D model at large scale is a hard problem, since 2D-3D matches become more and more ambiguous as the model size increases. This creates a need for pose estimation strategies that can handle very low inlier ratios. In this paper, we draw new insights on the geometric information available from the 2D-3D matching process. As modern descriptors are not invariant against...
