2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation from Single and Multiple Images

Yuan Gao, Alan L. Yuille

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6718 - 6727

Many man-made objects have intrinsic symmetries and Manhattan structure. By assuming an orthographic projection model, this paper addresses the estimation of 3D structures and camera projection using symmetry and/or Manhattan structure cues, which occur when the input is single-or multiple-image from the same category, e.g., multiple different cars. Specifically, analysis on the single image case...

chapter

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6175 - 6184

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Accurate visual localization is a key technology for autonomous navigation. 3D structure-based methods employ 3D models of the scene to estimate the full 6DOF pose of a camera very accurately. However, constructing (and extending) large-scale 3D models is still a significant challenge. In contrast, 2D image retrieval-based methods only require a database of geo-tagged images, which is trivial to construct...

chapter

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

Youngjoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2943 - 2952

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a new high dimensional regression method by merging Gaussian process regression into a variational autoencoder framework. In contrast to other regression methods, the proposed method focuses on the case where output responses are on a complex high dimensional manifold, such as images. Our contributions are summarized as follows: (i) A new regression method estimating high dimensional...

chapter

3D Face Morphable Models "In-the-Wild"

James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5464 - 5473

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

3D Morphable Models (3DMMs) are powerful statistical models of 3D facial shape and texture, and among the state-of-the-art methods for reconstructing facial shape from single images. With the advent of new 3D sensors, many 3D facial datasets have been collected containing both neutral as well as expressive faces. However, all datasets are captured under controlled conditions. Thus, even though powerful...

chapter

Neural Face Editing with Intrinsic Image Disentangling

Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5444 - 5453

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Traditional face editing methods often require a number of sophisticated and task specific algorithms to be applied one after the other — a process that is tedious, fragile, and computationally intensive. In this paper, we propose an end-to-end generative adversarial network that infers a face-specific disentangled representation of intrinsic face properties, including shape (i.e. normals),...

chapter

Full Resolution Image Compression with Recurrent Neural Networks

George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5435 - 5443

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a set of full-resolution lossy image compression methods based on neural networks. Each of the architectures we describe can provide variable compression rates during deployment without requiring retraining of the network: each network need only be trained once. All of our architectures consist of a recurrent neural network (RNN)-based encoder and decoder, a binarizer, and a neural...

chapter

End-to-End 3D Face Reconstruction with Deep Neural Networks

Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1503 - 1512

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Monocular 3D facial shape reconstruction from a single 2D facial image has been an active research area due to its wide applications. Inspired by the success of deep neural networks (DNN), we propose a DNN-based approach for End-to-End 3D FAce Reconstruction (UH-E2FAR) from a single 2D image. Different from recent works that reconstruct and refine the 3D face in an iterative manner using both an RGB...

chapter

Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging

Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1484 - 1492

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel, practical solution for high quality reconstruction of axially-symmetric transparent objects. While a special case, such transparent objects are ubiquitous in the real world. Common examples of these are glasses, goblets, tumblers, carafes, etc., that can have very unique and visually appealing forms making their reconstruction interesting for vision and graphics applications. Our...

chapter

Deeply Aggregated Alternating Minimization for Image Restoration

Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 284 - 292

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Regularization-based image restoration has remained an active research topic in image processing and computer vision. It often leverages a guidance signal captured in different fields as an additional cue. In this work, we present a general framework for image restoration, called deeply aggregated alternating minimization (DeepAM). We propose to train deep neural network to advance two of the steps...

chapter

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4447 - 4456

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing zero-shot learning (ZSL) models typically learn a projection function from a feature space to a semantic embedding space (e.g. attribute space). However, such a projection function is only concerned with predicting the training seen class semantic representation (e.g. attribute prediction) or classification. When applied to test data, which in the context of ZSL contains different (unseen)...

chapter

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks

Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4342 - 4351

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an end-to-end, multimodal, fully convolutional network for extracting semantic structures from document images. We consider document semantic structure extraction as a pixel-wise segmentation task, and propose a unified model that classifies pixels based not only on their visual appearance, as in the traditional page segmentation task, but also on the content of underlying text. Moreover,...

chapter

Towards a Quality Metric for Dense Light Fields

Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafal K. Mantiuk, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3720 - 3729

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Light fields become a popular representation of three-dimensional scenes, and there is interest in their processing, resampling, and compression. As those operations often result in loss of quality, there is a need to quantify it. In this work, we collect a new dataset of dense reference and distorted light fields as well as the corresponding quality scores which are scaled in perceptual units. The...

chapter

Synthesizing Normalized Faces from Facial Identity Features

Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3386 - 3395

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a method for synthesizing a frontal, neutral-expression image of a persons face, given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous generative approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this...

chapter

Neural Scene De-rendering

Jiajun Wu, Joshua B. Tenenbaum, Pushmeet Kohli

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7035 - 7043

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the problem of holistic scene understanding. We would like to obtain a compact, expressive, and interpretable representation of scenes that encodes information such as the number of objects and their categories, poses, positions, etc. Such a representation would allow us to reason about and even reconstruct or manipulate elements of the scene. Previous works have used encoder-decoder based...

chapter

Snapshot Hyperspectral Light Field Imaging

Zhiwei Xiong, Lizhi Wang, Huiqun Li, Dong Liu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6873 - 6881

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents the first snapshot hyperspectral light field imager in practice. Specifically, we design a novel hybrid camera system to obtain two complementary measurements that sample the angular and spectral dimensions respectively. To recover the full 5D hyperspectral light field from the severely undersampled measurements, we then propose an efficient computational reconstruction algorithm...

chapter

Unsupervised Learning of Long-Term Motion Dynamics for Videos

Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7101 - 7110

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos. Given a pair of images from a video clip, our framework learns to predict the long-term 3D motions. To reduce the complexity of the learning framework, we propose to describe the motion as a sequence of atomic 3D flows computed with RGB-D modality. We use a Recurrent Neural Network...

chapter

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2511 - 2519

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely used as the underlying representations to build complex 3D shapes, however, voxel-based representations suffer from high memory requirements, and parts-based models require a large collection of cached or richly parametrized parts. We take an alternative approach: learning a generative model over multi-view...

chapter

A Point Set Generation Network for 3D Object Reconstruction from a Single Image

Haoqiang Fan, Hao Su, Leonidas Guibas

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2463 - 2471

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Generation of 3D data by deep neural network has been attracting increasing attention in the research community. The majority of extant works resort to regular representations such as volumetric grids or collection of images, however, these representations obscure the natural invariance of 3D shapes under geometric transformations, and also suffer from a number of other issues. In this paper we address...

chapter

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2432 - 2443

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A key requirement for leveraging supervised deep learning methods is the availability of large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very little data is available – current datasets cover a small range of scene views and have limited semantic annotations. To address this issue, we introduce ScanNet, an RGB-D video dataset containing 2.5M views in...

chapter

Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders

Xin Yu, Fatih Porikli

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5367 - 5375

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most of the conventional face hallucination methods assume the input image is sufficiently large and aligned, and all require the input image to be noise-free. Their performance degrades drastically if the input image is tiny, unaligned, and contaminated by noise. In this paper, we introduce a novel transformative discriminative autoencoder to 8X super-resolve unaligned noisy and tiny (16X16) low-resolution...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation from Single and Multiple Images

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

3D Face Morphable Models "In-the-Wild"

Neural Face Editing with Intrinsic Image Disentangling

Full Resolution Image Compression with Recurrent Neural Networks

End-to-End 3D Face Reconstruction with Deep Neural Networks

Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging

Deeply Aggregated Alternating Minimization for Image Restoration

Semantic Autoencoder for Zero-Shot Learning

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks

Towards a Quality Metric for Dense Light Fields

Synthesizing Normalized Faces from Facial Identity Features

Neural Scene De-rendering

Snapshot Hyperspectral Light Field Imaging

Unsupervised Learning of Long-Term Motion Dynamics for Videos

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

A Point Set Generation Network for 3D Object Reconstruction from a Single Image

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)