2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation from Single and Multiple Images

Yuan Gao, Alan L. Yuille

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6718 - 6727

Many man-made objects have intrinsic symmetries and Manhattan structure. By assuming an orthographic projection model, this paper addresses the estimation of 3D structures and camera projection using symmetry and/or Manhattan structure cues, which occur when the input is a single image or multiple images from the same category, e.g., multiple different cars. Specifically, analysis on the single image case...

chapter

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6175 - 6184

Accurate visual localization is a key technology for autonomous navigation. 3D structure-based methods employ 3D models of the scene to estimate the full 6DOF pose of a camera very accurately. However, constructing (and extending) large-scale 3D models is still a significant challenge. In contrast, 2D image retrieval-based methods only require a database of geo-tagged images, which is trivial to construct...

chapter

Human Shape from Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks

Endri Dibra, Himanshu Jain, Cengiz Oztireli, Remo Ziegler, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5504 - 5514

In this work, we present a novel method for capturing human body shape from a single scaled silhouette. We combine deep correlated features capturing different 2D views, and embedding spaces based on 3D cues in a novel convolutional neural network (CNN) based architecture. We first train a CNN to find a richer body shape representation space from pose-invariant 3D human shape descriptors. Then, we...

chapter

Light Field Blind Motion Deblurring

Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2354 - 2362

We study the problem of deblurring light fields of general 3D scenes captured under 3D camera motion and present both theoretical and practical contributions. By analyzing the motion-blurred light field in the primal and Fourier domains, we develop intuition into the effects of camera motion on the light field, show the advantages of capturing a 4D light field instead of a conventional 2D image for...

chapter

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1942 - 1950

Spatial relationships between objects provide important information for text-based image retrieval. As users are more likely to describe a scene from a real-world perspective, using 3D spatial relationships rather than 2D relationships that assume a particular viewing direction, one of the main challenges is to infer the 3D structure that bridges images with users' text descriptions. However, direct...

chapter

End-to-End 3D Face Reconstruction with Deep Neural Networks

Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1503 - 1512

Monocular 3D facial shape reconstruction from a single 2D facial image has been an active research area due to its wide applications. Inspired by the success of deep neural networks (DNN), we propose a DNN-based approach for End-to-End 3D FAce Reconstruction (UH-E2FAR) from a single 2D image. Different from recent works that reconstruct and refine the 3D face in an iterative manner using both an RGB...

chapter

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4714 - 4723

We propose a deep multitask architecture for fully automatic 2D and 3D human sensing (DMHS), including recognition and reconstruction, in monocular images. The system computes the figure-ground segmentation, semantically identifies the human body parts at pixel level, and estimates the 2D and 3D pose of the person. The model supports the joint training of all components by means of multi-task losses...

chapter

Towards a Quality Metric for Dense Light Fields

Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafal K. Mantiuk, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3720 - 3729

Light fields have become a popular representation of three-dimensional scenes, and there is interest in their processing, resampling, and compression. As those operations often result in loss of quality, there is a need to quantify it. In this work, we collect a new dataset of dense reference and distorted light fields as well as the corresponding quality scores, which are scaled in perceptual units. The...

chapter

Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation

Kuan-Lun Tseng, Yen-Liang Lin, Winston Hsu, Chung-Yang Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3739 - 3746

Deep learning models such as convolutional neural networks have been widely used in 3D biomedical segmentation and achieve state-of-the-art performance. However, most of them adopt a single modality or stack multiple modalities as different input channels, which ignores the correlations among them. To leverage the multiple modalities, we propose a deep convolution encoder-decoder structure with...

chapter

Learning Barycentric Representations of 3D Shapes for Sketch-Based 3D Shape Retrieval

Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3615 - 3623

Retrieving 3D shapes with sketches is a challenging problem since 2D sketches and 3D shapes are from two heterogeneous domains, which results in a large discrepancy between them. In this paper, we propose to learn barycenters of 2D projections of 3D shapes for sketch-based 3D shape retrieval. Specifically, we first use two deep convolutional neural networks (CNNs) to extract deep features of sketches...

chapter

Fast Fourier Color Constancy

Jonathan T. Barron, Yun-Ta Tsai

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6950 - 6958

We present Fast Fourier Color Constancy (FFCC), a color constancy algorithm which solves illuminant estimation by reducing it to a spatial localization task on a torus. By operating in the frequency domain, FFCC produces lower error rates than the previous state-of-the-art by 13–20% while being 250–3000 times faster. This unconventional approach introduces challenges regarding aliasing,...

chapter

Multi-view 3D Object Detection Network for Autonomous Driving

Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6526 - 6534

This paper aims at high-accuracy 3D object detection in autonomous driving scenarios. We propose Multi-View 3D networks (MV3D), a sensory-fusion framework that takes both LIDAR point cloud and RGB images as input and predicts oriented 3D bounding boxes. We encode the sparse 3D point cloud with a compact multi-view representation. The network is composed of two subnetworks: one for 3D object proposal...

chapter

Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks

Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2511 - 2519

We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely used as the underlying representations to build complex 3D shapes; however, voxel-based representations suffer from high memory requirements, and parts-based models require a large collection of cached or richly parametrized parts. We take an alternative approach: learning a generative model over multi-view...

chapter

3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder

Gil Elbaz, Tamar Avraham, Anath Fischer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2472 - 2481

We present an algorithm for registration between a large-scale point cloud and a close-proximity scanned point cloud, providing a localization solution that is fully independent of prior information about the initial positions of the two point cloud coordinate systems. The algorithm, denoted LORAX, selects super-points (local subsets of points) and describes the geometric structure...

chapter

Recurrent 3D Pose Sequence Machines

Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5543 - 5552

3D human articulated pose recovery from monocular image sequences is very challenging due to diverse appearances, viewpoints, and occlusions, and because human 3D pose is inherently ambiguous from monocular imagery. It is thus critical to exploit rich spatial and temporal long-range dependencies among body joints for accurate 3D pose sequence prediction. Existing approaches usually manually design...

chapter

A Point Set Generation Network for 3D Object Reconstruction from a Single Image

Haoqiang Fan, Hao Su, Leonidas Guibas

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2463 - 2471

Generation of 3D data by deep neural networks has been attracting increasing attention in the research community. The majority of extant works resort to regular representations such as volumetric grids or collections of images; however, these representations obscure the natural invariance of 3D shapes under geometric transformations, and also suffer from a number of other issues. In this paper we address...

chapter

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2432 - 2443

A key requirement for leveraging supervised deep learning methods is the availability of large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very little data is available – current datasets cover a small range of scene views and have limited semantic annotations. To address this issue, we introduce ScanNet, an RGB-D video dataset containing 2.5M views in...

chapter

Unrolling the Shutter: CNN to Correct Motion Distortions

Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2345 - 2353

Row-wise exposure delay present in CMOS cameras is responsible for skew and curvature distortions known as the rolling shutter (RS) effect while imaging under camera motion. Existing RS correction methods resort to using multiple images or tailor scene-specific correction schemes. We propose a convolutional neural network (CNN) architecture that automatically learns essential scene features from a...

chapter

Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing

Yu-Chuan Su, Kristen Grauman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1368 - 1376

360° video requires human viewers to actively control where to look while watching the video. Although it provides a more immersive experience of the visual content, it also introduces an additional burden for viewers: awkward interfaces to navigate the video lead to suboptimal viewing experiences. Virtual cinematography is an appealing direction to remedy these problems, but conventional methods...

chapter

Learning Shape Abstractions by Assembling Volumetric Primitives

Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1466 - 1474

We present a learning framework for abstracting complex shapes by learning to assemble objects using 3D volumetric primitives. In addition to generating simple and geometrically interpretable explanations of 3D objects, our framework also allows us to automatically discover and exploit consistent structure in the data. We demonstrate that using our method allows predicting shape representations which...
