2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Items from 1 to 20 out of 142 results

Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation from Single and Multiple Images

Yuan Gao, Alan L. Yuille

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6718 - 6727

Many man-made objects have intrinsic symmetries and Manhattan structure. By assuming an orthographic projection model, this paper addresses the estimation of 3D structures and camera projection using symmetry and/or Manhattan structure cues, which occur when the input is single- or multiple-image from the same category, e.g., multiple different cars. Specifically, analysis on the single image case...

3D Shape Segmentation with Projective Convolutional Networks

Evangelos Kalogerakis, Melinos Averkiou, Subhransu Maji, Siddhartha Chaudhuri

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6630 - 6639

This paper introduces a deep architecture for segmenting 3D objects into their labeled semantic parts. Our architecture combines image-based Fully Convolutional Networks (FCNs) and surface-based Conditional Random Fields (CRFs) to yield coherent segmentations of 3D shapes. The image-based FCNs are used for efficient view-based reasoning about 3D object parts. Through a special projection layer, FCN...

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6175 - 6184

Accurate visual localization is a key technology for autonomous navigation. 3D structure-based methods employ 3D models of the scene to estimate the full 6DOF pose of a camera very accurately. However, constructing (and extending) large-scale 3D models is still a significant challenge. In contrast, 2D image retrieval-based methods only require a database of geo-tagged images, which is trivial to construct...

Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2848 - 2857

Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining...

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs

Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodola, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5425 - 5434

Deep learning has achieved a remarkable performance breakthrough in several fields, most notably in speech recognition, natural language processing, and computer vision. In particular, convolutional neural network (CNN) architectures currently produce state-of-the-art performance on a variety of image analysis tasks such as object detection and recognition. Most of deep learning research has so far...

Detailed, Accurate, Human Shape Estimation from Clothed 3D Scan Sequences

Chao Zhang, Sergi Pujades, Michael Black, Gerard Pons-Moll

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5484 - 5493

We address the problem of estimating human pose and body shape from 3D scans over time. Reliable estimation of 3D body shape is necessary for many applications including virtual try-on, health monitoring, and avatar creation for virtual reality. Scanning bodies in minimal clothing, however, presents a practical barrier to these applications. We address this problem by estimating body shape under clothing...

3D Face Morphable Models "In-the-Wild"

James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5464 - 5473

3D Morphable Models (3DMMs) are powerful statistical models of 3D facial shape and texture, and among the state-of-the-art methods for reconstructing facial shape from single images. With the advent of new 3D sensors, many 3D facial datasets have been collected containing both neutral as well as expressive faces. However, all datasets are captured under controlled conditions. Thus, even though powerful...

Human Shape from Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks

Endri Dibra, Himanshu Jain, Cengiz Oztireli, Remo Ziegler, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5504 - 5514

In this work, we present a novel method for capturing human body shape from a single scaled silhouette. We combine deep correlated features capturing different 2D views, and embedding spaces based on 3D cues in a novel convolutional neural network (CNN) based architecture. We first train a CNN to find a richer body shape representation space from pose invariant 3D human shape descriptors. Then, we...

Light Field Blind Motion Deblurring

Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2354 - 2362

We study the problem of deblurring light fields of general 3D scenes captured under 3D camera motion and present both theoretical and practical contributions. By analyzing the motion-blurred light field in the primal and Fourier domains, we develop intuition into the effects of camera motion on the light field, show the advantages of capturing a 4D light field instead of a conventional 2D image for...

Spatiotemporal Pyramid Network for Video Action Recognition

Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2097 - 2106

Two-stream convolutional networks have shown strong performance in video action recognition tasks. The key idea is to learn spatiotemporal features by fusing convolutional networks spatially and temporally. However, it remains unclear how to model the correlations between the spatial and temporal structures at multiple abstraction levels. First, the spatial stream tends to fail if two videos share...

Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos

Konstantinos Papoutsakis, Costas Panagiotakis, Antonis A. Argyros

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2146 - 2155

Given two action sequences, we are interested in spotting/co-segmenting all pairs of sub-sequences that represent the same action. We propose a totally unsupervised solution to this problem. No a-priori model of the actions is assumed to be available. The number of common sub-sequences may be unknown. The sub-sequences can be located anywhere in the original sequences, may differ in duration and the...

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1942 - 1950

Spatial relationships between objects provide important information for text-based image retrieval. As users are more likely to describe a scene from a real world perspective, using 3D spatial relationships rather than 2D relationships that assume a particular viewing direction, one of the main challenges is to infer the 3D structure that bridges images with users' text descriptions. However, direct...

End-to-End 3D Face Reconstruction with Deep Neural Networks

Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1503 - 1512

Monocular 3D facial shape reconstruction from a single 2D facial image has been an active research area due to its wide applications. Inspired by the success of deep neural networks (DNN), we propose a DNN-based approach for End-to-End 3D FAce Reconstruction (UH-E2FAR) from a single 2D image. Different from recent works that reconstruct and refine the 3D face in an iterative manner using both an RGB...

Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging

Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1484 - 1492

We propose a novel, practical solution for high quality reconstruction of axially-symmetric transparent objects. While a special case, such transparent objects are ubiquitous in the real world. Common examples of these are glasses, goblets, tumblers, carafes, etc., that can have very unique and visually appealing forms making their reconstruction interesting for vision and graphics applications. Our...

A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5094 - 5102

We present a Reinforcement Learning (RL) solution to the view planning problem (VPP), which generates a sequence of view points that are capable of sensing all accessible area of a given object represented as a 3D model. In doing so, the goal is to minimize the number of view points, making the VPP a class of set covering optimization problem (SCOP). The SCOP is NP-hard, and the inapproximability...

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5057 - 5065

Indoor scene understanding is central to applications such as robot navigation and human companion assistance. Over the last years, data-driven deep neural networks have outperformed many traditional approaches thanks to their representation learning capabilities. One of the bottlenecks in training for better representations is the amount of available per-pixel ground truth data that is required for...

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1417 - 1426

Temporal action localization is an important yet challenging problem. Given a long, untrimmed video consisting of multiple action instances and complex background contents, we need not only to recognize their action categories, but also to localize the start time and end time of each instance. Many state-of-the-art systems use segment-level classifiers to select and rank proposal segments of pre-determined...

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4598 - 4607

We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Unlike the previous depth-based discriminative or data-driven methods that require sophisticated training or manual intervention, we propose a generative framework that unifies pose tracking and face model adaptation on-the-fly. Particularly,...

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4714 - 4723

We propose a deep multitask architecture for fully automatic 2d and 3d human sensing (DMHS), including recognition and reconstruction, in monocular images. The system computes the figure-ground segmentation, semantically identifies the human body parts at pixel level, and estimates the 2d and 3d pose of the person. The model supports the joint training of all components by means of multi-task losses...

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

Shiyu Huang, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4664 - 4673

As autonomous vehicles become an everyday reality, high-accuracy pedestrian detection is of paramount practical importance. Pedestrian detection is a highly researched topic with mature methods, but most datasets (for both training and evaluation) focus on common scenes of people engaged in typical walking poses on sidewalks. But performance is most crucial for dangerous scenarios that are rarely...

Keywords:
THREE-DIMENSIONAL DISPLAYS

