Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

chapter

Anytime Recognition of Objects and Scenes

Sergey Karayev, Mario Fritz, Trevor Darrell

2014 IEEE Conference on Computer Vision and Pattern Recognition > 572 - 579

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Humans are capable of perceiving a scene at a glance, and obtain deeper understanding with additional time. Similarly, visual recognition deployments should be robust to varying computational budgets. Such situations require Anytime recognition ability, which is rarely considered in computer vision research. We present a method for learning dynamic policies to optimize Anytime performance in visual...

chapter

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik

2014 IEEE Conference on Computer Vision and Pattern Recognition > 580 - 587

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous...

chapter

Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group

Raviteja Vemulapalli, Felipe Arrate, Rama Chellappa

2014 IEEE Conference on Computer Vision and Pattern Recognition > 588 - 595

Recently introduced cost-effective depth sensors coupled with the real-time skeleton estimation algorithm of Shotton et al. [16] have generated a renewed interest in skeleton-based human action recognition. Most of the existing skeleton-based approaches use either the joint locations or the joint angles to represent a human skeleton. In this paper, we propose a new skeletal representation that explicitly...

chapter

Multi-view Super Vector for Action Recognition

Zhuowei Cai, Limin Wang, Xiaojiang Peng, Yu Qiao

2014 IEEE Conference on Computer Vision and Pattern Recognition > 596 - 603

Images and videos are often characterized by multiple types of local descriptors such as SIFT, HOG and HOF, each of which describes certain aspects of object features. Recognition systems benefit from fusing multiple types of these descriptors. Two widely applied fusion pipelines are descriptor concatenation and kernel average. The first one is effective when different descriptors are strongly correlated,...

chapter

Unsupervised Spectral Dual Assignment Clustering of Human Actions in Context

Simon Jones, Ling Shao

2014 IEEE Conference on Computer Vision and Pattern Recognition > 604 - 611

A recent trend of research has shown how contextual information related to an action, such as a scene or object, can enhance the accuracy of human action recognition systems. However, using context to improve unsupervised human action clustering has never been considered before, and cannot be achieved using existing clustering methods. To solve this problem, we introduce a novel, general purpose algorithm,...

chapter

Parsing Videos of Actions with Segmental Grammars

Hamed Pirsiavash, Deva Ramanan

2014 IEEE Conference on Computer Vision and Pattern Recognition > 612 - 619

Real-world videos of human activities exhibit temporal structure at various scales; long videos are typically composed of multiple action instances, where each instance is itself composed of sub-actions with variable durations and orderings. Temporal grammars can presumably model such hierarchical structure, but are computationally difficult to apply for long video streams. We describe simple...

chapter

Rate-Invariant Analysis of Trajectories on Riemannian Manifolds with Application in Visual Speech Recognition

Jingyong Su, Anuj Srivastava, Fillipe D.M. de Souza, Sudeep Sarkar

2014 IEEE Conference on Computer Vision and Pattern Recognition > 620 - 627

In statistical analysis of video sequences for speech recognition, and more generally activity recognition, it is natural to treat temporal evolutions of features as trajectories on Riemannian manifolds. However, different evolution patterns result in arbitrary parameterizations of these trajectories. We investigate a recent framework from statistics literature that handles this nuisance variability...

chapter

Piecewise Planar and Compact Floorplan Reconstruction from Images

Ricardo Cabral, Yasutaka Furukawa

2014 IEEE Conference on Computer Vision and Pattern Recognition > 628 - 635

This paper presents a system to reconstruct piecewise planar and compact floorplans from images, which are then converted to high quality texture-mapped models for free-viewpoint visualization. There are two main challenges in image-based floorplan reconstruction. The first is the lack of 3D information that can be extracted from images by Structure from Motion and Multi-View Stereo, as indoor scenes...

chapter

Data-Driven Flower Petal Modeling with Botany Priors

Chenxi Zhang, Mao Ye, Bo Fu, Ruigang Yang

2014 IEEE Conference on Computer Vision and Pattern Recognition > 636 - 643

In this paper we focus on the 3D modeling of flowers, in particular the petals. The complex structure, severe occlusions, and wide variations make the reconstruction of their 3D models a challenging task. Therefore, even though the flower is the most distinctive part of a plant, there has been little modeling study devoted to it. We overcome these challenges by combining data driven modeling techniques...

chapter

User-Specific Hand Modeling from Monocular Depth Sequences

Jonathan Taylor, Richard Stebbing, Varun Ramakrishna, Cem Keskin, et al.

2014 IEEE Conference on Computer Vision and Pattern Recognition > 644 - 651

This paper presents a method for acquiring dense nonrigid shape and deformation from a single monocular depth sensor. We focus on modeling the human hand, and assume that a single rough template model is available. We combine and extend existing work on model-based tracking, subdivision surface fitting, and mesh deformation to acquire detailed hand models from as few as 15 frames of depth data. We...

chapter

Class Specific 3D Object Shape Priors Using Surface Normals

Christian Hane, Nikolay Savinov, Marc Pollefeys

2014 IEEE Conference on Computer Vision and Pattern Recognition > 652 - 659

Dense 3D reconstruction of real world objects containing textureless, reflective and specular parts is a challenging task. Using general smoothness priors such as surface area regularization can lead to defects in the form of disconnected parts or unwanted indentations. We argue that this problem can be solved by exploiting the object class specific local surface orientations, e.g. a car is always...

chapter

Frequency-Based 3D Reconstruction of Transparent and Specular Objects

Ding Liu, Xida Chen, Yee-Hong Yang

2014 IEEE Conference on Computer Vision and Pattern Recognition > 660 - 667

3D reconstruction of transparent and specular objects is a very challenging topic in computer vision. For transparent and specular objects, which have complex interior and exterior structures that can reflect and refract light in a complex fashion, it is difficult, if not impossible, to use either passive stereo or the traditional structured light methods to do the reconstruction. We propose a frequency-based...

chapter

Human Body Shape Estimation Using a Multi-resolution Manifold Forest

Frank Perbet, Sam Johnson, Minh-Tri Pham, Bjorn Stenger

2014 IEEE Conference on Computer Vision and Pattern Recognition > 668 - 675

This paper proposes a method for estimating the 3D body shape of a person with robustness to clothing. We formulate the problem as optimization over the manifold of valid depth maps of body shapes learned from synthetic training data. The manifold itself is represented using a novel data structure, a Multi-Resolution Manifold Forest (MRMF), which contains vertical edges between tree nodes as well...

chapter

Quality Dynamic Human Body Modeling Using a Single Low-Cost Depth Camera

Qing Zhang, Bo Fu, Mao Ye, Ruigang Yang

2014 IEEE Conference on Computer Vision and Pattern Recognition > 676 - 683

In this paper we present a novel autonomous pipeline to build a personalized parametric model (pose-driven avatar) using a single depth sensor. Our method first captures a few high-quality scans of the user rotating herself at multiple poses from different views. We fit each incomplete scan using template fitting techniques with a generic human template, and register all scans to every pose using...

chapter

Single-View 3D Scene Parsing by Attributed Grammar

Xiaobai Liu, Yibiao Zhao, Song-chun Zhu

2014 IEEE Conference on Computer Vision and Pattern Recognition > 684 - 691

In this paper, we present an attributed grammar for parsing man-made outdoor scenes into semantic surfaces and recovering their 3D models simultaneously. The grammar takes superpixels as its terminal nodes and uses five production rules to generate the scene into a hierarchical parse graph. Each graph node actually correlates with a surface or a composite of surfaces in the 3D world or the 2D image....

chapter

Separation of Line Drawings Based on Split Faces for 3D Object Reconstruction

Changqing Zou, Heng Yang, Jianzhuang Liu

2014 IEEE Conference on Computer Vision and Pattern Recognition > 692 - 699

Reconstructing 3D objects from single line drawings is often desirable in computer vision and graphics applications. If the line drawing of a complex 3D object is decomposed into primitives of simple shape, the object can be easily reconstructed. We propose an effective method to conduct the line drawing separation and turn a complex line drawing into parametric 3D models. This is achieved by recursively...

chapter

When 3D Reconstruction Meets Ubiquitous RGB-D Images

Quanshi Zhang, Xuan Song, Xiaowei Shao, Huijing Zhao, et al.

2014 IEEE Conference on Computer Vision and Pattern Recognition > 700 - 707

3D reconstruction from a single image is a classical problem in computer vision. However, it still poses great challenges for the reconstruction of daily-use objects with irregular shapes. In this paper, we propose to learn 3D reconstruction knowledge from informally captured RGB-D images, which will probably be ubiquitously used in daily life. The learning of 3D reconstruction is defined as a category...

chapter

Stable Template-Based Isometric 3D Reconstruction in All Imaging Conditions by Linear Least-Squares

Ajad Chhatkuli, Daniel Pizarro, Adrien Bartoli

2014 IEEE Conference on Computer Vision and Pattern Recognition > 708 - 715

It has recently been shown that reconstructing an isometric surface from a single 2D input image matched to a 3D template is a well-posed problem. This, however, does not tell us how reconstruction algorithms will behave in practical conditions, where the amount of perspective is generally small and the projection thus behaves like weak-perspective or orthography. We here bring answers to what is theoretically...

chapter

Discrete-Continuous Depth Estimation from a Single Image

Miaomiao Liu, Mathieu Salzmann, Xuming He

2014 IEEE Conference on Computer Vision and Pattern Recognition > 716 - 723

In this paper, we tackle the problem of estimating the depth of a scene from a single image. This is a challenging task, since a single image on its own does not provide any depth cue. To address this, we exploit the availability of a pool of images for which the depth is known. More specifically, we formulate monocular depth estimation as a discrete-continuous optimization problem, where the continuous...

chapter

Leveraging Hierarchical Parametric Networks for Skeletal Joints Based Action Segmentation and Recognition

Di Wu, Ling Shao

2014 IEEE Conference on Computer Vision and Pattern Recognition > 724 - 731

Over the last few years, with the immense popularity of the Kinect, there has been renewed interest in developing methods for human gesture and action recognition from 3D skeletal data. A number of approaches have been proposed to extract representative features from 3D skeletal data, most commonly hard-wired geometric or bio-inspired shape context features. We propose a hierarchical dynamic framework...
