2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Predicting Multiple Attributes via Relative Multi-task Learning

Lin Chen, Qiang Zhang, Baoxin Li

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1027 - 1034

Relative attributes learning aims to learn ranking functions describing the relative strength of attributes. Most of current learning approaches learn ranking functions for each attribute independently without considering possible intrinsic relatedness among the attributes. For a problem involving multiple attributes, it is reasonable to assume that utilizing such relatedness among the attributes...

chapter

Incremental Activity Modeling and Recognition in Streaming Videos

Mahmudul Hasan, Amit K. Roy-Chowdhury

2014 IEEE Conference on Computer Vision and Pattern Recognition > 796 - 803

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most of the state-of-the-art approaches to human activity recognition in video need an intensive training stage and assume that all of the training examples are labeled and available beforehand. But these assumptions are unrealistic for many applications where we have to deal with streaming videos. In these videos, as new activities are seen, they can be leveraged upon to improve the current activity...

chapter

Discrete-Continuous Depth Estimation from a Single Image

Miaomiao Liu, Mathieu Salzmann, Xuming He

2014 IEEE Conference on Computer Vision and Pattern Recognition > 716 - 723

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we tackle the problem of estimating the depth of a scene from a single image. This is a challenging task, since a single image on its own does not provide any depth cue. To address this, we exploit the availability of a pool of images for which the depth is known. More specifically, we formulate monocular depth estimation as a discrete-continuous optimization problem, where the continuous...

chapter

Ask the Image: Supervised Pooling to Preserve Feature Locality

Sean Ryan Fanello, Nicoletta Noceti, Carlo Ciliberto, Giorgio Metta, more

2014 IEEE Conference on Computer Vision and Pattern Recognition > 851 - 858

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we propose a weighted supervised pooling method for visual recognition systems. We combine a standard Spatial Pyramid Representation which is commonly adopted to encode spatial information, with an appropriate Feature Space Representation favoring semantic information in an appropriate feature space. For the latter, we propose a weighted pooling strategy exploiting data supervision to...

chapter

Capturing Long-Tail Distributions of Object Subcategories

Xiangxin Zhu, Dragomir Anguelov, Deva Ramanan

2014 IEEE Conference on Computer Vision and Pattern Recognition > 915 - 922

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We argue that object subcategories follow a long-tail distribution: a few subcategories are common, while many are rare. We describe distributed algorithms for learning large- mixture models that capture long-tail distributions, which are hard to model with current approaches. We introduce a generalized notion of mixtures (or subcategories) that allow for examples to be shared across multiple subcategories...

chapter

On Projective Reconstruction in Arbitrary Dimensions

Behrooz Nasihatkon, Richard Hartley, Jochen Trumpf

2014 IEEE Conference on Computer Vision and Pattern Recognition > 477 - 484

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the theory of projective reconstruction for multiple projections from an arbitrary dimensional projective space into lower-dimensional spaces. This problem is important due to its applications in the analysis of dynamical scenes. The current theory, due to Hartley and Schaffalitzky, is based on the Grassmann tensor, generalizing the ideas of fundamental matrix, trifocal tensor and quadrifocal...

chapter

Minimal Scene Descriptions from Structure from Motion Models

Song Cao, Noah Snavely

2014 IEEE Conference on Computer Vision and Pattern Recognition > 461 - 468

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

How much data do we need to describe a location? We explore this question in the context of 3D scene reconstructions created from running structure from motion on large Internet photo collections, where reconstructions can contain many millions of 3D points. We consider several methods for computing much more compact representations of such reconstructions for the task of location recognition, with...

chapter

Simultaneous Localization and Calibration: Self-Calibration of Consumer Depth Cameras

Qian-Yi Zhou, Vladlen Koltun

2014 IEEE Conference on Computer Vision and Pattern Recognition > 454 - 460

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We describe an approach for simultaneous localization and calibration of a stream of range images. Our approach jointly optimizes the camera trajectory and a calibration function that corrects the camera's unknown nonlinear distortion. Experiments with real-world benchmark data and synthetic data show that our approach increases the accuracy of camera trajectories and geometric models estimated from...

chapter

Fast, Approximate Piecewise-Planar Modeling Based on Sparse Structure-from-Motion and Superpixels

Andras Bodis-Szomoru, Hayko Riemenschneider, Luc Van Gool

2014 IEEE Conference on Computer Vision and Pattern Recognition > 469 - 476

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

State-of-the-art Multi-View Stereo (MVS) algorithms deliver dense depth maps or complex meshes with very high detail, and redundancy over regular surfaces. In turn, our interest lies in an approximate, but light-weight method that is better to consider for large-scale applications, such as urban scene reconstruction from ground-based images. We present a novel approach for producing dense reconstructions...

chapter

Max-Margin Boltzmann Machines for Object Segmentation

Jimei Yang, Simon Safar, Ming-Hsuan Yang

2014 IEEE Conference on Computer Vision and Pattern Recognition > 320 - 327

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present Max-Margin Boltzmann Machines (MMBMs) for object segmentation. MMBMs are essentially a class of Conditional Boltzmann Machines that model the joint distribution of hidden variables and output labels conditioned on input observations. In addition to image-to-label connections, we build direct image-to-hidden connections to facilitate global shape prediction, and thus derive a simple Iterated...

chapter

Point Matching in the Presence of Outliers in Both Point Sets: A Concave Optimization Approach

Wei Lian, Lei Zhang

2014 IEEE Conference on Computer Vision and Pattern Recognition > 352 - 359

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, a concave optimization approach has been proposed to solve the robust point matching (RPM) problem. This method is globally optimal, but it requires that each model point has a counterpart in the data point set. Unfortunately, such a requirement may not be satisfied in certain applications when there are outliers in both point sets. To address this problem, we relax this condition and reduce...

chapter

Multi-view Super Vector for Action Recognition

Zhuowei Cai, Limin Wang, Xiaojiang Peng, Yu Qiao

2014 IEEE Conference on Computer Vision and Pattern Recognition > 596 - 603

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Images and videos are often characterized by multiple types of local descriptors such as SIFT, HOG and HOF, each of which describes certain aspects of object feature. Recognition systems benefit from fusing multiple types of these descriptors. Two widely applied fusion pipelines are descriptor concatenation and kernel average. The first one is effective when different descriptors are strongly correlated,...

chapter

Covariance Trees for 2D and 3D Processing

Thierry Guillemot, Andres Almansa, Tamy Boubekeur

2014 IEEE Conference on Computer Vision and Pattern Recognition > 556 - 563

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Gaussian Mixture Models have become one of the major tools in modern statistical image processing, and allowed performance breakthroughs in patch-based image denoising and restoration problems. Nevertheless, their adoption level was kept relatively low because of the computational cost associated to learning such models on large image databases. This work provides a flexible and generic tool for dealing...

chapter

Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group

Raviteja Vemulapalli, Felipe Arrate, Rama Chellappa

2014 IEEE Conference on Computer Vision and Pattern Recognition > 588 - 595

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently introduced cost-effective depth sensors coupled with the real-time skeleton estimation algorithm of Shotton et al. [16] have generated a renewed interest in skeleton-based human action recognition. Most of the existing skeleton-based approaches use either the joint locations or the joint angles to represent a human skeleton. In this paper, we propose a new skeletal representation that explicitly...

chapter

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik

2014 IEEE Conference on Computer Vision and Pattern Recognition > 580 - 587

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous...

chapter

Optimal Decisions from Probabilistic Models: The Intersection-over-Union Case

Sebastian Nowozin

2014 IEEE Conference on Computer Vision and Pattern Recognition > 548 - 555

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A probabilistic model allows us to reason about the world and make statistically optimal decisions using Bayesian decision theory. However, in practice the intractability of the decision problem forces us to adopt simplistic loss functions such as the 0/1 loss or Hamming loss and as result we make poor decisions through MAP estimates or through low-order marginal statistics. In this work we investigate...

chapter

Relative Pose Estimation for a Multi-camera System with Known Vertical Direction

Gim Hee Lee, Marc Pollefeys, Friedrich Fraundorfer

2014 IEEE Conference on Computer Vision and Pattern Recognition > 540 - 547

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present our minimal 4-point and linear 8-point algorithms to estimate the relative pose of a multi-camera system with known vertical directions, i.e. known absolute roll and pitch angles. We solve the minimal 4-point algorithm with the hidden variable resultant method and show that it leads to an 8-degree univariate polynomial that gives up to 8 real solutions. We identify a degenerated...

chapter

Two-View Camera Housing Parameters Calibration for Multi-layer Flat Refractive Interface

Xida Chen, Yee-Hong Yang

2014 IEEE Conference on Computer Vision and Pattern Recognition > 524 - 531

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present a novel refractive calibration method for an underwater stereo camera system where both cameras are looking through multiple parallel flat refractive interfaces. At the heart of our method is an important finding that the thickness of the interface can be estimated from a set of pixel correspondences in the stereo images when the refractive axis is given. To our best knowledge,...

chapter

Visual Persuasion: Inferring Communicative Intents of Images

Jungseock Joo, Weixin Li, Francis F. Steen, Song-Chun Zhu

2014 IEEE Conference on Computer Vision and Pattern Recognition > 216 - 223

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we introduce the novel problem of understanding visual persuasion. Modern mass media make extensive use of images to persuade people to make commercial and political decisions. These effects and techniques are widely studied in the social sciences, but behavioral studies do not scale to massive datasets. Computer vision has made great strides in building syntactical representations of...

chapter

The Secrets of Salient Object Segmentation

Yin Li, Xiaodi Hou, Christof Koch, James M. Rehg, more

2014 IEEE Conference on Computer Vision and Pattern Recognition > 280 - 287

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we provide an extensive evaluation of fixation prediction and salient object segmentation algorithms as well as statistics of major datasets. Our analysis identifies serious design flaws of existing salient object benchmarks, called the dataset design bias, by over emphasising the stereotypical concepts of saliency. The dataset design bias does not only create the discomforting disconnection...

INFONA - science communication portal

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Predicting Multiple Attributes via Relative Multi-task Learning

Incremental Activity Modeling and Recognition in Streaming Videos

Discrete-Continuous Depth Estimation from a Single Image

Ask the Image: Supervised Pooling to Preserve Feature Locality

Capturing Long-Tail Distributions of Object Subcategories

On Projective Reconstruction in Arbitrary Dimensions

Minimal Scene Descriptions from Structure from Motion Models

Simultaneous Localization and Calibration: Self-Calibration of Consumer Depth Cameras

Fast, Approximate Piecewise-Planar Modeling Based on Sparse Structure-from-Motion and Superpixels

Max-Margin Boltzmann Machines for Object Segmentation

Point Matching in the Presence of Outliers in Both Point Sets: A Concave Optimization Approach

Multi-view Super Vector for Action Recognition

Covariance Trees for 2D and 3D Processing

Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Optimal Decisions from Probabilistic Models: The Intersection-over-Union Case

Relative Pose Estimation for a Multi-camera System with Known Vertical Direction

Two-View Camera Housing Parameters Calibration for Multi-layer Flat Refractive Interface

Visual Persuasion: Inferring Communicative Intents of Images

The Secrets of Salient Object Segmentation

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)