2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Large-Scale Video Classification with Convolutional Neural Networks

Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, more

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1725 - 1732

Convolutional Neural Networks (CNNs) have been established as a powerful class of models for image recognition problems. Encouraged by these results, we provide an extensive empirical evaluation of CNNs on large-scale video classification using a new dataset of 1 million YouTube videos belonging to 487 classes. We study multiple approaches for extending the connectivity of a CNN in time domain to...

chapter

A Hierarchical Probabilistic Model for Facial Feature Detection

Yue Wu, Ziheng Wang, Qiang Ji

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1781 - 1788

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Facial feature detection from facial images has attracted great attention in the field of computer vision. It is a nontrivial task since the appearance and shape of the face tend to change under different conditions. In this paper, we propose a hierarchical probabilistic model that could infer the true locations of facial features given the image measurements even if the face is with significant facial...

chapter

Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition

Mengyi Liu, Shiguang Shan, Ruiping Wang, Xilin Chen

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1749 - 1756

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Facial expression is temporally dynamic event which can be decomposed into a set of muscle motions occurring in different facial regions over various time intervals. For dynamic expression recognition, two key issues, temporal alignment and semantics-aware dynamic representation, must be taken into account. In this paper, we attempt to solve both problems via manifold modeling of videos based on a...

chapter

Convolutional Neural Networks for No-Reference Image Quality Assessment

Le Kang, Peng Ye, Yi Li, David Doermann

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1733 - 1740

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work we describe a Convolutional Neural Network (CNN) to accurately predict image quality without a reference image. Taking image patches as input, the CNN works in the spatial domain without using hand-crafted features that are employed by most previous methods. The network consists of one convolutional layer with max and min pooling, two fully connected layers and an output node. Within...

chapter

Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras

Kenneth Alberto Funes Mora, Jean-Marc Odobez

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1773 - 1780

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a head pose invariant gaze estimation model for distant RGB-D cameras. It relies on a geometric understanding of the 3D gaze action and generation of eye images. By introducing a semantic segmentation of the eye region within a generative process, the model (i) avoids the critical feature tracking of geometrical approaches requiring high resolution images, (ii) decouples the person dependent...

chapter

Unified Face Analysis by Iterative Multi-output Random Forests

Xiaowei Zhao, Tae-Kyun Kim, Wenhan Luo

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1765 - 1772

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present a unified method for joint face image analysis, i.e., simultaneously estimating head pose, facial expression and landmark positions in real-world face images. To achieve this goal, we propose a novel iterative Multi-Output Random Forests (iMORF) algorithm, which explicitly models the relations among multiple tasks and iteratively exploits such relations to boost the performance...

chapter

RAPS: Robust and Efficient Automatic Construction of Person-Specific Deformable Models

Christos Sagonas, Yannis Panagakis, Stefanos Zafeiriou, Maja Pantic

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1789 - 1796

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The construction of Facial Deformable Models (FDMs) is a very challenging computer vision problem, since the face is a highly deformable object and its appearance drastically changes under different poses, expressions, and illuminations. Although several methods for generic FDMs construction, have been proposed for facial landmark localization in still images, they are insufficient for tasks such...

chapter

Topic Modeling of Multimodal Data: An Autoregressive Approach

Yin Zheng, Yu-Jin Zhang, Hugo Larochelle

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1370 - 1377

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply...

chapter

Higher-Order Clique Reduction without Auxiliary Variables

Hiroshi Ishikawa

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1362 - 1369

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce a method to reduce most higher-order terms of Markov Random Fields with binary labels into lower-order ones without introducing any new variables, while keeping the minimizer of the energy unchanged. While the method does not reduce all terms, it can be used with existing techniques that transformsarbitrary terms (by introducing auxiliary variables) and improve the speed. The method eliminates...

chapter

Scanline Sampler without Detailed Balance: An Efficient MCMC for MRF Optimization

Wonsik Kim, Kyoung Mu Lee

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1354 - 1361

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Markov chain Monte Carlo (MCMC) is an elegant tool, widely used in variety of areas. In computer vision, it has been used for the inference on the Markov random field model (MRF). However, MCMC less concerned than other deterministic approaches although it converges to global optimal solution in theory. The major obstacle is its slow convergence. To come up with faster sampling method, we investigate...

chapter

Partial Occlusion Handling for Visual Tracking via Robust Part Matching

Tianzhu Zhang, Kui Jia, Changsheng Xu, Yi Ma, more

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1258 - 1265

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Part-based visual tracking is advantageous due to its robustness against partial occlusion. However, how to effectively exploit the confidence scores of individual parts to construct a robust tracker is still a challenging problem. In this paper, we address this problem by simultaneously matching parts in each of multiple frames, which is realized by a locality-constrained low-rank sparse learning...

chapter

Subspace Tracking under Dynamic Dimensionality for Online Background Subtraction

Matthew Berger, Lee M. Seversky

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1274 - 1281

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Long-term modeling of background motion in videos is an important and challenging problem used in numerous applications such as segmentation and event recognition. A major challenge in modeling the background from point trajectories lies in dealing with the variable length duration of trajectories, which can be due to such factors as trajectories entering and leaving the frame or occlusion from different...

chapter

Learning Scalable Discriminative Dictionary with Sample Relatedness

Jiashi Feng, Stefanie Jegelka, Shuicheng Yan, Trevor Darrell

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1645 - 1652

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Attributes are widely used as mid-level descriptors of object properties in object recognition and retrieval. Mostly, such attributes are manually pre-defined based on domain knowledge, and their number is fixed. However, pre-defined attributes may fail to adapt to the properties of the data at hand, may not necessarily be discriminative, and/or may not generalize well. In this work, we propose a...

chapter

PANDA: Pose Aligned Networks for Deep Attribute Modeling

Ning Zhang, Manohar Paluri, Marc'Aurelio Ranzato, Trevor Darrell, more

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1637 - 1644

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a method for inferring human attributes (such as gender, hair style, clothes style, expression, action) from images of people under large variation of viewpoint, pose, appearance, articulation and occlusion. Convolutional Neural Nets (CNN) have been shown to perform very well on large scale object recognition problems. In the context of attribute classification, however, the signal is often...

chapter

Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation

Catalin Ionescu, Joao Carreira, Cristian Sminchisescu

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1661 - 1668

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, the emergence of Kinect systems has demonstrated the benefits of predicting an intermediate body part labeling for 3D human pose estimation, in conjunction with RGB-D imagery. The availability of depth information plays a critical role, so an important question is whether a similar representation can be developed with sufficient robustness in order to estimate 3D pose from RGB images. This...

chapter

Transfer Joint Matching for Unsupervised Domain Adaptation

Mingsheng Long, Jianmin Wang, Guiguang Ding, Jiaguang Sun, more

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1410 - 1417

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Visual domain adaptation, which learns an accurate classifier for a new domain using labeled images from an old domain, has shown promising value in computer vision yet still been a challenging problem. Most prior works have explored two learning strategies independently for domain adaptation: feature matching and instance reweighting. In this paper, we show that both strategies are important and...

chapter

Scalable Multitask Representation Learning for Scene Classification

Maksim Lapin, Bernt Schiele, Matthias Hein

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1434 - 1441

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The underlying idea of multitask learning is that learning tasks jointly is better than learning each task individually. In particular, if only a few training examples are available for each task, sharing a jointly trained representation improves classification performance. In this paper, we propose a novel multitask learning method that learns a low-dimensional representation jointly with the corresponding...

chapter

Stable Learning in Coding Space for Multi-class Decoding and Its Extension for Multi-class Hypothesis Transfer Learning

Bang Zhang, Yi Wang, Yang Wang, Fang Chen

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1075 - 1081

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Many prevalent multi-class classification approaches can be unified and generalized by the output coding framework which usually consists of three phases: (1) coding, (2) learning binary classifiers, and (3) decoding. Most of these approaches focus on the first two phases and predefined distance function is used for decoding. In this paper, however, we propose to perform learning in coding space for...

chapter

Merging SVMs with Linear Discriminant Analysis: A Combined Model

Symeon Nikitidis, Stefanos Zafeiriou, Maja Pantic

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1067 - 1074

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A key problem often encountered by many learning algorithms in computer vision dealing with high dimensional data is the so called "curse of dimensionality" which arises when the available training samples are less than the input feature space dimensionality. To remedy this problem, we propose a joint dimensionality reduction and classification framework by formulating an optimization problem...

chapter

Optimizing Average Precision Using Weakly Supervised Data

Aseem Behl, C.V. Jawahar, M. Pawan Kumar

2014 IEEE Conference on Computer Vision and Pattern Recognition > 1011 - 1018

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The performance of binary classification tasks, such as action classification and object detection, is often measured in terms of the average precision (AP). Yet it is common practice in computer vision to employ the support vector machine (SVM) classifier, which optimizes a surrogate 0-1 loss. The popularity of SVM can be attributed to its empirical performance. Specifically, in fully supervised...

INFONA - science communication portal

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Large-Scale Video Classification with Convolutional Neural Networks

A Hierarchical Probabilistic Model for Facial Feature Detection

Learning Expressionlets on Spatio-temporal Manifold for Dynamic Facial Expression Recognition

Convolutional Neural Networks for No-Reference Image Quality Assessment

Geometric Generative Gaze Estimation (G3E) for Remote RGB-D Cameras

Unified Face Analysis by Iterative Multi-output Random Forests

RAPS: Robust and Efficient Automatic Construction of Person-Specific Deformable Models

Topic Modeling of Multimodal Data: An Autoregressive Approach

Higher-Order Clique Reduction without Auxiliary Variables

Scanline Sampler without Detailed Balance: An Efficient MCMC for MRF Optimization

Partial Occlusion Handling for Visual Tracking via Robust Part Matching

Subspace Tracking under Dynamic Dimensionality for Online Background Subtraction

Learning Scalable Discriminative Dictionary with Sample Relatedness

PANDA: Pose Aligned Networks for Deep Attribute Modeling

Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation

Transfer Joint Matching for Unsupervised Domain Adaptation

Scalable Multitask Representation Learning for Scene Classification

Stable Learning in Coding Space for Multi-class Decoding and Its Extension for Multi-class Hypothesis Transfer Learning

Merging SVMs with Linear Discriminant Analysis: A Combined Model

Optimizing Average Precision Using Weakly Supervised Data

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)