2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Learning to Learn from Noisy Web Videos

Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7455 - 7463

Understanding the simultaneously very diverse and intricately fine-grained set of possible human actions is a critical open problem in computer vision. Manually labeling training videos is feasible for some action classes but doesnt scale to the full long-tailed distribution of actions. A promising way to address this is to leverage noisy data from web queries to learn new actions, using semi-supervised...

chapter

Multi-way Multi-level Kernel Modeling for Neuroimaging Classification

Lifang He, Chun-Ta Lu, Hao Ding, Shen Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6846 - 6854

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Owing to prominence as a diagnostic tool for probing the neural correlates of cognition, neuroimaging tensor data has been the focus of intense investigation. Although many supervised tensor learning approaches have been proposed, they either cannot capture the nonlinear relationships of tensor data or cannot preserve the complex multi-way structural information. In this paper, we propose a Multi-way...

chapter

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3068 - 3076

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we introduce Recipe1M, a new large-scale, structured corpus of over 1m cooking recipes and 800k food images. As the largest publicly available collection of recipe data, Recipe1M affords the ability to train high-capacity models on aligned, multi-modal data. Using these data, we train a neural network to find a joint embedding of recipes and images that yields impressive results on...

chapter

Supervising Neural Attention Models for Video Captioning by Human Gaze Data

Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6119 - 6127

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The attention mechanisms in deep neural networks are inspired by humans attention that sequentially focuses on the most relevant parts of the information over time to generate prediction output. The attention parameters in those models are implicitly trained in an end-to-end manner, yet there have been few trials to explicitly incorporate human gaze tracking to supervise the attention models. In this...

chapter

Low-Rank-Sparse Subspace Representation for Robust Regression

Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2972 - 2981

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Learning robust regression model from high-dimensional corrupted data is an essential and difficult problem in many practical applications. The state-of-the-art methods have studied low-rank regression models that are robust against typical noises (like Gaussian noise and out-sample sparse noise) or outliers, such that a regression model can be learned from clean data lying on underlying subspaces...

chapter

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

Youngjoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2943 - 2952

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a new high dimensional regression method by merging Gaussian process regression into a variational autoencoder framework. In contrast to other regression methods, the proposed method focuses on the case where output responses are on a complex high dimensional manifold, such as images. Our contributions are summarized as follows: (i) A new regression method estimating high dimensional...

chapter

3D Face Morphable Models "In-the-Wild"

James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5464 - 5473

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

3D Morphable Models (3DMMs) are powerful statistical models of 3D facial shape and texture, and among the state-of-the-art methods for reconstructing facial shape from single images. With the advent of new 3D sensors, many 3D facial datasets have been collected containing both neutral as well as expressive faces. However, all datasets are captured under controlled conditions. Thus, even though powerful...

chapter

Interpretable Structure-Evolving LSTM

Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2175 - 2184

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper develops a general framework for learning interpretable data representation via Long Short-Term Memory (LSTM) recurrent neural networks over hierarchal graph structures. Instead of learning LSTM models over the pre-fixed structures, we propose to further learn the intermediate interpretable multi-level graph structures in a progressive and stochastic way from data during the LSTM network...

chapter

Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences

Zhiyuan Shi, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4684 - 4693

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing RNN-based approaches for action recognition from depth sequences require either skeleton joints or hand-crafted depth features as inputs. An end-to-end manner, mapping from raw depth maps to action classes, is non-trivial to design due to the fact that: 1) single channel map lacks texture thus weakens the discriminative power, 2) relatively small set of depth training data. To address these...

chapter

Deeply Aggregated Alternating Minimization for Image Restoration

Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 284 - 292

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Regularization-based image restoration has remained an active research topic in image processing and computer vision. It often leverages a guidance signal captured in different fields as an additional cue. In this work, we present a general framework for image restoration, called deeply aggregated alternating minimization (DeepAM). We propose to train deep neural network to advance two of the steps...

chapter

Probabilistic Temporal Subspace Clustering

Behnam Gholami, Vladimir Pavlovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4313 - 4322

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Subspace clustering is a common modeling paradigm used to identify constituent modes of variation in data with locally linear structure. These structures are common to many problems in computer vision, including modeling time series of complex human motion. However classical subspace clustering algorithms learn the relationships within a set of data without considering the temporal dependency and...

chapter

Compact Matrix Factorization with Dependent Subspaces

Viktor Larsson, Carl Olsson

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4361 - 4370

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Traditional matrix factorization methods approximate high dimensional data with a low dimensional subspace. This imposes constraints on the matrix elements which allow for estimation of missing entries. A lower rank provides stronger constraints and makes estimation of the missing entries less ambiguous at the cost of measurement fit. In this paper we propose a new factorization model that further...

chapter

AGA: Attribute-Guided Augmentation

Mandar Dixit, Roland Kwitt, Marc Niethammer, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3328 - 3336

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We consider the problem of data augmentation, i.e., generating artificial samples to extend a given corpus of training data. Specifically, we propose attributed-guided augmentation (AGA) which learns a mapping that allows to synthesize data such that an attribute of a synthesized sample is at a desired value or strength. This is particularly interesting in situations where little data with no attribute...

chapter

Expert Gate: Lifelong Learning with a Network of Experts

Rahaf Aljundi, Punarjay Chakravarty, Tinne Tuytelaars

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7120 - 7129

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we introduce a model of lifelong learning, based on a Network of Experts. New tasks / experts are learned and added to the model sequentially, building on what was learned before. To ensure scalability of this process, data from previous tasks cannot be stored and hence is not available when learning a new task. A critical issue in such context, not addressed in the literature so far,...

chapter

Adversarially Tuned Scene Generation

Vsr Veeravasarapu, Constantin Rothkopf, Ramesh Visvanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6441 - 6449

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Generalization performance of trained computer vision (CV) systems that use computer graphics (CG) generated data is not yet effective due to the concept of domain-shift between virtual and real data. Although simulated data augmented with a few real-world samples has been shown to mitigate domain shift and improve transferability of trained models, guiding or bootstrapping the virtual data generation...

chapter

Knowledge Acquisition for Visual Question Answering via Iterative Querying

Yuke Zhu, Joseph J. Lim, Li Fei-Fei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6146 - 6155

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Humans possess an extraordinary ability to learn new skills and new knowledge for problem solving. Such learning ability is also required by an automatic model to deal with arbitrary, open-ended questions in the visual world. We propose a neural-based approach to acquiring task-driven information for visual question answering (VQA). Our model proposes queries to actively acquire relevant information...

chapter

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

Shireen Elhabian, Ross Whitaker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2185 - 2196

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Shape models provide a compact parameterization of a class of shapes, and have been shown to be important to a variety of vision problems, including object detection, tracking, and image segmentation. Learning generative shape models from grid-structured representations, aka silhouettes, is usually hindered by (1) data likelihoods with intractable marginals and posteriors, (2) high-dimensional shape...

chapter

3D Menagerie: Modeling the 3D Shape and Pose of Animals

Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5524 - 5532

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

There has been significant work on learning realistic, articulated, 3D models of the human body. In contrast, there are few such models of animals, despite many applications. The main challenge is that animals are much less cooperative than humans. The best human body models are learned from thousands of 3D scans of people in specific poses, which is infeasible with live animals. Consequently, we...

chapter

An Efficient Background Term for 3D Reconstruction and Tracking with Smooth Surface Models

Mariano Jaimez, Thomas J. Cashman, Andrew Fitzgibbon, Javier Gonzalez-Jimenez, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2575 - 2583

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a novel strategy to shrink and constrain a 3D model, represented as a smooth spline-like surface, within the visual hull of an object observed from one or multiple views. This new background or silhouette term combines the efficiency of previous approaches based on an image-plane distance transform with the accuracy of formulations based on raycasting or ray potentials. The overall formulation...

chapter

Deep Hashing Network for Unsupervised Domain Adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5385 - 5394

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, deep neural networks have emerged as a dominant machine learning tool for a wide variety of application domains. However, training a deep neural network requires a large amount of labeled data, which is an expensive process in terms of time, labor and human expertise. Domain adaptation or transfer learning algorithms address this challenge by leveraging labeled data in a different,...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Learning to Learn from Noisy Web Videos

Multi-way Multi-level Kernel Modeling for Neuroimaging Classification

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

Supervising Neural Attention Models for Video Captioning by Human Gaze Data

Low-Rank-Sparse Subspace Representation for Robust Regression

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

3D Face Morphable Models "In-the-Wild"

Interpretable Structure-Evolving LSTM

Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences

Deeply Aggregated Alternating Minimization for Image Restoration

Probabilistic Temporal Subspace Clustering

Compact Matrix Factorization with Dependent Subspaces

AGA: Attribute-Guided Augmentation

Expert Gate: Lifelong Learning with a Network of Experts

Adversarially Tuned Scene Generation

Knowledge Acquisition for Visual Question Answering via Iterative Querying

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

3D Menagerie: Modeling the 3D Shape and Pose of Animals

An Efficient Background Term for 3D Reconstruction and Tracking with Smooth Surface Models

Deep Hashing Network for Unsupervised Domain Adaptation

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)