2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Supervising Neural Attention Models for Video Captioning by Human Gaze Data

Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6119 - 6127

The attention mechanisms in deep neural networks are inspired by humans attention that sequentially focuses on the most relevant parts of the information over time to generate prediction output. The attention parameters in those models are implicitly trained in an end-to-end manner, yet there have been few trials to explicitly incorporate human gaze tracking to supervise the attention models. In this...

chapter

Compact Matrix Factorization with Dependent Subspaces

Viktor Larsson, Carl Olsson

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4361 - 4370

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Traditional matrix factorization methods approximate high dimensional data with a low dimensional subspace. This imposes constraints on the matrix elements which allow for estimation of missing entries. A lower rank provides stronger constraints and makes estimation of the missing entries less ambiguous at the cost of measurement fit. In this paper we propose a new factorization model that further...

chapter

Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5085 - 5093

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Training convolutional networks (CNNs) that fit on a single GPU with minibatch stochastic gradient descent has become effective in practice. However, there is still no effective method for training large networks that do not fit in the memory of a few GPU cards, or for parallelizing CNN training. In this work we show that a simple hard mixture of experts model can be efficiently trained to good effect...

chapter

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories

Ziad Al-Halah, Rainer Stiefelhagen

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5112 - 5121

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Attribute-based recognition models, due to their impressive performance and their ability to generalize well on novel categories, have been widely adopted for many computer vision applications. However, usually both the attribute vocabulary and the class-attribute associations have to be provided manually by domain experts or large number of annotators. This is very costly and not necessarily optimal...

chapter

Infinite Variational Autoencoder for Semi-Supervised Learning

M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 781 - 790

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data. This is achieved using a mixture model where the mixing coefficients are modeled by a Dirichlet process, allowing us to integrate over the coefficients when performing inference. Critically, this then allows us to automatically vary the number of autoencoders in the mixture based on the data...

chapter

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

Denis Tome, Chris Russell, Lourdes Agapito

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5689 - 5698

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search...

chapter

Captioning Images with Diverse Objects

Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1170 - 1178

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent captioning models are limited in their ability to scale and describe concepts unseen in paired image-text corpora. We propose the Novel Object Captioner (NOC), a deep visual semantic captioning model that can describe a large number of object categories not present in existing image-caption datasets. Our model takes advantage of external sources – labeled images from object recognition...

chapter

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3530 - 3538

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Supervising Neural Attention Models for Video Captioning by Human Gaze Data

Compact Matrix Factorization with Dependent Subspaces

Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories

Infinite Variational Autoencoder for Semi-Supervised Learning

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

Captioning Images with Diverse Objects

End-to-End Learning of Driving Models from Large-Scale Video Datasets

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)