2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Items from 1 to 20 out of 36 results

chapter

Scene Graph Generation by Iterative Message Passing

Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3097 - 3106

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their relationships using scene graphs, a visually-grounded graphical structure of an image. We propose a novel end-to-end model that generates such structured scene representation...

chapter

Supervising Neural Attention Models for Video Captioning by Human Gaze Data

Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6119 - 6127

The attention mechanisms in deep neural networks are inspired by human attention, which sequentially focuses on the most relevant parts of the information over time to generate prediction output. The attention parameters in those models are implicitly trained in an end-to-end manner, yet there have been few attempts to explicitly incorporate human gaze tracking to supervise the attention models. In this...

chapter

Generating the Future with Adversarial Transformers

Carl Vondrick, Antonio Torralba

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2992 - 3000

We learn models to generate the immediate future in video. This problem has two main challenges. Firstly, since the future is uncertain, models should be multi-modal, which can be difficult to learn. Secondly, since the future is similar to the past, models store low-level details, which complicates learning of high-level semantics. We propose a framework to tackle both of these challenges. We present...

chapter

Deep Image Harmonization

Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2799 - 2807

Compositing is one of the most common operations in photo editing. To generate realistic composites, the appearances of foreground and background need to be adjusted to make them compatible. Previous approaches to harmonize composites have focused on learning statistical relationships between hand-crafted appearance features of the foreground and background, which is unreliable especially when the...

chapter

Flexible Spatio-Temporal Networks for Video Prediction

Chaochao Lu, Michael Hirsch, Bernhard Schölkopf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2137 - 2145

We describe a modular framework for video frame prediction. We refer to it as a Flexible Spatio-Temporal Network (FSTN) as it allows the extrapolation of a video sequence as well as the estimation of synthetic frames lying in between observed frames and thus the generation of slow-motion videos. By devising a customized objective function comprising decoding, encoding, and adversarial losses, we are...

chapter

Relationship Proposal Networks

Ji Zhang, Mohamed Elhoseiny, Scott Cohen, Walter Chang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5226 - 5234

Image scene understanding requires learning the relationships between objects in the scene. A scene with many objects may have only a few individual interacting objects (e.g., in a party image with many people, only a handful of people might be speaking with each other). To detect all relationships, it would be inefficient to first detect all individual objects and then classify all pairs, not only...

chapter

Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF

Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5178 - 5186

This paper describes a fast and accurate semantic image segmentation approach that encodes not only segmentation-specified features but also high-order context compatibilities and boundary guidance constraints. We introduce a structured patch prediction technique to make a trade-off between classification discriminability and boundary sensibility for features. Both label and feature contexts are embedded...

chapter

Compact Matrix Factorization with Dependent Subspaces

Viktor Larsson, Carl Olsson

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4361 - 4370

Traditional matrix factorization methods approximate high dimensional data with a low dimensional subspace. This imposes constraints on the matrix elements which allow for estimation of missing entries. A lower rank provides stronger constraints and makes estimation of the missing entries less ambiguous at the cost of measurement fit. In this paper we propose a new factorization model that further...

chapter

The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives

Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6478 - 6487

Visual narrative is often a combination of explicit information and judicious omissions, relying on the viewer to supply missing details. In comics, most movements in time and space are hidden in the gutters between panels. To follow the story, readers logically connect panels together by inferring unseen actions through a process called closure. While computers can now describe the content of natural...

chapter

Predictive-Corrective Networks for Action Detection

Achal Dave, Olga Russakovsky, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2067 - 2076

While deep feature learning has revolutionized techniques for static-image understanding, the same does not quite hold for video processing. Architectures and optimization techniques used for video are largely based on those for static images, potentially underutilizing rich video information. In this work, we rethink both the underlying network architecture and the stochastic learning paradigm for...

chapter

Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5085 - 5093

Training convolutional networks (CNNs) that fit on a single GPU with minibatch stochastic gradient descent has become effective in practice. However, there is still no effective method for training large networks that do not fit in the memory of a few GPU cards, or for parallelizing CNN training. In this work we show that a simple hard mixture of experts model can be efficiently trained to good effect...

chapter

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories

Ziad Al-Halah, Rainer Stiefelhagen

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5112 - 5121

Attribute-based recognition models, due to their impressive performance and their ability to generalize well on novel categories, have been widely adopted for many computer vision applications. However, usually both the attribute vocabulary and the class-attribute associations have to be provided manually by domain experts or a large number of annotators. This is very costly and not necessarily optimal...

chapter

Infinite Variational Autoencoder for Semi-Supervised Learning

M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 781 - 790

This paper presents an infinite variational autoencoder (VAE) whose capacity adapts to suit the input data. This is achieved using a mixture model where the mixing coefficients are modeled by a Dirichlet process, allowing us to integrate over the coefficients when performing inference. Critically, this then allows us to automatically vary the number of autoencoders in the mixture based on the data...

chapter

Variational Bayesian Multiple Instance Learning with Gaussian Processes

Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 810 - 819

Gaussian Processes (GPs) are effective Bayesian predictors. We here show for the first time that instance labels of a GP classifier can be inferred in the multiple instance learning (MIL) setting using variational Bayes. We achieve this via a new construction of the bag likelihood that assumes a large value if the instance predictions obey the MIL constraints and a small value otherwise. This construction...

chapter

Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation

Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 161 - 169

This paper addresses the problem of depth estimation from a single still image. Inspired by recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose...

chapter

Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes

Siavash Gorji, James J. Clark

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3472 - 3481

We present a novel visual attention tracking technique based on Shared Attention modeling. By considering the viewer as a participant in the activity occurring in the scene, our model learns the loci of attention of the scene actors and uses them to augment image salience. We go beyond image salience: instead of only computing the power of image regions to pull attention, we also consider the strength...

chapter

Detect, Replace, Refine: Deep Structured Prediction for Pixel Wise Labeling

Spyros Gidaris, Nikos Komodakis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7187 - 7196

Pixel wise image labeling is an interesting and challenging problem with great significance in the computer vision community. In order for a dense labeling algorithm to be able to achieve accurate and precise results, it has to consider the dependencies that exist in the joint space of both the input and the output variables. An implicit approach for modeling those dependencies is by training a deep...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers causes low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

chapter

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6165 - 6174

Robust object recognition systems usually rely on powerful feature extraction mechanisms from a large number of real images. However, in many realistic applications, collecting sufficient images for ever-growing new classes is unattainable. In this paper, we propose a new zero-shot learning (ZSL) framework that can synthesise visual features for unseen classes without acquiring real images. Using...

chapter

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

Denis Tome, Chris Russell, Lourdes Agapito

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5689 - 5698

We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search...

Keywords:
PREDICTIVE MODELS


