2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Scene Parsing through ADE20K Dataset

Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5122 - 5130

Scene parsing, or recognizing and segmenting objects and stuff in an image, is one of the key problems in computer vision. Despite the communitys efforts in data collection, there are still few image datasets covering a wide range of scenes and object categories with dense and detailed annotations for scene parsing. In this paper, we introduce and analyze the ADE20K dataset, spanning diverse annotations...

chapter

A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5094 - 5102

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a Reinforcement Learning (RL) solution to the view planning problem (VPP), which generates a sequence of view points that are capable of sensing all accessible area of a given object represented as a 3D model. In doing so, the goal is to minimize the number of view points, making the VPP a class of set covering optimization problem (SCOP). The SCOP is NP-hard, and the inapproximability...

chapter

Zero-Shot Classification with Discriminative Semantic Representation Learning

Meng Ye, Yuhong Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5103 - 5111

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Zero-shot learning, a special case of unsupervised domain adaptation where the source and target domains have disjoint label spaces, has become increasingly popular in the computer vision community. In this paper, we propose a novel zero-shot learning method based on discriminative sparse non-negative matrix factorization. The proposed approach aims to identify a set of common high-level semantic...

chapter

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5057 - 5065

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Indoor scene understanding is central to applications such as robot navigation and human companion assistance. Over the last years, data-driven deep neural networks have outperformed many traditional approaches thanks to their representation learning capabilities. One of the bottlenecks in training for better representations is the amount of available per-pixel ground truth data that is required for...

chapter

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1417 - 1426

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Temporal action localization is an important yet challenging problem. Given a long, untrimmed video consisting of multiple action instances and complex background contents, we need not only to recognize their action categories, but also to localize the start time and end time of each instance. Many state-of-the-art systems use segment-level classifiers to select and rank proposal segments of pre-determined...

chapter

The World of Fast Moving Objects

Denys Rozumnyi, Jan Kotera, Filip Sroubek, Lukas Novotny, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4838 - 4846

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The notion of a Fast Moving Object (FMO), i.e. an object that moves over a distance exceeding its size within the exposure time, is introduced. FMOs may, and typically do, rotate with high angular speed. FMOs are very common in sports videos, but are not rare elsewhere. In a single frame, such objects are often barely visible and appear as semitransparent streaks. A method for the detection and tracking...

chapter

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Yinlin Hu, Yunsong Li, Rui Song

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4791 - 4799

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The interpolation of correspondences (EpicFlow) was widely used for optical flow estimation in most-recent works. It has the advantage of edge-preserving and efficiency. However, it is vulnerable to input matching noise, which is inevitable in modern matching techniques. In this paper, we present a Robust Interpolation method of Correspondences (called RicFlow) to overcome the weakness. First, the...

chapter

Attentional Correlation Filter Network for Adaptive Visual Tracking

Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4828 - 4837

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new tracking framework with an attentional mechanism that chooses a subset of the associated correlation filters for increased robustness and computational efficiency. The subset of filters is adaptively selected by a deep attentional network according to the dynamic properties of the tracking target. Our contributions are manifold, and are summarised as follows: (i) Introducing the Attentional...

chapter

Deep Level Sets for Salient Object Detection

Ping Hu, Bing Shuai, Jun Liu, Gang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 540 - 549

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning has been applied to saliency detection in recent years. The superior performance has proved that deep networks can model the semantic properties of salient objects. Yet it is difficult for a deep network to discriminate pixels belonging to similar receptive fields around the object boundaries, thus deep networks may output maps with blurred saliency and inaccurate boundaries. To tackle...

chapter

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4598 - 4607

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Unlike the previous depth-based discriminative or data-driven methods that require sophisticated training or manual intervention, we propose a generative framework that unifies pose tracking and face model adaptation on-the-fly. Particularly,...

chapter

Identifying First-Person Camera Wearers in Third-Person Videos

Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4734 - 4742

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We consider scenarios in which we wish to perform joint scene understanding, object tracking, activity recognition, and other tasks in scenarios in which multiple people are wearing body-worn cameras while a third-person static camera also captures the scene. To do this, we need to establish person-level correspondences across first-and third-person videos, which is challenging because the camera...

chapter

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4714 - 4723

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a deep multitask architecture for fully automatic 2d and 3d human sensing (DMHS), including recognition and reconstruction, in monocular images. The system computes the figure-ground segmentation, semantically identifies the human body parts at pixel level, and estimates the 2d and 3d pose of the person. The model supports the joint training of all components by means of multi-task losses...

chapter

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

Shiyu Huang, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4664 - 4673

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

As autonomous vehicles become an every-day reality, high-accuracy pedestrian detection is of paramount practical importance. Pedestrian detection is a highly researched topic with mature methods, but most datasets (for both training and evaluation) focus on common scenes of people engaged in typical walking poses on sidewalks. But performance is most crucial for dangerous scenarios that are rarely...

chapter

PoseTrack: Joint Multi-person Pose Estimation and Tracking

Umar Iqbal, Anton Milan, Juergen Gall

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4654 - 4663

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking of an unknown number of persons in unconstrained videos. Existing methods for multi-person pose estimation in images cannot be applied directly to this problem, since it also requires to solve the problem of person association over time in addition to the pose estimation for each person. We therefore...

chapter

Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences

Zhiyuan Shi, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4684 - 4693

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing RNN-based approaches for action recognition from depth sequences require either skeleton joints or hand-crafted depth features as inputs. An end-to-end manner, mapping from raw depth maps to action classes, is non-trivial to design due to the fact that: 1) single channel map lacks texture thus weakens the discriminative power, 2) relatively small set of depth training data. To address these...

chapter

Quality Aware Network for Set to Set Recognition

Yu Liu, Junjie Yan, Wanli Ouyang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4694 - 4703

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper targets on the problem of set to set recognition, which learns the metric between two image sets. Images in each set belong to the same identity. Since images in a set can be complementary, they hopefully lead to higher accuracy in practical applications. However, the quality of each sample cannot be guaranteed, and samples with poor quality will hurt the metric. In this paper, the quality...

chapter

Learning to Rank Retargeted Images

Yang Chen, Yong-Jin Liu, Yu-Kun Lai

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4743 - 4751

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Image retargeting techniques that adjust images into different sizes have attracted much attention recently. Objective quality assessment (OQA) of image retargeting results is often desired to automatically select the best results. Existing OQA methods output an absolute score for each retargeted image and use these scores to compare different results. Observing that it is challenging even for human...

chapter

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling

Yuanming Hu, Baoyuan Wang, Stephen Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 330 - 339

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Improvements in color constancy have arisen from the use of convolutional neural networks (CNNs). However, the patch-based CNNs that exist for this problem are faced with the issue of estimation ambiguity, where a patch may contain insufficient information to establish a unique or even a limited possible range of illumination colors. Image patches with estimation ambiguity not only appear with great...

chapter

Deeply Aggregated Alternating Minimization for Image Restoration

Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 284 - 292

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Regularization-based image restoration has remained an active research topic in image processing and computer vision. It often leverages a guidance signal captured in different fields as an additional cue. In this work, we present a general framework for image restoration, called deeply aggregated alternating minimization (DeepAM). We propose to train deep neural network to advance two of the steps...

chapter

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 257 - 265

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Non-uniform blind deblurring for general dynamic scenes is a challenging computer vision problem as blurs arise not only from multiple object motions but also from camera shake, scene depth variation. To remove these complicated motion blurs, conventional energy optimization based methods rely on simple assumptions such that blur kernel is partially uniform or locally linear. Moreover, recent machine...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene Parsing through ADE20K Dataset

A Reinforcement Learning Approach to the View Planning Problem

Zero-Shot Classification with Discriminative Semantic Representation Learning

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

The World of Fast Moving Objects

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Attentional Correlation Filter Network for Adaptive Visual Tracking

Deep Level Sets for Salient Object Detection

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Identifying First-Person Camera Wearers in Third-Person Videos

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

PoseTrack: Joint Multi-person Pose Estimation and Tracking

Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences

Quality Aware Network for Set to Set Recognition

Learning to Rank Retargeted Images

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling

Deeply Aggregated Alternating Minimization for Image Restoration

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)