2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Convolutional Neural Network Architecture for Geometric Matching

Ignacio Rocco, Relja Arandjelovic, Josef Sivic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 39 - 48

We address the problem of determining correspondences between two images in agreement with a geometric model such as an affine or thin-plate spline transformation, and estimating its parameters. The contributions of this work are three-fold. First, we propose a convolutional neural network architecture for geometric matching. The architecture is based on three main components that mimic the standard...

chapter

Universal Adversarial Perturbations

Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 86 - 94

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Given a state-of-the-art deep neural network classifier, we show the existence of a universal (image-agnostic) and very small perturbation vector that causes natural images to be misclassified with high probability. We propose a systematic algorithm for computing universal perturbations, and show that state-of-the-art deep neural networks are highly vulnerable to such perturbations, albeit being quasi-imperceptible...

chapter

Designing Effective Inter-Pixel Information Flow for Natural Image Matting

Yagiz Aksoy, Tunc Ozan Aydin, Marc Pollefeys

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 228 - 236

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a novel, purely affinity-based natural image matting algorithm. Our method relies on carefully defined pixel-to-pixel connections that enable effective use of information available in the image and the trimap. We control the information flow from the known-opacity regions into the unknown region, as well as within the unknown region itself, by utilizing multiple definitions of pixel affinities...

chapter

BranchOut: Regularization for Online Ensemble Tracking with Convolutional Neural Networks

Bohyung Han, Jack Sim, Hartwig Adam

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 521 - 530

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose an extremely simple but effective regularization technique of convolutional neural networks (CNNs), referred to as BranchOut, for online ensemble tracking. Our algorithm employs a CNN for target representation, which has a common convolutional layers but has multiple branches of fully connected layers. For better regularization, a subset of branches in the CNN are selected randomly for...

chapter

Zero-Shot Action Recognition with Error-Correcting Output Codes

Jie Qin, Li Liu, Ling Shao, Fumin Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1042 - 1051

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, zero-shot action recognition (ZSAR) has emerged with the explosive growth of action categories. In this paper, we explore ZSAR from a novel perspective by adopting the Error-Correcting Output Codes (dubbed ZSECOC). Our ZSECOC equips the conventional ECOC with the additional capability of ZSAR, by addressing the domain shift problem. In particular, we learn discriminative ZSECOC for seen...

chapter

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1349 - 1358

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a novel tracker which is controlled by sequentially pursuing actions learned by deep reinforcement learning. In contrast to the existing trackers using deep networks, the proposed tracker is designed to achieve a light computation as well as satisfactory tracking accuracy in both location and scale. The deep network to control actions is pre-trained using various training sequences...

chapter

Context-Aware Correlation Filter Tracking

Matthias Mueller, Neil Smith, Bernard Ghanem

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1387 - 1395

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Correlation filter (CF) based trackers have recently gained a lot of popularity due to their impressive performance on benchmark datasets, while maintaining high frame rates. A significant amount of recent research focuses on the incorporation of stronger features for a richer representation of the tracking target. However, this only helps to discriminate the target from background within a small...

chapter

End-to-End Training of Hybrid CNN-CRF Models for Stereo

Patrick Knobelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1456 - 1465

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel and principled hybrid CNN+CRF model for stereo estimation. Our model allows to exploit the advantages of both, convolutional neural networks (CNNs) and conditional random fields (CRFs) in an unified approach. The CNNs compute expressive features for matching and distinctive color edges, which in turn are used to compute the unary and binary costs of the CRF. For inference, we apply...

chapter

Deep Representation Learning for Human Motion Prediction and Classification

Judith Butepage, Michael J. Black, Danica Kragic, Hedvig Kjellstrom

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1591 - 1599

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Generative models of 3D human motion are often restricted to a small number of activities and can therefore not generalize well to novel movements or applications. In this work we propose a deep learning framework for human motion capture data that learns a generic representation from a large corpus of motion capture data and generalizes well to new, unseen, motions. Using an encoding-decoding network...

chapter

Xception: Deep Learning with Depthwise Separable Convolutions

Francois Chollet

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1800 - 1807

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present an interpretation of Inception modules in convolutional neural networks as being an intermediate step in-between regular convolution and the depthwise separable convolution operation (a depthwise convolution followed by a pointwise convolution). In this light, a depthwise separable convolution can be understood as an Inception module with a maximally large number of towers. This observation...

chapter

Linking Image and Text with 2-Way Nets

Aviv Eisenschtat, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1855 - 1865

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional...

chapter

Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces

Lluis Gomez, Yash Patel, Marcal Rusinol, Dimosthenis Karatzas, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2017 - 2026

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

End-to-end training from scratch of current deep architectures for new computer vision problems would require Imagenet-scale datasets, and this is not always possible. In this paper we present a method that is able to take advantage of freely available multi-modal content to train computer vision algorithms without human supervision. We put forward the idea of performing self-supervised learning of...

chapter

Interpretable Structure-Evolving LSTM

Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2175 - 2184

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper develops a general framework for learning interpretable data representation via Long Short-Term Memory (LSTM) recurrent neural networks over hierarchal graph structures. Instead of learning LSTM models over the pre-fixed structures, we propose to further learn the intermediate interpretable multi-level graph structures in a progressive and stochastic way from data during the LSTM network...

chapter

Photorealistic Facial Texture Inference Using Deep Neural Networks

Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2326 - 2335

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a data-driven inference method that can synthesize a photorealistic texture map of a complete 3D face model given a partial 2D view of a person in the wild. After an initial estimation of shape and low-frequency albedo, we compute a high-frequency partial texture map, without the shading component, of the visible face area. To extract the fine appearance details from this incomplete input,...

chapter

Adaptive Class Preserving Representation for Image Classification

Jian-Xun Mi, Qiankun Fu, Weisheng Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2624 - 2632

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In linear representation-based image classification, an unlabeled sample is represented by the entire training set. To obtain a stable and discriminative solution, regularization on the vector of representation coefficients is necessary. For example, the representation in sparse representation-based classification (SRC) uses L1 norm penalty as regularization, which is equal to lasso. However, lasso...

chapter

A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors

Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2818 - 2827

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Rain streaks removal is an important issue of the outdoor vision system and has been recently investigated extensively. In this paper, we propose a novel tensor based video rain streaks removal approach by fully considering the discriminatively intrinsic characteristics of rain streaks and clean videos, which needs neither rain detection nor time-consuming dictionary learning stage. In specific, on...

chapter

Awesome Typography: Statistics-Based Text Effects Transfer

Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2886 - 2895

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we explore the problem of generating fantastic special-effects for the typography. It is quite challenging due to the model diversities to illustrate varied text effects for different characters. To address this issue, our key idea is to exploit the analytics on the high regularity of the spatial distribution for text effects to guide the synthesis process. Specifically, we characterize...

chapter

Low-Rank-Sparse Subspace Representation for Robust Regression

Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2972 - 2981

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Learning robust regression model from high-dimensional corrupted data is an essential and difficult problem in many practical applications. The state-of-the-art methods have studied low-rank regression models that are robust against typical noises (like Gaussian noise and out-sample sparse noise) or outliers, such that a regression model can be learned from clean data lying on underlying subspaces...

chapter

Multi-task Clustering of Human Actions by Sharing Information

Xiaoqiang Yan, Shizhe Hu, Yangdong Ye

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4049 - 4057

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Sharing information between multiple tasks can enhance the accuracy of human action recognition systems. However, using shared information to improve multi-task human action clustering has never been considered before, and cannot be achieved using existing clustering methods. In this work, we present a novel and effective Multi-Task Information Bottleneck (MTIB) clustering method, which is capable...

chapter

Semantic Regularisation for Recurrent Image Annotation

Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4160 - 4168

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The CNN-RNN design pattern is increasingly widely applied in a variety of image annotation tasks including multi-label classification and captioning. Existing models use the weakly semantic CNN hidden layer or its transform as the image embedding that provides the interface between the CNN and RNN. This leaves the RNN overstretched with two jobs: predicting the visual concepts and modelling their...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Convolutional Neural Network Architecture for Geometric Matching

Universal Adversarial Perturbations

Designing Effective Inter-Pixel Information Flow for Natural Image Matting

BranchOut: Regularization for Online Ensemble Tracking with Convolutional Neural Networks

Zero-Shot Action Recognition with Error-Correcting Output Codes

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Context-Aware Correlation Filter Tracking

End-to-End Training of Hybrid CNN-CRF Models for Stereo

Deep Representation Learning for Human Motion Prediction and Classification

Xception: Deep Learning with Depthwise Separable Convolutions

Linking Image and Text with 2-Way Nets

Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces

Interpretable Structure-Evolving LSTM

Photorealistic Facial Texture Inference Using Deep Neural Networks

Adaptive Class Preserving Representation for Image Classification

A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors

Awesome Typography: Statistics-Based Text Effects Transfer

Low-Rank-Sparse Subspace Representation for Robust Regression

Multi-task Clustering of Human Actions by Sharing Information

Semantic Regularisation for Recurrent Image Annotation

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)