2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Exclusivity-Consistency Regularized Multi-view Subspace Clustering

Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1 - 9

Multi-view subspace clustering aims to partition a set of multi-source data into their underlying groups. To boost the performance of multi-view clustering, numerous subspace learning algorithms have been developed in recent years, but they rarely exploit the representation complementarity between different views or the indicator consistency among the representations, let alone considering...

chapter

Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs

Martin Simonovsky, Nikos Komodakis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 29 - 38

A number of problems can be formulated as prediction on graph-structured data. In this work, we generalize the convolution operator from regular grids to arbitrary graphs while avoiding the spectral domain, which allows us to handle graphs of varying size and connectivity. To move beyond a simple diffusion, filter weights are conditioned on the specific edge labels in the neighborhood of a vertex...

chapter

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Guillermo Garcia-Hernando, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 407 - 415

A human action can be seen as transitions between one's body poses over time, where the transition depicts a temporal relation between two poses. Recognizing actions thus involves learning a classifier sensitive to these pose transitions as well as to static poses. In this paper, we introduce a novel method called transition forests, an ensemble of decision trees that both learn to discriminate static...

chapter

Model-Based Iterative Restoration for Binary Document Image Compression with Dictionary Learning

Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 606 - 615

The inherent noise in the observed (e.g., scanned) binary document image degrades the image quality and harms the compression ratio by breaking the pattern repetition and adding entropy to the document images. In this paper, we design a cost function in a Bayesian framework with dictionary learning. Minimizing our cost function produces a restored image which has better quality than that of the...

chapter

Teaching Compositionality to CNNs

Austin Stone, Huayan Wang, Michael Stark, Yi Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 732 - 741

Convolutional neural networks (CNNs) have shown great success in computer vision, approaching human-level performance when trained for specific tasks via application-specific loss functions. In this paper, we propose a method for augmenting and training CNNs so that their learned features are compositional. It encourages networks to form representations that disentangle objects from their surroundings...

chapter

Deep Visual-Semantic Quantization for Efficient Image Retrieval

Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 916 - 925

Compact coding has been widely applied to approximate nearest neighbor search for large-scale image retrieval, due to its computational efficiency and retrieval quality. This paper presents a compact coding solution with a focus on the deep learning-to-quantization approach, which improves retrieval quality by end-to-end representation learning and compact encoding and has already shown the superior...

chapter

LCR-Net: Localization-Classification-Regression for Human Pose

Gregory Rogez, Philippe Weinzaepfel, Cordelia Schmid

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1216 - 1224

We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D pose of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our architecture, named LCR-Net, contains...

chapter

Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment

Erik Wijmans, Yasutaka Furukawa

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1427 - 1435

This paper presents a novel algorithm that utilizes a 2D floorplan to align panorama RGBD scans. While effective panorama RGBD alignment techniques exist, such systems require extremely dense RGBD image sampling. Our approach can significantly reduce the number of necessary scans with the aid of a floorplan image. We formulate a novel Markov Random Field inference problem as a scan placement over...

chapter

Generalized Rank Pooling for Activity Recognition

Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1581 - 1590

Most popular deep models for action recognition split video sequences into short sub-sequences consisting of a few frames; frame-based features are then pooled for recognizing the activity. Usually, this pooling step discards the temporal order of the frames, which could otherwise be used for better recognition. To this end, we propose a novel pooling method, generalized rank pooling (GRP), that...

chapter

Turning an Urban Scene Video into a Cinemagraph

Hang Yan, Yebin Liu, Yasutaka Furukawa

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1629 - 1637

This paper proposes an algorithm that turns a regular video capturing urban scenes into a high-quality endless animation, known as a Cinemagraph. The creation of a Cinemagraph usually requires a static camera in a carefully configured scene. The task becomes challenging for a regular video with a moving camera and objects. Our approach first warps an input video into the viewpoint of a reference camera...

chapter

Deep Crisp Boundaries

Yupei Wang, Xin Zhao, Kaiqi Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1724 - 1732

Edge detection has made significant progress with the help of deep Convolutional Networks (ConvNets). ConvNet-based edge detectors have approached human-level performance on standard benchmarks. We provide a systematic study of these detector outputs, and show that they fail to accurately localize edges, which can be detrimental for tasks that require crisp edge inputs. In addition, we propose a novel...

chapter

Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network

Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1743 - 1751

One of the recent trends [31, 32, 14] in network architecture design is stacking small filters (e.g., 1x1 or 3x3) in the entire network because stacked small filters are more efficient than a large kernel, given the same computational complexity. However, in the field of semantic segmentation, where we need to perform dense per-pixel prediction, we find that the large kernel (and effective receptive...

chapter

Reflectance Adaptive Filtering Improves Intrinsic Image Estimation

Thomas Nestmeyer, Peter V. Gehler

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1771 - 1780

Separating an image into reflectance and shading layers poses a challenge for learning approaches because no large corpus of precise and realistic ground truth decompositions exists. The Intrinsic Images in the Wild (IIW) dataset provides a sparse set of relative human reflectance judgments, which serves as a standard benchmark for intrinsic images. A number of methods use IIW to learn statistical...

chapter

Building a Regular Decision Boundary with Deep Networks

Edouard Oyallon

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1886 - 1894

In this work, we build a generic architecture of Convolutional Neural Networks to discover empirical properties of neural networks. Our first contribution is to introduce a state-of-the-art framework that depends on few hyperparameters and to study the network when we vary them. It has no max pooling, no biases, only 13 layers, is purely convolutional and yields up to 95.4% and 79.6% accuracy respectively...

chapter

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Bjorn Ommer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1923 - 1932

Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to the lack of training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision information...

chapter

GMS: Grid-Based Motion Statistics for Fast, Ultra-Robust Feature Correspondence

Jiawang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2828 - 2837

Incorporating smoothness constraints into feature matching is known to enable ultra-robust matching. However, such formulations are both complex and slow, making them unsuitable for video applications. This paper proposes GMS (Grid-based Motion Statistics), a simple means of encapsulating motion smoothness as the statistical likelihood of a certain number of matches in a region. GMS enables translation...

chapter

Learning Diverse Image Colorization

Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2877 - 2885

Colorization is an ambiguous problem, with multiple viable colorizations for a single grey-level image. However, previous methods only produce the single most probable colorization. Our goal is to model the diversity intrinsic to the problem of colorization and produce multiple colorizations that display long-scale spatial co-ordination. We learn a low dimensional embedding of color fields using a...

chapter

Awesome Typography: Statistics-Based Text Effects Transfer

Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2886 - 2895

In this work, we explore the problem of generating fantastic special effects for typography. It is quite challenging because of the diversity of models needed to illustrate varied text effects for different characters. To address this issue, our key idea is to exploit the high regularity of the spatial distribution of text effects to guide the synthesis process. Specifically, we characterize...

chapter

Efficient Linear Programming for Dense CRFs

Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2934 - 2942

The fully connected conditional random field (CRF) with Gaussian pairwise potentials has proven popular and effective for multi-class semantic segmentation. While the energy of a dense CRF can be minimized accurately using a linear programming (LP) relaxation, the state-of-the-art algorithm is too slow to be useful in practice. To alleviate this deficiency, we introduce an efficient LP minimization...

chapter

Adversarial Discriminative Domain Adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2962 - 2971

Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They can also improve recognition despite the presence of domain shift or dataset bias: recent adversarial approaches to unsupervised domain adaptation reduce the difference between the training and test domain distributions and thus improve generalization...
