2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Learning Non-maximum Suppression

Jan Hosang, Rodrigo Benenson, Bernt Schiele

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6469 - 6477

Object detectors have hugely profited from moving towards an end-to-end learning paradigm: proposals, features, and the classifier becoming one neural network improved results two-fold on general object detection. One indispensable component is non-maximum suppression (NMS), a post-processing algorithm responsible for merging all detections that belong to the same object. The de facto standard NMS...

chapter

Efficient Linear Programming for Dense CRFs

Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2934 - 2942

The fully connected conditional random field (CRF) with Gaussian pairwise potentials has proven popular and effective for multi-class semantic segmentation. While the energy of a dense CRF can be minimized accurately using a linear programming (LP) relaxation, the state-of-the-art algorithm is too slow to be useful in practice. To alleviate this deficiency, we introduce an efficient LP minimization...

chapter

Awesome Typography: Statistics-Based Text Effects Transfer

Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2886 - 2895

In this work, we explore the problem of generating fantastic special-effects for the typography. It is quite challenging due to the model diversities to illustrate varied text effects for different characters. To address this issue, our key idea is to exploit the analytics on the high regularity of the spatial distribution for text effects to guide the synthesis process. Specifically, we characterize...

chapter

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Bjorn Ommer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1923 - 1932

Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to lacking training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision information...

chapter

Self-Calibration-Based Approach to Critical Motion Sequences of Rolling-Shutter Structure from Motion

Eisuke Ito, Takayuki Okatani

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4512 - 4520

In this paper we consider critical motion sequences (CMSs) of rolling-shutter (RS) SfM. Employing an RS camera model with linearized pure rotation, we show that the RS distortion can be approximately expressed by two internal parameters of an imaginary camera plus one-parameter nonlinear transformation similar to lens distortion. We then reformulate the problem as self-calibration of the imaginary...

chapter

Multi-modal Mean-Fields via Cardinality-Based Clamping

Pierre Baque, Francois Fleuret, Pascal Fua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4303 - 4312

Mean Field inference is central to statistical physics. It has attracted much interest in the Computer Vision community to efficiently solve problems expressible in terms of large Conditional Random Fields. However, since it models the posterior probability distribution as a product of marginal probabilities, it may fail to properly account for important dependencies between variables. We therefore...

chapter

Fast Fourier Color Constancy

Jonathan T. Barron, Yun-Ta Tsai

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6950 - 6958

We present Fast Fourier Color Constancy (FFCC), a color constancy algorithm which solves illuminant estimation by reducing it to a spatial localization task on a torus. By operating in the frequency domain, FFCC produces lower error rates than the previous state-of-the-art by 13–20% while being 250–3000 times faster. This unconventional approach introduces challenges regarding aliasing,...

chapter

Low-Rank Bilinear Pooling for Fine-Grained Classification

Shu Kong, Charless Fowlkes

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7025 - 7034

Pooling second-order local feature statistics to form a high-dimensional bilinear feature has been shown to achieve state-of-the-art performance on a variety of fine-grained classification tasks. To address the computational demands of high feature dimensionality, we propose to represent the covariance features as a matrix and apply a low-rank bilinear classifier. The resulting classifier can be evaluated...

chapter

Knowledge Acquisition for Visual Question Answering via Iterative Querying

Yuke Zhu, Joseph J. Lim, Li Fei-Fei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6146 - 6155

Humans possess an extraordinary ability to learn new skills and new knowledge for problem solving. Such learning ability is also required by an automatic model to deal with arbitrary, open-ended questions in the visual world. We propose a neural-based approach to acquiring task-driven information for visual question answering (VQA). Our model proposes queries to actively acquire relevant information...

chapter

GMS: Grid-Based Motion Statistics for Fast, Ultra-Robust Feature Correspondence

Jiawang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2828 - 2837

Incorporating smoothness constraints into feature matching is known to enable ultra-robust matching. However, such formulations are both complex and slow, making them unsuitable for video applications. This paper proposes GMS (Grid-based Motion Statistics), a simple means of encapsulating motion smoothness as the statistical likelihood of a certain number of matches in a region. GMS enables translation...

chapter

MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features

Youssef Tamaazousti, Herve Le Borgne, Celine Hudelot

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5282 - 5291

In a transfer-learning scheme, the intermediate layers of a pre-trained CNN are employed as universal image representation to tackle many visual classification problems. The current trend to generate such representation is to learn a CNN on a large set of images labeled among the most specific categories. Such processes ignore potential relations between categories, as well as the categorical-levels...

chapter

Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5085 - 5093

Training convolutional networks (CNNs) that fit on a single GPU with minibatch stochastic gradient descent has become effective in practice. However, there is still no effective method for training large networks that do not fit in the memory of a few GPU cards, or for parallelizing CNN training. In this work we show that a simple hard mixture of experts model can be efficiently trained to good effect...

chapter

Teaching Compositionality to CNNs

Austin Stone, Huayan Wang, Michael Stark, Yi Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 732 - 741

Convolutional neural networks (CNNs) have shown great success in computer vision, approaching human-level performance when trained for specific tasks via application-specific loss functions. In this paper, we propose a method for augmenting and training CNNs so that their learned features are compositional. It encourages networks to form representations that disentangle objects from their surroundings...

chapter

Learning Discriminative and Transformation Covariant Local Feature Detectors

Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4923 - 4931

Robust covariant local feature detectors are important for detecting local features that are (1) discriminative of the image content and (2) can be repeatably detected at consistent locations when the image undergoes diverse transformations. Such detectors are critical for applications such as image search and scene reconstruction. Many learning-based local feature detectors address one of these two...

chapter

Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs

Martin Simonovsky, Nikos Komodakis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 29 - 38

A number of problems can be formulated as prediction on graph-structured data. In this work, we generalize the convolution operator from regular grids to arbitrary graphs while avoiding the spectral domain, which allows us to handle graphs of varying size and connectivity. To move beyond a simple diffusion, filter weights are conditioned on the specific edge labels in the neighborhood of a vertex...

chapter

Multigrid Neural Architectures

Tsung-Wei Ke, Michael Maire, Stella X. Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4067 - 4075

We propose a multigrid extension of convolutional neural networks (CNNs). Rather than manipulating representations living on a single spatial grid, our network layers operate across scale space, on a pyramid of grids. They consume multigrid inputs and produce multigrid outputs; convolutional filters themselves have both within-scale and cross-scale extent. This aspect is distinct from simple multiscale...

chapter

Improving Training of Deep Neural Networks via Singular Value Bounding

Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3994 - 4002

Deep learning methods have recently achieved great success on many computer vision problems. In spite of these practical successes, optimization of deep networks remains an active topic in deep learning research. In this work, we focus on investigation of the network solution properties that can potentially lead to good performance. Our research is inspired by theoretical and empirical results that use...

chapter

S3Pool: Pooling with Stochastic Spatial Sampling

Shuangfei Zhai, Hui Wu, Abhishek Kumar, Yu Cheng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4003 - 4011

Feature pooling layers (e.g., max pooling) in convolutional neural networks (CNNs) serve the dual purpose of providing increasingly abstract representations as well as yielding computational savings in subsequent convolutional layers. We view the pooling operation in CNNs as a two-step procedure: first, a pooling window (e.g., 2×2) slides over the feature map with stride one which leaves...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers causes low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

chapter

Adversarial Discriminative Domain Adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2962 - 2971

Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They can also improve recognition despite the presence of domain shift or dataset bias: recent adversarial approaches to unsupervised domain adaptation reduce the difference between the training and test domain distributions and thus improve generalization...
