2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Low-Rank-Sparse Subspace Representation for Robust Regression

Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2972 - 2981

Learning a robust regression model from high-dimensional corrupted data is an essential and difficult problem in many practical applications. The state-of-the-art methods have studied low-rank regression models that are robust against typical noises (like Gaussian noise and out-sample sparse noise) or outliers, such that a regression model can be learned from clean data lying on underlying subspaces...

chapter

BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition

Jacob Chan, Jimmy Addison Lee, Qian Kemao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3020 - 3028

This paper presents BIND (Binary Integrated Net Descriptor), a texture-less object detector that encodes multi-layered binary-represented nets for high precision edge-based description. Our proposed concept aligns layers of object-sized patches (nets) onto highly fragmented occlusion resistant line-segment midpoints (linelets) to encode regional information into efficient binary strings. These lightweight...

chapter

Accurate Depth and Normal Maps from Occlusion-Aware Focal Stack Symmetry

Michael Strecke, Anna Alperovich, Bastian Goldluecke

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2529 - 2537

We introduce a novel approach to jointly estimate consistent depth and normal maps from 4D light fields, with two main contributions. First, we build a cost volume from focal stack symmetry. However, in contrast to previous approaches, we introduce partial focal stacks in order to be able to robustly deal with occlusions. This idea already yields significantly better disparity maps. Second, even recent...

chapter

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning from Web Data

Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2915 - 2924

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition. However, annotating a massive dataset is expensive and time-consuming. Web images and their labels are, in comparison, much easier to obtain, but direct training on such automatically harvested images can lead to unsatisfactory performance, because the noisy labels of Web images adversely affect the...

chapter

A Unified Approach of Multi-scale Deep and Hand-Crafted Features for Defocus Estimation

Jinsun Park, Yu-Wing Tai, Donghyeon Cho, In So Kweon

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2760 - 2769

In this paper, we introduce robust and synergetic hand-crafted features and a simple but efficient deep feature from a convolutional neural network (CNN) architecture for defocus estimation. This paper systematically analyzes the effectiveness of different features, and shows how each feature can compensate for the weaknesses of other features when they are concatenated. For a full defocus map estimation,...

chapter

CNN-Based Patch Matching for Optical Flow with Thresholded Hinge Embedding Loss

Christian Bailer, Kiran Varanasi, Didier Stricker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2710 - 2719

Learning based approaches have not yet achieved their full potential in optical flow estimation, where their performance still trails heuristic approaches. In this paper, we present a CNN based patch matching approach for optical flow estimation. An important contribution of our approach is a novel thresholded loss for Siamese networks. We demonstrate that our loss performs clearly better than existing...

chapter

NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance

Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Pinies, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1446 - 1455

We propose a direct monocular SLAM algorithm based on the Normalised Information Distance (NID) metric. In contrast to current state-of-the-art direct methods based on photometric error minimisation, our information-theoretic NID metric provides robustness to appearance variation due to lighting, weather and structural changes in the scene. We demonstrate successful localisation and mapping across...

chapter

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Yinlin Hu, Yunsong Li, Rui Song

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4791 - 4799

The interpolation of correspondences (EpicFlow) has been widely used for optical flow estimation in recent work. It has the advantages of edge preservation and efficiency. However, it is vulnerable to input matching noise, which is inevitable in modern matching techniques. In this paper, we present a Robust Interpolation method of Correspondences (called RicFlow) to overcome the weakness. First, the...

chapter

Attentional Correlation Filter Network for Adaptive Visual Tracking

Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4828 - 4837

We propose a new tracking framework with an attentional mechanism that chooses a subset of the associated correlation filters for increased robustness and computational efficiency. The subset of filters is adaptively selected by a deep attentional network according to the dynamic properties of the tracking target. Our contributions are manifold, and are summarised as follows: (i) Introducing the Attentional...

chapter

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4598 - 4607

We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Unlike the previous depth-based discriminative or data-driven methods that require sophisticated training or manual intervention, we propose a generative framework that unifies pose tracking and face model adaptation on-the-fly. Particularly,...

chapter

Provable Self-Representation Based Outlier Detection in a Union of Subspaces

Chong You, Daniel P. Robinson, Rene Vidal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4323 - 4332

Many computer vision tasks involve processing large amounts of data contaminated by outliers, which need to be detected and rejected. While outlier detection methods based on robust statistics have existed for decades, only recently have methods based on sparse and low-rank representation been developed along with guarantees of correct outlier detection when the inliers lie in one or more low-dimensional...

chapter

Latent Multi-view Subspace Clustering

Changqing Zhang, Qinghua Hu, Huazhu Fu, Pengfei Zhu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4333 - 4341

In this paper, we propose a novel Latent Multi-view Subspace Clustering (LMSC) method, which clusters data points with latent representation and simultaneously explores underlying complementary information from multiple views. Unlike most existing single view subspace clustering methods that reconstruct data points using original features, our method seeks the underlying latent representation and...

chapter

Multi-object Tracking with Quadruplet Convolutional Neural Networks

Jeany Son, Mooyeol Baek, Minsu Cho, Bohyung Han

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3786 - 3795

We propose Quadruplet Convolutional Neural Networks (Quad-CNN) for multi-object tracking, which learn to associate object detections across frames using quadruplet losses. The proposed networks consider target appearances together with their temporal adjacencies for data association. Unlike conventional ranking losses, the quadruplet loss enforces an additional constraint that makes temporally adjacent...

chapter

Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection

Yuliang Liu, Lianwen Jin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3454 - 3461

Detecting incidental scene text is a challenging task because of multi-orientation, perspective distortion, and variation of text size, color and scale. Retrospective research has only focused on using rectangular bounding box or horizontal sliding window to localize text, which may result in redundant background noise, unnecessary overlap or even information loss. To address these issues, we propose...

chapter

Detecting Oriented Text in Natural Images by Linking Segments

Baoguang Shi, Xiang Bai, Serge Belongie

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3482 - 3490

Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose text into two locally detectable elements, namely segments and links. A segment is an oriented box covering a part of a word or text line. A link connects two adjacent...

chapter

Seeing into Darkness: Scotopic Visual Recognition

Bo Chen, Pietro Perona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7292 - 7301

Images are formed by counting how many photons traveling from a given set of directions hit an image sensor during a given time interval. When photons are few and far between, the concept of image breaks down and it is best to consider directly the flow of photons. Computer vision in this regime, which we call scotopic, is radically different from the classical image-based paradigm in that visual...

chapter

On the Effectiveness of Visible Watermarks

Tali Dekel, Michael Rubinstein, Ce Liu, William T. Freeman

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6864 - 6872

Visible watermarking is a widely-used technique for marking and protecting copyrights of many millions of images on the web, yet it suffers from an inherent security flaw: watermarks are typically added in a consistent manner to many images. We show that this consistency allows the watermark to be estimated automatically and the original images to be recovered with high accuracy. Specifically, we present...

chapter

Geometric Loss Functions for Camera Pose Regression with Deep Learning

Alex Kendall, Roberto Cipolla

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6555 - 6564

Deep learning has been shown to be effective for robust and real-time monocular image relocalisation. In particular, PoseNet [22] is a deep convolutional neural network which learns to regress the 6-DOF camera pose from a single image. It learns to localize using high level features and is robust to difficult lighting, motion blur and unknown camera intrinsics, where point based SIFT registration fails...

chapter

3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder

Gil Elbaz, Tamar Avraham, Anath Fischer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2472 - 2481

We present an algorithm for registration between a large-scale point cloud and a close-proximity scanned point cloud, providing a localization solution that is fully independent of prior information about the initial positions of the two point cloud coordinate systems. The algorithm, denoted LORAX, selects super-points (local subsets of points) and describes the geometric structure...

chapter

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach

Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2233 - 2241

We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted...
