2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

Jose Lezama, Qiang Qiu, Guillermo Sapiro

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6807 - 6816

Surveillance cameras today often capture NIR (near infrared) images in low-light environments. However, most face datasets accessible for training and verification are only collected in the VIS (visible light) spectrum. It remains a challenging problem to match NIR to VIS face images due to the different light spectrum. Recently, breakthroughs have been made for VIS face recognition by applying deep...

chapter

Deep Cross-Modal Hashing

Qing-Yuan Jiang, Wu-Jun Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3270 - 3278

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Due to its low storage cost and fast query speed, cross-modal hashing (CMH) has been widely used for similarity search in multimedia retrieval applications. However, most existing CMH methods are based on hand-crafted features which might not be optimally compatible with the hash-code learning procedure. As a result, existing CMH methods with hand-crafted features may not achieve satisfactory performance...

chapter

Unsupervised Video Summarization with Adversarial LSTM Networks

Behrooz Mahasseni, Michael Lam, Sinisa Todorovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2982 - 2991

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of unsupervised video summarization, formulated as selecting a sparse subset of video frames that optimally represent the input video. Our key idea is to learn a deep summarizer network to minimize distance between training videos and a distribution of their summarizations, in an unsupervised way. Such a summarizer can then be applied on a new video for estimating...

chapter

Deep TEN: Texture Encoding Network

Hang Zhang, Jia Xue, Kristin Dana

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2896 - 2905

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a Deep Texture Encoding Network (Deep-TEN) with a novel Encoding Layer integrated on top of convolutional layers, which ports the entire dictionary learning and encoding pipeline into a single model. Current methods build from distinct components, using standard encoders with separate off-the-shelf features such as SIFT descriptors or pre-trained CNN features for material recognition. Our...

chapter

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning from Web Data

Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2915 - 2924

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Large-scale datasets have driven the rapid development of deep neural networks for visual recognition. However, annotating a massive dataset is expensive and time-consuming. Web images and their labels are, in comparison, much easier to obtain, but direct training on such automatially harvested images can lead to unsatisfactory performance, because the noisy labels of Web images adversely affect the...

chapter

A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors

Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2818 - 2827

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Rain streaks removal is an important issue of the outdoor vision system and has been recently investigated extensively. In this paper, we propose a novel tensor based video rain streaks removal approach by fully considering the discriminatively intrinsic characteristics of rain streaks and clean videos, which needs neither rain detection nor time-consuming dictionary learning stage. In specific, on...

chapter

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs

Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodola, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5425 - 5434

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning has achieved a remarkable performance breakthrough in several fields, most notably in speech recognition, natural language processing, and computer vision. In particular, convolutional neural network (CNN) architectures currently produce state-of-the-art performance on a variety of image analysis tasks such as object detection and recognition. Most of deep learning research has so far...

chapter

Attentional Correlation Filter Network for Adaptive Visual Tracking

Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4828 - 4837

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new tracking framework with an attentional mechanism that chooses a subset of the associated correlation filters for increased robustness and computational efficiency. The subset of filters is adaptively selected by a deep attentional network according to the dynamic properties of the tracking target. Our contributions are manifold, and are summarised as follows: (i) Introducing the Attentional...

chapter

Multi-object Tracking with Quadruplet Convolutional Neural Networks

Jeany Son, Mooyeol Baek, Minsu Cho, Bohyung Han

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3786 - 3795

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose Quadruplet Convolutional Neural Networks (Quad-CNN) for multi-object tracking, which learn to associate object detections across frames using quadruplet losses. The proposed networks consider target appearances together with their temporal adjacencies for data association. Unlike conventional ranking losses, the quadruplet loss enforces an additional constraint that makes temporally adjacent...

chapter

ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases

Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3462 - 3471

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The chest X-ray is one of the most commonly accessible radiological examinations for screening and diagnosis of many lung diseases. A tremendous number of X-ray imaging studies accompanied by radiological reports are accumulated and stored in many modern hospitals Picture Archiving and Communication Systems (PACS). On the other side, it is still an open question how this type of hospital-size knowledge...

chapter

Joint Detection and Identification Feature Learning for Person Search

Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3376 - 3385

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing person re-identification benchmarks and methods mainly focus on matching cropped pedestrian images between queries and candidates. However, it is different from real-world scenarios where the annotations of pedestrian bounding boxes are unavailable and the target person needs to be searched from a gallery of whole scene images. To close the gap, we propose a new deep learning framework for...

chapter

Geometric Loss Functions for Camera Pose Regression with Deep Learning

Alex Kendall, Roberto Cipolla

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6555 - 6564

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning has shown to be effective for robust and real-time monocular image relocalisation. In particular, PoseNet [22] is a deep convolutional neural network which learns to regress the 6-DOF camera pose from a single image. It learns to localize using high level features and is robust to difficult lighting, motion blur and unknown camera intrinsics, where point based SIFT registration fails...

chapter

Predicting Salient Face in Multiple-Face Videos

Yufan Liu, Songyang Zhang, Mai Xu, Xuming He

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3224 - 3232

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Although the recent success of convolutional neural network (CNN) advances state-of-the-art saliency prediction in static images, few work has addressed the problem of predicting attention in videos. On the other hand, we find that the attention of different subjects consistently focuses on a single face in each frame of videos involving multiple faces. Therefore, we propose in this paper a novel...

chapter

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

Yevhen Kuznietsov, Jorg Stuckler, Bastian Leibe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2215 - 2223

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Supervised deep learning often suffers from the lack of sufficient training data. Specifically in the context of monocular depth map prediction, it is barely possible to determine dense ground truth depth images in realistic dynamic outdoor environments. When using LiDAR sensors, for instance, noise is present in the distance measurements, the calibration between sensors cannot be perfect, and the...

chapter

Inverse Compositional Spatial Transformer Networks

Chen-Hsuan Lin, Simon Lucey

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2252 - 2260

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we establish a theoretical connection between the classical Lucas & Kanade (LK) algorithm and the emerging topic of Spatial Transformer Networks (STNs). STNs are of interest to the vision and learning communities due to their natural ability to combine alignment and classification within the same theoretical framework. Inspired by the Inverse Compositional (IC) variant of the LK...

chapter

Procedural Generation of Videos to Train Deep Action Recognition Networks

Cesar Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel Lopez

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2594 - 2604

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning for human action recognition in videos is making significant progress, but is slowed down by its dependency on expensive manual labeling of large video collections. In this work, we investigate the generation of synthetic training data for action recognition, as it has recently shown promising results for a variety of other computer vision tasks. We propose an interpretable parametric...

chapter

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Shan Li, Weihong Deng, JunPing Du

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2584 - 2593

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Past research on facial expressions have used relatively limited datasets, which makes it unclear whether current methods can be employed in real world. In this paper, we present a novel database, RAF-DB, which contains about 30000 facial images from thousands of individuals. Each image has been individually labeled about 40 times, then EM algorithm was used to filter out unreliable labels. Crowdsourcing...

chapter

Deep Hashing Network for Unsupervised Domain Adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5385 - 5394

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, deep neural networks have emerged as a dominant machine learning tool for a wide variety of application domains. However, training a deep neural network requires a large amount of labeled data, which is an expensive process in terms of time, labor and human expertise. Domain adaptation or transfer learning algorithms address this challenge by leveraging labeled data in a different,...

chapter

Deep Learning with Low Precision by Half-Wave Gaussian Quantization

Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5406 - 5414

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The problem of quantizing the activations of a deep neural network is considered. An examination of the popular binary quantization approach shows that this consists of approximating a classical non-linearity, the hyperbolic tangent, by two functions: a piecewise constant sign function, which is used in feedforward network computations, and a piecewise linear hard tanh function, used in the backpropagation...

chapter

Point to Set Similarity Based Deep Feature Learning for Person Re-Identification

Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5028 - 5037

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Person re-identification (Re-ID) remains a challenging problem due to significant appearance changes caused by variations in view angle, background clutter, illumination condition and mutual occlusion. To address these issues, conventional methods usually focus on proposing robust feature representation or learning metric transformation based on pairwise similarity, using Fisher-type criterion. The...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

Deep Cross-Modal Hashing

Unsupervised Video Summarization with Adversarial LSTM Networks

Deep TEN: Texture Encoding Network

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning from Web Data

A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs

Attentional Correlation Filter Network for Adaptive Visual Tracking

Multi-object Tracking with Quadruplet Convolutional Neural Networks

ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases

Joint Detection and Identification Feature Learning for Person Search

Geometric Loss Functions for Camera Pose Regression with Deep Learning

Predicting Salient Face in Multiple-Face Videos

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

Inverse Compositional Spatial Transformer Networks

Procedural Generation of Videos to Train Deep Action Recognition Networks

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Deep Hashing Network for Unsupervised Domain Adaptation

Deep Learning with Low Precision by Half-Wave Gaussian Quantization

Point to Set Similarity Based Deep Feature Learning for Person Re-Identification

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)