2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning

Weifeng Ge, Yizhou Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 10 - 19

Deep neural networks require a large amount of labeled training data during supervised learning. However, collecting and labeling so much data might be infeasible in many cases. In this paper, we introduce a deep transfer learning scheme, called selective joint fine-tuning, for improving the performance of deep learning tasks with insufficient training data. In this scheme, a target learning task...

chapter

Universal Adversarial Perturbations

Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 86 - 94

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Given a state-of-the-art deep neural network classifier, we show the existence of a universal (image-agnostic) and very small perturbation vector that causes natural images to be misclassified with high probability. We propose a systematic algorithm for computing universal perturbations, and show that state-of-the-art deep neural networks are highly vulnerable to such perturbations, albeit being quasi-imperceptible...

chapter

Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 95 - 104

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Collecting well-annotated image datasets to train modern machine learning algorithms is prohibitively expensive for many tasks. One appealing alternative is rendering synthetic data where ground-truth annotations are generated automatically. Unfortunately, models trained purely on rendered images fail to generalize to real images. To address this shortcoming, prior work introduced unsupervised domain...

chapter

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 105 - 114

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. Recent...

chapter

Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation

Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 161 - 169

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of depth estimation from a single still image. Inspired by recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose...

chapter

Training Object Class Detectors with Click Supervision

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 180 - 189

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate...

chapter

3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions

Andy Zeng, Shuran Song, Matthias NieBner, Matthew Fisher, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 199 - 208

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Matching local geometric features on real-world depth images is a challenging task due to the noisy, low-resolution, and incomplete nature of 3D scan data. These difficulties limit the performance of current state-of-art methods, which are typically based on histograms over geometric properties. In this paper, we present 3DMatch, a data-driven model that learns a local volumetric patch descriptor...

chapter

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 218 - 227

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Camera relocalisation is an important problem in computer vision, with applications in simultaneous localisation and mapping, virtual/augmented reality and navigation. Common techniques either match the current image against keyframes with known poses coming from a tracker, or establish 2D-to-3D correspondences between keypoints in the current image and points in the scene in order to estimate the...

chapter

Deep Video Deblurring for Hand-Held Cameras

Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 237 - 246

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motion blur from camera shake is a major problem in videos captured by hand-held devices. Unlike single-image deblurring, video-based approaches can take advantage of the abundant information that exists across neighboring frames. As a result the best performing methods rely on the alignment of nearby frames. However, aligning images is a computationally expensive and fragile procedure, and methods...

chapter

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 257 - 265

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Non-uniform blind deblurring for general dynamic scenes is a challenging computer vision problem as blurs arise not only from multiple object motions but also from camera shake, scene depth variation. To remove these complicated motion blurs, conventional energy optimization based methods rely on simple assumptions such that blur kernel is partially uniform or locally linear. Moreover, recent machine...

chapter

Diversified Texture Synthesis with Feed-Forward Networks

Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 266 - 274

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent progresses on deep discriminative and generative modeling have shown promising results on texture synthesis. However, existing feed-forward based methods trade off generality for efficiency, which suffer from many issues, such as shortage of generality (i.e., build one network per texture), lack of diversity (i.e., always produce visually identical output) and suboptimality (i.e., generate...

chapter

End-to-End Instance Segmentation with Recurrent Attention

Mengye Ren, Richard S. Zemel

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 293 - 301

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

While convolutional neural networks have gained impressive success recently in solving structured prediction problems such as semantic segmentation, it remains a challenge to differentiate individual object instances in the scene. Instance segmentation is very important in a variety of applications, such as autonomous driving, image captioning, and visual question answering. Techniques that combine...

chapter

SRN: Side-Output Residual Network for Object Symmetry Detection in the Wild

Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 302 - 310

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we establish a baseline for object symmetry detection in complex backgrounds by presenting a new benchmark and an end-to-end deep learning approach, opening up a promising direction for symmetry detection in the wild. The new benchmark, named Sym-PASCAL, spans challenges including object diversity, multi-objects, part-invisibility, and various complex backgrounds that are far beyond...

chapter

Deep Image Matting

Ning Xu, Brian Price, Scott Cohen, Thomas Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 311 - 320

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Image matting is a fundamental computer vision problem and has many applications. Previous algorithms have poor performance when an image has similar foreground and background colors or complicated textures. The main reasons are prior methods 1) only use low-level features and 2) lack high-level context. In this paper, we propose a novel deep learning based algorithm that can tackle both these problems...

chapter

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling

Yuanming Hu, Baoyuan Wang, Stephen Lin

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 330 - 339

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Improvements in color constancy have arisen from the use of convolutional neural networks (CNNs). However, the patch-based CNNs that exist for this problem are faced with the issue of estimation ambiguity, where a patch may contain insufficient information to establish a unique or even a limited possible range of illumination colors. Image patches with estimation ambiguity not only appear with great...

chapter

Face Normals "In-the-Wild" Using Fully Convolutional Networks

George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 340 - 349

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work we pursue a data-driven approach to the problem of estimating surface normals from a single intensity image, focusing in particular on human faces. We introduce new methods to exploit the currently available facial databases for dataset construction and tailor a deep convolutional neural network to the task of estimating facial surface normals in-the-wild. We train a fully convolutional...

chapter

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 388 - 397

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation. We present a deep convolutional neural network (CNN) architecture to localize semantic parts in 2D image and 3D space while inferring their visibility states, given a single RGB image. Our key insight is to exploit domain knowledge to regularize the network by deeply...

chapter

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Guillermo Garcia-Hernando, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 407 - 415

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A human action can be seen as transitions between ones body poses over time, where the transition depicts a temporal relation between two poses. Recognizing actions thus involves learning a classifier sensitive to these pose transitions as well as to static poses. In this paper, we introduce a novel method called transitions forests, an ensemble of decision trees that both learn to discriminate static...

chapter

Detecting Masked Faces in the Wild with LLE-CNNs

Shiming Ge, Jia Li, Qiting Ye, Zhao Luo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 426 - 434

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Detecting faces with occlusions is a challenging task due to two main reasons: 1) the absence of large datasets of masked faces, and 2) the absence of facial cues from the masked regions. To address these two issues, this paper first introduces a dataset, denoted as MAFA, with 30, 811 Internet images and 35, 806 masked faces. Faces in the dataset have various orientations and occlusion degrees, while...

chapter

Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild

Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 474 - 483

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene text detection has attracted great attention these years. Text potentially exist in a wide variety of images or videos and play an important role in understanding the scene. In this paper, we present a novel text detection algorithm which is composed of two cascaded steps: (1) a multi-scale fully convolutional neural network (FCN) is proposed to extract text block regions, (2) a novel instance...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning

Universal Adversarial Perturbations

Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation

Training Object Class Detectors with Click Supervision

3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions

On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation

Deep Video Deblurring for Hand-Held Cameras

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Diversified Texture Synthesis with Feed-Forward Networks

End-to-End Instance Segmentation with Recurrent Attention

SRN: Side-Output Residual Network for Object Symmetry Detection in the Wild

Deep Image Matting

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling

Face Normals "In-the-Wild" Using Fully Convolutional Networks

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Detecting Masked Faces in the Wild with LLE-CNNs

Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)