2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning

Weifeng Ge, Yizhou Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 10 - 19

Deep neural networks require a large amount of labeled training data during supervised learning. However, collecting and labeling so much data might be infeasible in many cases. In this paper, we introduce a deep transfer learning scheme, called selective joint fine-tuning, for improving the performance of deep learning tasks with insufficient training data. In this scheme, a target learning task...

chapter

Deep Video Deblurring for Hand-Held Cameras

Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 237 - 246

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motion blur from camera shake is a major problem in videos captured by hand-held devices. Unlike single-image deblurring, video-based approaches can take advantage of the abundant information that exists across neighboring frames. As a result the best performing methods rely on the alignment of nearby frames. However, aligning images is a computationally expensive and fragile procedure, and methods...

chapter

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 257 - 265

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Non-uniform blind deblurring for general dynamic scenes is a challenging computer vision problem as blurs arise not only from multiple object motions but also from camera shake, scene depth variation. To remove these complicated motion blurs, conventional energy optimization based methods rely on simple assumptions such that blur kernel is partially uniform or locally linear. Moreover, recent machine...

chapter

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks

Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 416 - 425

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene flow describes the motion of 3D objects in real world and potentially could be the basis of a good feature for 3D action recognition. However, its use for action recognition, especially in the context of convolutional neural networks (ConvNets), has not been previously studied. In this paper, we propose the extraction and use of scene flow for action recognition from RGB-D data. Previous works...

chapter

Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes

S. Alireza Golestaneh, Lina J. Karam

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 596 - 605

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The detection of spatially-varying blur without having any information about the blur type is a challenging task. In this paper, we propose a novel effective approach to address this blur detection problem from a single image without requiring any knowledge about the blur type, level, or camera settings. Our approach computes blur detection maps based on a novel High-frequency multiscale Fusion and...

chapter

Additive Component Analysis

Calvin Murdock, Fernando De la Torre

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 673 - 681

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Principal component analysis (PCA) is one of the most versatile tools for unsupervised learning with applications ranging from dimensionality reduction to exploratory data analysis and visualization. While much effort has been devoted to encouraging meaningful representations through regularization (e.g. non-negativity or sparsity), underlying linearity assumptions can limit their effectiveness. To...

chapter

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 761 - 770

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, DNN model compression based on network architecture design, e.g., SqueezeNet, attracted a lot attention. No accuracy drop on image classification is observed on these extremely compact networks, compared to well-known models. An emerging question, however, is whether these model compression techniques hurt DNNs learning ability other than classifying images on a single dataset. Our preliminary...

chapter

Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation

Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 945 - 954

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In domain adaptation, maximum mean discrepancy (MMD) has been widely adopted as a discrepancy metric between the distributions of source and target domains. However, existing MMD-based domain adaptation methods generally ignore the changes of class prior distributions, i.e., class weight bias across domains. This remains an open problem but ubiquitous for domain adaptation, which can be caused by...

chapter

Context-Aware Correlation Filter Tracking

Matthias Mueller, Neil Smith, Bernard Ghanem

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1387 - 1395

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Correlation filter (CF) based trackers have recently gained a lot of popularity due to their impressive performance on benchmark datasets, while maintaining high frame rates. A significant amount of recent research focuses on the incorporation of stronger features for a richer representation of the tracking target. However, this only helps to discriminate the target from background within a small...

chapter

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1417 - 1426

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Temporal action localization is an important yet challenging problem. Given a long, untrimmed video consisting of multiple action instances and complex background contents, we need not only to recognize their action categories, but also to localize the start time and end time of each instance. Many state-of-the-art systems use segment-level classifiers to select and rank proposal segments of pre-determined...

chapter

Light Field Reconstruction Using Deep Convolutional Network on EPI

Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1638 - 1646

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we take advantage of the clear texture structure of the epipolar plane image (EPI) in the light field data and model the problem of light field reconstruction from a sparse set of views as a CNN-based angular detail restoration on EPI. We indicate that one of the main challenges in sparsely sampled light field reconstruction is the information asymmetry between the spatial and angular...

chapter

Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network

Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1743 - 1751

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

One of recent trends [31, 32, 14] in network architecture design is stacking small filters (e.g., 1x1 or 3x3) in the entire network because the stacked small filters is more efficient than a large kernel, given the same computational complexity. However, in the field of semantic segmentation, where we need to perform dense per-pixel prediction, we find that the large kernel (and effective receptive...

chapter

Conditional Similarity Networks

Andreas Veit, Serge Belongie, Theofanis Karaletsos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1781 - 1789

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

What makes images similar? To measure the similarity between images, they are typically embedded in a feature-vector space, in which their distance preserve the relative dissimilarity. However, when learning such similarity embeddings the simplifying assumption is commonly made that images are only compared to one unique measure of similarity. A main reason for this is that contradicting notions of...

chapter

More is Less: A More Complicated Network with Less Inference Complexity

Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1895 - 1903

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we present a novel and general network structure towards accelerating the inference process of convolutional neural networks, which is more complicated in network structure yet with less inference complexity. The core idea is to equip each original convolutional layer with another low-cost collaborative layer (LCCL), and the element-wise multiplication of the ReLU outputs of these two...

chapter

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2027 - 2036

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to model the underlying spatial relations between labels in multi-label images, because spatial annotations of the labels are generally not provided. In this paper, we...

chapter

Video Frame Interpolation via Adaptive Convolution

Simon Niklaus, Long Mai, Feng Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2270 - 2279

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Video frame interpolation typically involves two steps: motion estimation and pixel synthesis. Such a two-step approach heavily depends on the quality of motion estimation. This paper presents a robust video frame interpolation method that combines these two steps into a single process. Specifically, our method considers pixel synthesis for the interpolated frame as local convolution over two input...

chapter

Unrolling the Shutter: CNN to Correct Motion Distortions

Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2345 - 2353

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Row-wise exposure delay present in CMOS cameras is responsible for skew and curvature distortions known as the rolling shutter (RS) effect while imaging under camera motion. Existing RS correction methods resort to using multiple images or tailor scene-specific correction schemes. We propose a convolutional neural network (CNN) architecture that automatically learns essential scene features from a...

chapter

Light Field Blind Motion Deblurring

Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2354 - 2362

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We study the problem of deblurring light fields of general 3D scenes captured under 3D camera motion and present both theoretical and practical contributions. By analyzing the motion-blurred light field in the primal and Fourier domains, we develop intuition into the effects of camera motion on the light field, show the advantages of capturing a 4D light field instead of a conventional 2D image for...

chapter

Group-Wise Point-Set Registration Based on Rényi's Second Order Entropy

Luis G. Sanchez Giraldo, Erion Hasanbelliu, Murali Rao, Jose C. Principe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2454 - 2462

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we describe a set of robust algorithms for group-wise registration using both rigid and non-rigid transformations of multiple unlabelled point-sets with no bias toward a given set. These methods mitigate the need to establish a correspondence among the point-sets by representing them as probability density functions where the registration is treated as a multiple distribution alignment...

chapter

An Efficient Background Term for 3D Reconstruction and Tracking with Smooth Surface Models

Mariano Jaimez, Thomas J. Cashman, Andrew Fitzgibbon, Javier Gonzalez-Jimenez, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2575 - 2583

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a novel strategy to shrink and constrain a 3D model, represented as a smooth spline-like surface, within the visual hull of an object observed from one or multiple views. This new background or silhouette term combines the efficiency of previous approaches based on an image-plane distance transform with the accuracy of formulations based on raycasting or ray potentials. The overall formulation...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning

Deep Video Deblurring for Hand-Held Cameras

Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks

Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes

Additive Component Analysis

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation

Context-Aware Correlation Filter Tracking

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

Light Field Reconstruction Using Deep Convolutional Network on EPI

Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network

Conditional Similarity Networks

More is Less: A More Complicated Network with Less Inference Complexity

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Video Frame Interpolation via Adaptive Convolution

Unrolling the Shutter: CNN to Correct Motion Distortions

Light Field Blind Motion Deblurring

Group-Wise Point-Set Registration Based on Rényi's Second Order Entropy

An Efficient Background Term for 3D Reconstruction and Tracking with Smooth Surface Models

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)