2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks

Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 95 - 104

Collecting well-annotated image datasets to train modern machine learning algorithms is prohibitively expensive for many tasks. One appealing alternative is rendering synthetic data where ground-truth annotations are generated automatically. Unfortunately, models trained purely on rendered images fail to generalize to real images. To address this shortcoming, prior work introduced unsupervised domain...

chapter

Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks

Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 455 - 464

Building robust classifiers trained on data susceptible to group or subject-specific variations is a challenging pattern recognition problem. We develop hierarchical Bayesian neural networks to capture subject-specific variations and share statistical strength across subjects. Leveraging recent work on learning Bayesian neural networks, we build fast, scalable algorithms for inferring the posterior...

chapter

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 761 - 770

Recently, DNN model compression based on network architecture design, e.g., SqueezeNet, has attracted a lot of attention. No accuracy drop on image classification is observed on these extremely compact networks, compared to well-known models. An emerging question, however, is whether these model compression techniques hurt a DNN's ability to learn tasks other than classifying images on a single dataset. Our preliminary...

chapter

Deep Visual-Semantic Quantization for Efficient Image Retrieval

Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 916 - 925

Compact coding has been widely applied to approximate nearest neighbor search for large-scale image retrieval, due to its computational efficiency and retrieval quality. This paper presents a compact coding solution with a focus on the deep learning to quantization approach, which improves retrieval quality by end-to-end representation learning and compact encoding and has already shown the superior...

chapter

Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation

Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 945 - 954

In domain adaptation, maximum mean discrepancy (MMD) has been widely adopted as a discrepancy metric between the distributions of source and target domains. However, existing MMD-based domain adaptation methods generally ignore the changes of class prior distributions, i.e., class weight bias across domains. This remains an open but ubiquitous problem in domain adaptation, which can be caused by...

chapter

Temporal Convolutional Networks for Action Segmentation and Detection

Colin Lea, Michael D. Flynn, Rene Vidal, Austin Reiter, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1003 - 1012

The ability to identify and temporally segment fine-grained human actions throughout a video is crucial for robotics, surveillance, education, and beyond. Typical approaches decouple this problem by first extracting local spatiotemporal features from video frames and then feeding them into a temporal classifier that captures high-level temporal patterns. We describe a class of temporal models, which...

chapter

Fully-Adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1131 - 1140

Multi-task learning aims to improve generalization performance of multiple prediction tasks by appropriately sharing relevant information across them. In the context of deep neural networks, this idea is often realized by hand-designed network architectures with layers that are shared across tasks and branches that encode task-specific features. However, the space of possible multi-task deep architectures...

chapter

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1377 - 1386

Person re-identification is an open and challenging problem in computer vision. Existing approaches have concentrated on either designing the best feature representation or learning optimal matching metrics in a static setting where the number of cameras in a network is fixed. Most approaches have neglected the dynamic and open world nature of the re-identification problem, where a new camera may...

chapter

Locality-Sensitive Deconvolution Networks with Gated Fusion for RGB-D Indoor Semantic Segmentation

Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1475 - 1483

This paper focuses on indoor semantic segmentation using RGB-D data. Although the commonly used deconvolution networks (DeconvNet) have achieved impressive results on this task, we find there is still room for improvement in two aspects. One is boundary segmentation. DeconvNet aggregates large context to predict the label of each pixel, inherently limiting the segmentation precision of...

chapter

Spatially Adaptive Computation Time for Residual Networks

Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1790 - 1799

This paper proposes a deep learning architecture, based on Residual Networks, that dynamically adjusts the number of executed layers for the regions of the image. This architecture is end-to-end trainable, deterministic and problem-agnostic. It is therefore applicable without any modifications to a wide range of computer vision problems such as image classification, object detection and image segmentation...

chapter

Improving Pairwise Ranking for Multi-label Image Classification

Yuncheng Li, Yale Song, Jiebo Luo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1837 - 1845

Learning to rank has recently emerged as an attractive technique to train deep convolutional neural networks for various computer vision tasks. Pairwise ranking, in particular, has been successful in multi-label image classification, achieving state-of-the-art results on various benchmarks. However, most existing approaches use the hinge loss to train their models, which is non-smooth and thus is...

chapter

Linking Image and Text with 2-Way Nets

Aviv Eisenschtat, Lior Wolf

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1855 - 1865

Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional...

chapter

Fast Video Classification via Adaptive Cascading of Deep Models

Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2197 - 2205

Recent advances have enabled oracle classifiers that can classify across many classes and input distributions with high accuracy without retraining. However, these classifiers are relatively heavyweight, so that applying them to classify video is costly. We show that day-to-day video exhibits highly skewed class distributions over the short term, and that these distributions can be classified by much...

chapter

Adaptive Class Preserving Representation for Image Classification

Jian-Xun Mi, Qiankun Fu, Weisheng Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2624 - 2632

In linear representation-based image classification, an unlabeled sample is represented by the entire training set. To obtain a stable and discriminative solution, regularization on the vector of representation coefficients is necessary. For example, the representation in sparse representation-based classification (SRC) uses an L1-norm penalty as regularization, which is equivalent to the lasso. However, lasso...

chapter

Adversarial Discriminative Domain Adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2962 - 2971

Adversarial learning methods are a promising approach to training robust deep networks, and can generate complex samples across diverse domains. They can also improve recognition despite the presence of domain shift or dataset bias: recent adversarial approaches to unsupervised domain adaptation reduce the difference between the training and test domain distributions and thus improve generalization...

chapter

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Yu-Xiong Wang, Deva Ramanan, Martial Hebert

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3029 - 3038

CNNs have made an undeniable impact on computer vision through the ability to learn high-capacity models with large annotated training sets. One of their remarkable properties is the ability to transfer knowledge from a large source dataset to a (typically smaller) target dataset. This is usually accomplished through fine-tuning a fixed-size network on new target data. Indeed, virtually every contemporary...

chapter

Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning

Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3242 - 3250

Attention-based neural encoder-decoder frameworks have been widely adopted for image captioning. Most methods force visual attention to be active for every generated word. However, the decoder likely requires little to no visual information from the image to predict non-visual words such as "the" and "of". Other words that may seem visual can often be predicted reliably just from the language model e...

chapter

Compact Matrix Factorization with Dependent Subspaces

Viktor Larsson, Carl Olsson

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4361 - 4370

Traditional matrix factorization methods approximate high dimensional data with a low dimensional subspace. This imposes constraints on the matrix elements which allow for estimation of missing entries. A lower rank provides stronger constraints and makes estimation of the missing entries less ambiguous at the cost of measurement fit. In this paper we propose a new factorization model that further...

chapter

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4598 - 4607

We consider the problem of depth-based robust 3D facial pose tracking under unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Unlike the previous depth-based discriminative or data-driven methods that require sophisticated training or manual intervention, we propose a generative framework that unifies pose tracking and face model adaptation on-the-fly. Particularly,...

chapter

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Yinlin Hu, Yunsong Li, Rui Song

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4791 - 4799

The interpolation of correspondences (EpicFlow) has been widely used for optical flow estimation in recent work. It has the advantages of edge preservation and efficiency. However, it is vulnerable to input matching noise, which is inevitable in modern matching techniques. In this paper, we present a Robust Interpolation method of Correspondences (called RicFlow) to overcome this weakness. First, the...
