2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Exclusivity-Consistency Regularized Multi-view Subspace Clustering

Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1 - 9

Multi-view subspace clustering aims to partition a set of multi-source data into their underlying groups. To boost the performance of multi-view clustering, numerous subspace learning algorithms have been developed in recent years, but with rare exploitation of the representation complementarity between different views as well as the indicator consistency among the representations, let alone considering...

chapter

The More You Know: Using Knowledge Graphs for Image Classification

Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 20 - 28

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

One characteristic that sets humans apart from modern learning-based computer vision algorithms is the ability to acquire knowledge about the world and use that knowledge to reason about the visual world. Humans can learn about the characteristics of objects and the relationships that occur between them to learn a large variety of visual concepts, often with few examples. This paper investigates the...

chapter

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Guillermo Garcia-Hernando, Tae-Kyun Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 407 - 415

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A human action can be seen as transitions between ones body poses over time, where the transition depicts a temporal relation between two poses. Recognizing actions thus involves learning a classifier sensitive to these pose transitions as well as to static poses. In this paper, we introduce a novel method called transitions forests, an ensemble of decision trees that both learn to discriminate static...

chapter

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Junwu Weng, Chaoqun Weng, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 445 - 454

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Motivated by previous success of using non-parametric methods to recognize objects, e.g., NBNN [2], we extend it to recognize actions using skeletons. Each 3D action is presented by a sequence of 3D poses. Similar to NBNN, our proposed Spatio-Temporal-NBNN applies stage-to-class distance to classify actions. However, ST-NBNN takes the spatio-temporal structure of 3D actions into consideration and...

chapter

Binary Constraint Preserving Graph Matching

Bo Jiang, Jin Tang, Chris Ding, Bin Luo

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 550 - 557

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Graph matching is a fundamental problem in computer vision and pattern recognition area. In general, it can be formulated as an Integer Quadratic Programming (IQP) problem. Since it is NP-hard, approximate relaxations are required. In this paper, a new graph matching method has been proposed. There are three main contributions of the proposed method: (1) we propose a new graph matching relaxation...

chapter

Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting

Mariano Tepper, Guillermo Sapiro

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 655 - 663

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce a highly efficient algorithm to address the nonnegative matrix underapproximation (NMU) problem, i.e., nonnegative matrix factorization (NMF) with an additional underapproximation constraint. NMU results are interesting as, compared to traditional NMF, they present additional sparsity and part-based behavior, explaining unique data features. To show these features in practice,...

chapter

AMVH: Asymmetric Multi-Valued hashing

Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 898 - 906

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most existing hashing methods resort to binary codes for similarity search, owing to the high efficiency of computation and storage. However, binary codes lack enough capability in similarity preservation, resulting in less desirable performance. To address this issue, we propose an asymmetric multi-valued hashing method supported by two different non-binary embeddings. (1) A real-valued embedding...

chapter

Zero-Shot Action Recognition with Error-Correcting Output Codes

Jie Qin, Li Liu, Ling Shao, Fumin Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1042 - 1051

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recently, zero-shot action recognition (ZSAR) has emerged with the explosive growth of action categories. In this paper, we explore ZSAR from a novel perspective by adopting the Error-Correcting Output Codes (dubbed ZSECOC). Our ZSECOC equips the conventional ECOC with the additional capability of ZSAR, by addressing the domain shift problem. In particular, we learn discriminative ZSECOC for seen...

chapter

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2037 - 2046

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...

chapter

ER3: A Unified Framework for Event Retrieval, Recognition and Recounting

Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2107 - 2116

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We develop a unified framework for complex event retrieval, recognition and recounting. The framework is based on a compact video representation that exploits the temporal correlations in image features. Our feature alignment procedure identifies and removes the feature redundancies across frames and outputs an intermediate tensor representation we call video imprint. The video imprint is then fed...

chapter

Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos

Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3205 - 3214

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce Spatio-Temporal Vector of Locally Max Pooled Features (ST-VLMPF), a super vector-based encoding method specifically designed for local deep features encoding. The proposed method addresses an important problem of video understanding: how to build a video representation that incorporates the CNN features over the entire video. Feature assignment is carried out at two levels, by using the...

chapter

Predicting Salient Face in Multiple-Face Videos

Yufan Liu, Songyang Zhang, Mai Xu, Xuming He

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3224 - 3232

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Although the recent success of convolutional neural network (CNN) advances state-of-the-art saliency prediction in static images, few work has addressed the problem of predicting attention in videos. On the other hand, we find that the attention of different subjects consistently focuses on a single face in each frame of videos involving multiple faces. Therefore, we propose in this paper a novel...

chapter

Online Asymmetric Similarity Learning for Cross-Modal Retrieval

Yiling Wu, Shuhui Wang, Qingming Huang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3984 - 3993

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Cross-modal retrieval has attracted intensive attention in recent years. Measuring the semantic similarity between heterogeneous data objects is an essential yet challenging problem in cross-modal retrieval. In this paper, we propose an online learning method to learn the similarity function between heterogeneous modalities by preserving the relative similarity in the training data, which is modeled...

chapter

Incremental Kernel Null Space Discriminant Analysis for Novelty Detection

Juncheng Liu, Zhouhui Lian, Yi Wang, Jianguo Xiao

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4123 - 4131

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Novelty detection, which aims to determine whether a given data belongs to any category of training data or not, is considered to be an important and challenging problem in areas of Pattern Recognition, Machine Learning, etc. Recently, kernel null space method (KNDA) was reported to have state-of-the-art performance in novelty detection. However, KNDA is hard to scale up because of its high computational...

chapter

Hardware-Efficient Guided Image Filtering for Multi-label Problem

Longquan Dai, Mengke Yuan, Zechao Li, Xiaopeng Zhang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4905 - 4913

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The Guided Filter (GF) is well-known for its linear complexity. However, when filtering an image with an n-channel guidance, GF needs to invert an n × n matrix for each pixel. To the best of our knowledge existing matrix inverse algorithms are inefficient on current hardwares. This shortcoming limits applications of multichannel guidance in computation intensive system such as multi-label...

chapter

Correlational Gaussian Processes for Cross-Domain Visual Recognition

Chengjiang Long, Gang Hua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4932 - 4940

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a probabilistic model that captures higher order co-occurrence statistics for joint visual recognition in a collection of images and across multiple domains. More importantly, we predict the structured output across multiple domains by correlating outputs from the multi-classes Gaussian process classifiers in each individual domain. A set of correlational tensors is adopted to model the...

chapter

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

Jing Zhang, Wanqing Li, Philip Ogunbona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5150 - 5158

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a novel unsupervised domain adaptation method for cross-domain visual recognition. We propose a unified framework that reduces the shift between domains both statistically and geometrically, referred to as Joint Geometrical and Statistical Alignment (JGSA). Specifically, we learn two coupled projections that project the source domain and target domain data into low-dimensional...

chapter

Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Lena Gorelick, Yuri Boykov, Olga Veksler

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6062 - 6070

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Many computer vision problems require optimization of binary non-submodular energies. In this context, iterative submodularization techniques based on trust region (LSA-TR) and auxiliary functions (LSA-AUX) have been recently proposed [9]. They achieve state-of-the-art-results on a number of computer vision applications. In this paper we extend the LSA-AUX framework in two directions. First, unlike...

chapter

Attend to You: Personalized Image Captioning with Context Sequence Memory Networks

Cesc Chunseong Park, Byeongchang Kim, Gunhee Kim

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6432 - 6440

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We address personalization issues of image captioning, which have not been discussed yet in previous research. For a query image, we aim to generate a descriptive sentence, accounting for prior knowledge such as the users active vocabularies in previous documents. As applications of personalized image captioning, we tackle two post automation tasks: hashtag prediction and post generation, on our newly...

chapter

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6836 - 6845

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Several recent works have used deep convolutional networks to generate realistic imagery. These methods sidestep the traditional computer graphics rendering pipeline and instead generate imagery at the pixel level by learning from large collections of photos (e.g. faces or bedrooms). However, these methods are of limited utility because it is difficult for a user to control what the network produces...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Exclusivity-Consistency Regularized Multi-view Subspace Clustering

The More You Know: Using Knowledge Graphs for Image Classification

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Binary Constraint Preserving Graph Matching

Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting

AMVH: Asymmetric Multi-Valued hashing

Zero-Shot Action Recognition with Error-Correcting Output Codes

Semantically Consistent Regularization for Zero-Shot Recognition

ER3: A Unified Framework for Event Retrieval, Recognition and Recounting

Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos

Predicting Salient Face in Multiple-Face Videos

Online Asymmetric Similarity Learning for Cross-Modal Retrieval

Incremental Kernel Null Space Discriminant Analysis for Novelty Detection

Hardware-Efficient Guided Image Filtering for Multi-label Problem

Correlational Gaussian Processes for Cross-Domain Visual Recognition

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Attend to You: Personalized Image Captioning with Context Sequence Memory Networks

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)