2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Junwu Weng, Chaoqun Weng, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 445 - 454

Motivated by previous success of using non-parametric methods to recognize objects, e.g., NBNN [2], we extend it to recognize actions using skeletons. Each 3D action is presented by a sequence of 3D poses. Similar to NBNN, our proposed Spatio-Temporal-NBNN applies stage-to-class distance to classify actions. However, ST-NBNN takes the spatio-temporal structure of 3D actions into consideration and...

chapter

Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training

Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 965 - 974

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We approach the problem of fast detection and recognition of a large number (thousands) of object categories while training on a very limited amount of examples, usually one per category. Examples of this task include: (i) detection of retail products, where we have only one studio image of each product available for training, (ii) detection of brand logos, and (iii) detection of 3D objects and their...

chapter

Captioning Images with Diverse Objects

Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1170 - 1178

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent captioning models are limited in their ability to scale and describe concepts unseen in paired image-text corpora. We propose the Novel Object Captioner (NOC), a deep visual semantic captioning model that can describe a large number of object categories not present in existing image-caption datasets. Our model takes advantage of external sources – labeled images from object recognition...

chapter

Finding Tiny Faces

Peiyun Hu, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1522 - 1530

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Though tremendous strides have been made in object recognition, one of the remaining open challenges is detecting small objects. We explore three aspects of the problem in the context of finding small faces: the role of scale invariance, image resolution, and contextual reasoning. While most recognition approaches aim to be scale-invariant, the cues for recognizing a 3px tall face are fundamentally...

chapter

Emotion Recognition in Context

Ronak Kosti, Jose M. Alvarez, Adria Recasens, Agata Lapedriza

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1960 - 1968

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Understanding what a person is experiencing from her frame of reference is essential in our everyday life. For this reason, one can think that machines with this type of ability would interact better with people. However, there are no current systems capable of understanding in detail peoples emotional states. Previous research on computer vision to recognize emotions has mainly focused on analyzing...

chapter

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2037 - 2046

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...

chapter

Unsupervised Part Learning for Visual Recognition

Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frederic Jurie

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3116 - 3124

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Part-based image classification aims at representing categories by small sets of learned discriminative parts, upon which an image representation is built. Considered as a promising avenue a decade ago, this direction has been neglected since the advent of deep neural networks. In this context, this paper brings two contributions: first, this work proceeds one step further compared to recent part-based...

chapter

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3165 - 3174

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we introduce a new video representation for action classification that aggregates local convolutional features across the entire spatio-temporal extent of the video. We do so by integrating state-of-the-art two-stream networks [42] with learnable spatio-temporal feature aggregation [6]. The resulting architecture is end-to-end trainable for whole-video classification. We investigate...

chapter

Deep Feature Flow for Video Recognition

Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4141 - 4150

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep convolutional neutral networks have achieved great success on image recognition tasks. Yet, it is non-trivial to transfer the state-of-the-art image recognition networks to videos as per-frame evaluation is too slow and unaffordable. We present deep feature flow, a fast and accurate framework for video recognition. It runs the expensive convolutional sub-network only on sparse key frames and...

chapter

Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition

Jianlong Fu, Heliang Zheng, Tao Mei

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4476 - 4484

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recognizing fine-grained categories (e.g., bird species) is difficult due to the challenges of discriminative region localization and fine-grained feature learning. Existing approaches predominantly solve these challenges independently, while neglecting the fact that region detection and fine-grained feature learning are mutually correlated and thus can reinforce each other. In this paper, we propose...

chapter

Quality Aware Network for Set to Set Recognition

Yu Liu, Junjie Yan, Wanli Ouyang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4694 - 4703

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper targets on the problem of set to set recognition, which learns the metric between two image sets. Images in each set belong to the same identity. Since images in a set can be complementary, they hopefully lead to higher accuracy in practical applications. However, the quality of each sample cannot be guaranteed, and samples with poor quality will hurt the metric. In this paper, the quality...

chapter

Correlational Gaussian Processes for Cross-Domain Visual Recognition

Chengjiang Long, Gang Hua

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4932 - 4940

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a probabilistic model that captures higher order co-occurrence statistics for joint visual recognition in a collection of images and across multiple domains. More importantly, we predict the structured output across multiple domains by correlating outputs from the multi-classes Gaussian process classifiers in each individual domain. A set of correlational tensors is adopted to model the...

chapter

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

Jing Zhang, Wanqing Li, Philip Ogunbona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5150 - 5158

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a novel unsupervised domain adaptation method for cross-domain visual recognition. We propose a unified framework that reduces the shift between domains both statistically and geometrically, referred to as Joint Geometrical and Statistical Alignment (JGSA). Specifically, we learn two coupled projections that project the source domain and target domain data into low-dimensional...

chapter

Neural Aggregation Network for Video Face Recognition

Jiaolong Yang, Peiran Ren, Dongqing Zhang, Dong Chen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5216 - 5225

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a Neural Aggregation Network (NAN) for video face recognition. The network takes a face video or face image set of a person with a variable number of face images as its input, and produces a compact, fixed-dimension feature representation for recognition. The whole network is composed of two modules. The feature embedding module is a deep Convolutional Neural Network (CNN) which...

chapter

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5996 - 6004

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-instance multi-label (MIML) learning has many interesting applications in computer visions, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding-boxes, image captions and descriptions is often available during training phrase, which is referred as privileged information (PI). However, as existing works on learning using...

chapter

Multi-attention Network for One Shot Learning

Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6212 - 6220

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

One-shot learning is a challenging problem where the aim is to recognize a class identified by a single training image. Given the practical importance of one-shot learning, it seems surprising that the rich information present in the class tag itself has largely been ignored. Most existing approaches restrict the use of the class tag to finding similar classes and transferring classifiers or metrics...

chapter

Link the Head to the "Beak": Zero Shot Learning from Noisy Text Description at Part Precision

Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6288 - 6297

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we study learning visual classifiers from unstructured text descriptions at part precision with no training images. We propose a learning framework that is able to connect text terms to its relevant parts and suppress connections to non-visual text terms without any part-text annotations. For instance, this learning process enables terms like beak to be sparsely linked to the visual...

chapter

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6335 - 6344

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Semantic sparsity is a common challenge in structured visual classification problems, when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects...

chapter

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Michael Lam, Behrooz Mahasseni, Sinisa Todorovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6497 - 6506

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This work addresses fine-grained image classification. Our work is based on the hypothesis that when dealing with subtle differences among object classes it is critical to identify and only account for a few informative image parts, as the remaining image context may not only be uninformative but may also hurt recognition. This motivates us to formulate our problem as a sequential search for informative...

chapter

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

Jose Lezama, Qiang Qiu, Guillermo Sapiro

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6807 - 6816

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Surveillance cameras today often capture NIR (near infrared) images in low-light environments. However, most face datasets accessible for training and verification are only collected in the VIS (visible light) spectrum. It remains a challenging problem to match NIR to VIS face images due to the different light spectrum. Recently, breakthroughs have been made for VIS face recognition by applying deep...

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training

Captioning Images with Diverse Objects

Finding Tiny Faces

Emotion Recognition in Context

Semantically Consistent Regularization for Zero-Shot Recognition

Unsupervised Part Learning for Visual Recognition

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

Deep Feature Flow for Video Recognition

Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition

Quality Aware Network for Set to Set Recognition

Correlational Gaussian Processes for Cross-Domain Visual Recognition

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

Neural Aggregation Network for Video Face Recognition

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

Multi-attention Network for One Shot Learning

Link the Head to the "Beak": Zero Shot Learning from Noisy Text Description at Part Precision

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)