2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

chapter

Multi-attention Network for One Shot Learning

Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6212 - 6220

One-shot learning is a challenging problem where the aim is to recognize a class identified by a single training image. Given the practical importance of one-shot learning, it seems surprising that the rich information present in the class tag itself has largely been ignored. Most existing approaches restrict the use of the class tag to finding similar classes and transferring classifiers or metrics...

chapter

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3068 - 3076

In this paper, we introduce Recipe1M, a new large-scale, structured corpus of over 1m cooking recipes and 800k food images. As the largest publicly available collection of recipe data, Recipe1M affords the ability to train high-capacity models on aligned, multi-modal data. Using these data, we train a neural network to find a joint embedding of recipes and images that yields impressive results on...

chapter

G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition

Qilong Wang, Peihua Li, Lei Zhang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6507 - 6516

Recently, plugging trainable structural layers into deep convolutional neural networks (CNNs) as image representations has made promising progress. However, there has been little work on inserting parametric probability distributions, which can effectively model feature statistics, into deep CNNs in an end-to-end manner. This paper proposes a Global Gaussian Distribution embedding Network (G2DeNet)...

chapter

MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features

Youssef Tamaazousti, Herve Le Borgne, Celine Hudelot

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5282 - 5291

In a transfer-learning scheme, the intermediate layers of a pre-trained CNN are employed as a universal image representation to tackle many visual classification problems. The current trend to generate such a representation is to learn a CNN on a large set of images labeled among the most specific categories. Such processes ignore potential relations between categories, as well as the categorical-levels...

chapter

Asymmetric Feature Maps with Application to Sketch Based Retrieval

Giorgos Tolias, Ondrej Chum

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6185 - 6193

We propose a novel concept of asymmetric feature maps (AFM), which allows multiple kernels to be evaluated between a query and database entries without increasing the memory requirements. To demonstrate the advantages of the AFM method, we derive a short vector image representation that, due to asymmetric feature maps, supports efficient scale- and translation-invariant sketch-based image retrieval. Unlike...

chapter

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search

Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6156 - 6164

We introduce a new fashion search protocol where attribute manipulation is allowed within the interaction between users and search engines, e.g., manipulating the color attribute of the clothing from red to blue. It is particularly useful for image-based search when the query image cannot perfectly match users' expectations of the desired product. To build such a search engine, we propose a novel memory-augmented...

chapter

Deep Visual-Semantic Quantization for Efficient Image Retrieval

Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 916 - 925

Compact coding has been widely applied to approximate nearest neighbor search for large-scale image retrieval, due to its computational efficiency and retrieval quality. This paper presents a compact coding solution with a focus on the deep learning to quantization approach, which improves retrieval quality by end-to-end representation learning and compact encoding and has already shown the superior...

chapter

Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search

Thanh-Toan Do, Dang-Khoa Le Tan, Trung T. Pham, Ngai-Man Cheung

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4217 - 4226

In most state-of-the-art hashing-based visual search systems, local image descriptors of an image are first aggregated as a single feature vector. This feature vector is then subjected to a hashing function that produces a binary hash code. In previous work, the aggregating and the hashing processes are designed independently. In this paper, we propose a novel framework where feature aggregating and...

chapter

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6335 - 6344

Semantic sparsity is a common challenge in structured visual classification problems: when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects...

chapter

Learned Contextual Feature Reweighting for Image Geo-Localization

Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3251 - 3260

We address the problem of large-scale image geo-localization, where the location of an image is estimated by identifying geo-tagged reference images depicting the same place. We propose a novel model for learning image representations that integrates context-aware feature reweighting in order to effectively focus on regions that positively contribute to geo-localization. In particular, we introduce...
