Search results

Items from 81 to 100 out of 2,083 results

chapter

Open Vocabulary Scene Parsing

Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2021 - 2029

2017 IEEE International Conference on Computer Vision (ICCV)

Recognizing arbitrary objects in the wild has been a challenging problem due to the limitations of existing classification models and datasets. In this paper, we propose a new task that aims at parsing scenes with a large and open vocabulary, and several evaluation metrics are explored for this problem. Our approach is a joint image pixel and word concept embeddings framework, where word concepts...

chapter

Unsupervised Representation Learning by Sorting Sequences

Hsin-Ying Lee, Jia-Bin Huang, Maneesh Singh, Ming-Hsuan Yang

2017 IEEE International Conference on Computer Vision (ICCV) > 667 - 676

2017 IEEE International Conference on Computer Vision (ICCV)

We present an unsupervised representation learning approach using videos without semantic labels. We leverage the temporal coherence as a supervisory signal by formulating representation learning as a sequence sorting task. We take temporally shuffled frames (i.e., in non-chronological order) as inputs and train a convolutional neural network to sort the shuffled sequences. Similar to comparison-based...

chapter

Automatic Spatially-Aware Fashion Concept Discovery

Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1472 - 1480

2017 IEEE International Conference on Computer Vision (ICCV)

This paper proposes an automatic spatially-aware concept discovery approach using weakly labeled image-text data from shopping websites. We first fine-tune GoogleNet by jointly modeling clothing images and their corresponding descriptions in a visual-semantic embedding space. Then, for each attribute (word), we generate its spatiallyaware representation by combining its semantic word vector representation...

chapter

Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4970 - 4979

2017 IEEE International Conference on Computer Vision (ICCV)

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions...

chapter

WordSup: Exploiting Word Annotations for Character Based Text Detection

Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4950 - 4959

2017 IEEE International Conference on Computer Vision (ICCV)

Imagery texts are usually organized as a hierarchy of several visual elements, i.e. characters, words, text lines and text blocks. Among these elements, character is the most basic one for various languages such as Western, Chinese, Japanese, mathematical expression and etc. It is natural and convenient to construct a common text detection engine based on character detectors. However, training character...

chapter

Summarization and Classification of Wearable Camera Streams by Learning the Distributions over Deep Features of Out-of-Sample Image Sequences

Alessandro Penna, Sadegh Mohammadi, Nebojsa Jojic, Vittorio Murino

2017 IEEE International Conference on Computer Vision (ICCV) > 4336 - 4344

2017 IEEE International Conference on Computer Vision (ICCV)

A popular approach to training classifiers of new image classes is to use lower levels of a pre-trained feed-forward neural network and retrain only the top. Thus, most layers simply serve as highly nonlinear feature extractors. While these features were found useful for classifying a variety of scenes and objects, previous work also demonstrated unusual levels of sensitivity to the input especially...

chapter

Learning Visual N-Grams from Web Data

Ang Li, Allan Jabri, Armand Joulin, Laurens van der Maaten

2017 IEEE International Conference on Computer Vision (ICCV) > 4193 - 4202

2017 IEEE International Conference on Computer Vision (ICCV)

Real-world image recognition systems need to recognize tens of thousands of classes that constitute a plethora of visual concepts. The traditional approach of annotating thousands of images per class for training is infeasible in such a scenario, prompting the use of webly supervised data. This paper explores the training of image-recognition systems on large numbers of images and associated user...

chapter

Towards Diverse and Natural Image Descriptions via a Conditional GAN

Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin

2017 IEEE International Conference on Computer Vision (ICCV) > 2989 - 2998

2017 IEEE International Conference on Computer Vision (ICCV)

Despite the substantial progress in recent years, the image captioning techniques are still far from being perfect. Sentences produced by existing methods, e.g. those based on RNNs, are often overly rigid and lacking in variability. This issue is related to a learning principle widely used in practice, that is, to maximize the likelihood of training samples. This principle encourages high resemblance...

chapter

Deep Growing Learning

Guangcong Wang, Xiaohua Xie, Jianhuang Lai, Jiaxuan Zhuo

2017 IEEE International Conference on Computer Vision (ICCV) > 2831 - 2839

2017 IEEE International Conference on Computer Vision (ICCV)

Semi-supervised learning (SSL) is an import paradigm to make full use of a large amount of unlabeled data in machine learning. A bottleneck of SSL is the overfitting problem when training over the limited labeled data, especially on a complex model like a deep neural network. To get around this bottleneck, we propose a bio-inspired SSL framework on deep neural network, namely Deep Growing Learning...

chapter

Efficient Online Local Metric Adaptation via Negative Samples for Person Re-identification

Jiahuan Zhou, Pei Yu, Wei Tang, Ying Wu

2017 IEEE International Conference on Computer Vision (ICCV) > 2439 - 2447

2017 IEEE International Conference on Computer Vision (ICCV)

Many existing person re-identification (PRID) methods typically attempt to train a faithful global metric offline to cover the enormous visual appearance variations, so as to directly use it online on various probes for identity match- ing. However, their need for a huge set of positive training pairs is very demanding in practice. In contrast to these methods, this paper advocates a different paradigm:...

chapter

Identity-Aware Textual-Visual Matching with Latent Co-attention

Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1908 - 1917

2017 IEEE International Conference on Computer Vision (ICCV)

Textual-visual matching aims at measuring similarities between sentence descriptions and images. Most existing methods tackle this problem without effectively utilizing identity-level annotations. In this paper, we propose an identity-aware two-stage framework for the textual-visual matching problem. Our stage-1 CNN-LSTM network learns to embed cross-modal features with a novel Cross-Modal Cross-Entropy...

chapter

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

Soravit Changpinyo, Wei-Lun Chao, Fei Sha

2017 IEEE International Conference on Computer Vision (ICCV) > 3496 - 3505

2017 IEEE International Conference on Computer Vision (ICCV)

Leveraging class semantic descriptions and examples of known objects, zero-shot learning makes it possible to train a recognition model for an object class whose examples are not available. In this paper, we propose a novel zero-shot learning model that takes advantage of clustering structures in the semantic embedding space. The key idea is to impose the structural constraint that semantic representations...

chapter

Large-Scale Image Retrieval with Attentive Deep Local Features

Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3476 - 3485

2017 IEEE International Conference on Computer Vision (ICCV)

We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELE (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for key point selection,...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

2017 IEEE International Conference on Computer Vision (ICCV)

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

chapter

Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles

Ke Yan, Yonghong Tian, Yaowei Wang, Wei Zeng, more

2017 IEEE International Conference on Computer Vision (ICCV) > 562 - 570

2017 IEEE International Conference on Computer Vision (ICCV)

Precise search of visually-similar vehicles poses a great challenge in computer vision, which needs to find exactly the same vehicle among a massive vehicles with visually similar appearances for a given query image. In this paper, we model the relationship of vehicle images as multiple grains. Following this, we propose two approaches to alleviate the precise vehicle search problem by exploiting...

chapter

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

2017 IEEE International Conference on Computer Vision (ICCV) > 589 - 598

2017 IEEE International Conference on Computer Vision (ICCV)

Recognizing how objects interact with each other is a crucial task in visual recognition. If we define the context of the interaction to be the objects involved, then most current methods can be categorized as either: (i) training a single classifier on the combination of the interaction and its context; or (ii) aiming to recognize the interaction independently of its explicit context. Both methods...

chapter

Representation Learning by Learning to Count

Mehdi Noroozi, Hamed Pirsiavash, Paolo Favaro

2017 IEEE International Conference on Computer Vision (ICCV) > 5899 - 5907

2017 IEEE International Conference on Computer Vision (ICCV)

We introduce a novel method for representation learning that uses an artificial supervision signal based on counting visual primitives. This supervision signal is obtained from an equivariance relation, which does not require any manual annotation. We relate transformations of images to transformations of the representations. More specifically, we look for the representation that satisfies such relation...

chapter

Unsupervised Learning from Video to Detect Foreground Objects in Single Images

Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

2017 IEEE International Conference on Computer Vision (ICCV) > 4345 - 4353

2017 IEEE International Conference on Computer Vision (ICCV)

Unsupervised learning from visual data is one of the most difficult challenges in computer vision. It is essential for understanding how visual recognition works. Learning from unsupervised input has an immense practical value, as huge quantities of unlabeled videos can be collected at low cost. Here we address the task of unsupervised learning to detect and segment foreground objects in single images...

chapter

Analysis of dialogue stimulated by science videos and reference materials

Daichi Sunouchi, Kiyoshi Nosu

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 1119 - 1122

2017 Federated Conference on Computer Science and Information Systems (FedCSIS)

Recently, many have begun to believe that learning and training approaches known as learner-centered, active learning, and cooperative learning improve learning and practicing performance and are more effective than traditional lectures. Moreover, in addition to paper-based materials such as textbooks, face-to-face co-located communication frequently utilizes digital video and other visual reference...

chapter

What looks good with my sofa: Multimodal search engine for interior design

Ivona Tautkute, Aleksandra Mozejko, Wojciech Stokowiec, Tomasz Trzcinski, more

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 1275 - 1282

2017 Federated Conference on Computer Science and Information Systems (FedCSIS)

In this paper, we propose a multi-modal search engine for interior design that combines visual and textual queries. The goal of our engine is to retrieve interior objects, e.g. furniture or wall clocks, that share visual and aesthetic similarities with the query. Our search engine allows the user to take a photo of a room and retrieve with a high recall a list of items identical or visually similar...

Keywords:
VISUALIZATION
Publication type:
book

Publication date

Set your own date range

Content availability

Available (2,078)
None (5)

Keywords

FEATURE EXTRACTION (665)
SUPPORT VECTOR MACHINES (254)
COMPUTATIONAL MODELING (189)
SEMANTICS (185)
IMAGE COLOR ANALYSIS (175)
ACCURACY (157)
IMAGE CLASSIFICATION (147)
DATA MINING (134)
IMAGE SEGMENTATION (132)
HISTOGRAMS (121)
KERNEL (114)
OBJECT RECOGNITION (114)
LEARNING (ARTIFICIAL INTELLIGENCE) (113)
NEURAL NETWORKS (108)
COMPUTER VISION (107)
TESTING (104)
OBJECT DETECTION (103)
VECTORS (101)
IMAGE RECOGNITION (98)
DATABASES (97)
CAMERAS (96)
IMAGE RETRIEVAL (95)
CORRELATION (92)
DETECTORS (91)
SHAPE (88)
ROBOTS (87)
VOCABULARY (87)
ROBUSTNESS (86)
GAMES (85)
MACHINE LEARNING (84)
ELECTROENCEPHALOGRAPHY (82)
DICTIONARIES (80)
TRAINING DATA (77)
CONTEXT (73)
HAPTIC INTERFACES (73)
VIRTUAL REALITY (73)
HIDDEN MARKOV MODELS (71)
TARGET TRACKING (71)
FACE (69)
CLASSIFICATION ALGORITHMS (68)
THREE-DIMENSIONAL DISPLAYS (68)
HUMANS (64)
DATA MODELS (62)
SOLID MODELING (60)
MEASUREMENT (59)
NEURONS (58)
OPTIMIZATION (58)
TRAJECTORY (57)
ARTIFICIAL NEURAL NETWORKS (55)
IMAGE REPRESENTATION (55)
SPEECH (55)
ENCODING (54)
CONFERENCES (52)
DEEP LEARNING (51)
IMAGE EDGE DETECTION (51)
PREDICTIVE MODELS (50)
STANDARDS (50)
ESTIMATION (49)
FACE RECOGNITION (49)
VIDEOS (49)
EDUCATIONAL INSTITUTIONS (48)
PIXEL (46)
PRINCIPAL COMPONENT ANALYSIS (46)
FORCE (45)
MATHEMATICAL MODEL (45)
ADAPTATION MODELS (44)
COMPUTER ARCHITECTURE (43)
DATA VISUALIZATION (42)
CLUSTERING ALGORITHMS (41)
IMAGE RECONSTRUCTION (40)
JOINTS (39)
NAVIGATION (39)
COMPUTERS (38)
DATA VISUALISATION (38)
MULTIMEDIA COMMUNICATION (38)
CONVOLUTION (37)
PROTOTYPES (36)
ROBOT SENSING SYSTEMS (36)
INTERNET (35)
PATTERN RECOGNITION (35)
SOFTWARE (35)
VIDEO SIGNAL PROCESSING (35)
SPEECH RECOGNITION (34)
PATTERN CLASSIFICATION (32)
PSYCHOLOGY (32)
BUILDINGS (31)
CLASSIFICATION (31)
IMAGE CODING (31)
LABELING (31)
NEURAL NETS (31)
THREE DIMENSIONAL DISPLAYS (31)
VEHICLES (31)
ALGORITHM DESIGN AND ANALYSIS (29)
CONTENT-BASED RETRIEVAL (29)
ELECTRODES (29)
IMAGE RESOLUTION (29)
LEGGED LOCOMOTION (29)
NOISE MEASUREMENT (29)
more

Data set

ieee (2,082)
Springer (1)

INFONA - science communication portal

Search results

Open Vocabulary Scene Parsing

Unsupervised Representation Learning by Sorting Sequences

Automatic Spatially-Aware Fashion Concept Discovery

Generalized Orderless Pooling Performs Implicit Salient Matching

WordSup: Exploiting Word Annotations for Character Based Text Detection

Summarization and Classification of Wearable Camera Streams by Learning the Distributions over Deep Features of Out-of-Sample Image Sequences

Learning Visual N-Grams from Web Data

Towards Diverse and Natural Image Descriptions via a Conditional GAN

Deep Growing Learning

Efficient Online Local Metric Adaptation via Negative Samples for Person Re-identification

Identity-Aware Textual-Visual Matching with Latent Co-attention

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

Large-Scale Image Retrieval with Attentive Deep Local Features

DualNet: Learn Complementary Features for Image Recognition

Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

Representation Learning by Learning to Count

Unsupervised Learning from Video to Detect Foreground Objects in Single Images

Analysis of dialogue stimulated by science videos and reference materials

What looks good with my sofa: Multimodal search engine for interior design

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options