Search results

chapter

Contextual superpixel description for remote sensing image classification

J. E. Vargas, A. X. Falcao, J. A. dos Santos, J. C. D. M. Esquerdo, more

2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 1132 - 1135

2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS)

The performance of pattern classifiers depends on the separability of the classes in the feature space — a property related to the quality of the descriptors — and the choice of informative training samples for user labeling — a procedure that usually requires active learning. This work is devoted to improve the quality of the descriptors when samples are superpixels from remote sensing images. We...

chapter

Dictionary based pooling for object categorization

Sean Ryan Fanello, Nicoletta Noceti, Giorgio Metta, Francesca Odone

2014 International Conference on Computer Vision Theory and Applications (VISAPP) > 2 > 269 - 274

2014 International Conference on Computer Vision Theory and Applications (VISAPP)

It is well known that image representations learned through ad-hoc dictionaries improve the overall results in object categorization problems. Following the widely accepted coding-pooling visual recognition pipeline, these representations are often tightly coupled with a coding stage. In this paper we show how to exploit ad-hoc representations both within the coding and the pooling phases. We learn...

chapter

Human action recognition using an improved string edit distance

Pasquale Foggia, Benoit Gauzere, Alessia Saggese, Mario Vento

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

In this paper we propose an improvement of a human action recognition method that uses a string-based representation and a string edit distance to compare the observed action with reference actions in the training set. In particular, the original improvement is based on a specific formulation of the string edit distance that is more suited to take into account the problems related to noise and to...

chapter

Improving bag of visual words representations with genetic programming

Hugo Jair Escalante, Jose Martinez-Carraza, Sergio Escalera, Victor Ponce-Lopez, more

2015 International Joint Conference on Neural Networks (IJCNN) > 1 - 8

2015 International Joint Conference on Neural Networks (IJCNN)

The bag of visual words is a well established representation in diverse computer vision problems. Taking inspiration from the fields of text mining and retrieval, this representation has proved to be very effective in a large number of domains. In most cases, a standard term-frequency weighting scheme is considered for representing images and videos in computer vision. This is somewhat surprising,...

chapter

Effective semantic pixel labelling with convolutional networks and Conditional Random Fields

Sakrapee Paisitkriangkrai, Jamie Sherrah, Pranam Janney, Anton Van-Den Hengel

2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 36 - 43

2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Large amounts of available training data and increasing computing power have led to the recent success of deep convolutional neural networks (CNN) on a large number of applications. In this paper, we propose an effective semantic pixel labelling using CNN features, hand-crafted features and Conditional Random Fields (CRFs). Both CNN and hand-crafted features are applied to dense image patches to produce...

chapter

Learning to count with deep object features

Santi Segui, Oriol Pujol, Jordi Vitria

2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 90 - 96

2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Learning to count is a learning strategy that has been recently proposed in the literature for dealing with problems where estimating the number of object instances in a scene is the final objective. In this framework, the task of learning to detect and localize individual object instances is seen as a harder task that can be evaded by casting the problem as that of computing a regression value from...

chapter

Beyond Bag-of-Words: Fast video classification with Fisher Kernel Vector of Locally Aggregated Descriptors

Ionut Mironica, Ionut Duta, Bogdan Ionescu, Nicu Sebe

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

In this paper we introduce a new video description framework that replaces traditional Bag-of-Words with a combination of Fisher Kernels (FK) and Vector of Locally Aggregated Descriptors (VLAD). The main contributions are: (i) a fast algorithm to densely extract global frame features, easier and faster to compute than spatio-temporal local features; (ii) replacing the traditional k-means based vocabulary...

chapter

Recipe recognition with large multimodal food dataset

Xin Wang, Devinder Kumar, Nicolas Thome, Matthieu Cord, more

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 1 - 6

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

This paper deals with automatic systems for image recipe recognition. For this purpose, we compare and evaluate leading vision-based and text-based technologies on a new very large multimodal dataset (UPMC Food-101) containing about 100,000 recipes for a total of 101 food categories. Each item in this dataset is represented by one image plus textual information. We present deep experiments of recipe...

chapter

PET: An eye-tracking dataset for animal-centric Pascal object classes

Syed Omer Gilani, Ramanathan Subramanian, Yan Yan, David Melcher, more

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

We present PET- the Pascal animal classes Eye Tracking database. Our database comprises eye movement recordings compiled from forty users for the bird, cat, cow, dog, horse and sheep trainval sets from the VOC 2012 image set. Different from recent eye-tracking databases such as [1, 2], a salient aspect of PET is that it contains eye movements recorded for both the free-viewing and visual search task...

chapter

Comparing the effect of concurrent and delayed visual feedback on consolidating motor memory in force control

Gaofeng Yang, Dangxiao Wang, Yuru Zhang

2015 IEEE World Haptics Conference (WHC) > 440 - 444

2015 IEEE World Haptics Conference (WHC)

The capability of applying a weak force with expected accuracy is an important motor skill in surgical operations. Acquiring such a skill is challenging for novices. In this paper, we studied how the accuracy of the force control could be enhanced through repetitive training. Twelve participants were divided into two groups. They were trained to apply a target force of 0.25N with ±20% accuracy under...

chapter

Fine-grained classification of identity document types with only one example

Marcel Simon, Erik Rodner, Joachim Denzler

2015 14th IAPR International Conference on Machine Vision Applications (MVA) > 126 - 129

2015 14th IAPR International Conference on Machine Vision Applications (MVA)

In this paper, we tackle the task of recognizing types of partly very similar identity documents using state-of-the-art visual recognition approaches. Given a scanned document, the goal is to identify the country of issue, the type of document, and its version. Whereas recognizing the individual parts of a document with known standardized layout can be done reliably, identifying the type of a document...

chapter

Discriminative learning of apparel features

Rasmus Rothe, Marko Ristin, Matthias Dantone, Luc Van Gool

2015 14th IAPR International Conference on Machine Vision Applications (MVA) > 5 - 9

2015 14th IAPR International Conference on Machine Vision Applications (MVA)

Fashion is a major segment in e-commerce with growing importance and a steadily increasing number of products. Since manual annotation of apparel items is very tedious, the product databases need to be organized automatically, e.g. by image classification. Common image classification approaches are based on features engineered for general purposes which perform poorly on specific images of apparel...

chapter

Model of Human Visual Cortex Inspired Computational Models for Visual Recognition

Jinjun Wang, Qiqi Hou, Nan Liu, Shizhou Zhang

2015 IEEE International Conference on Multimedia Big Data > 88 - 91

2015 IEEE International Conference on Multimedia Big Data (BigMM)

In this paper, we are mostly interested in investigating how the study and discovery of the human visual cortex could be utilised to improve the computational models for visual recognition by computer vision. Many of the brain perceptual abilities in vision have corresponding algorithms exist in computer vision, and in this paper we discuss three such models. First we present a model that has the...

chapter

Face recognition for great apes: Identification of primates in videos

Alexander Loos, Talat Anand Mohan Kalyanasundaram

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1548 - 1552

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Due to the ongoing biodiversity crisis, many species including great apes such as chimpanzees or gorillas are threatened and need to be protected. To overcome the catastrophic decline of biodiversity, biologists recently started to use remote cameras for wildlife monitoring. However, the manual analysis of the resulting image and video material is extremely tedious, time consuming, and highly cost...

chapter

How to Transfer? Zero-Shot Object Recognition via Hierarchical Transfer of Semantic Attributes

Ziad Al-Halah, Rainer Stiefelhagen

2015 IEEE Winter Conference on Applications of Computer Vision > 837 - 843

2015 IEEE Winter Conference on Applications of Computer Vision (WACV)

Attribute based knowledge transfer has proven very successful in visual object analysis and learning previously unseen classes. However, the common approach learns and transfers attributes without taking into consideration the embedded structure between the categories in the source set. Such information provides important cues on the intraattribute variations. We propose to capture these variations...

chapter

Bikers Are Like Tobacco Shops, Formal Dressers Are Like Suits: Recognizing Urban Tribes with Caffe

Yufei Wang, Garrison W. Cottrell

2015 IEEE Winter Conference on Applications of Computer Vision > 876 - 883

2015 IEEE Winter Conference on Applications of Computer Vision (WACV)

Recognition of social styles of people is an interesting but relatively unexplored task. Recognizing "style" appears to be a quite different problem than categorization, it is like recognizing a letter's font as opposed to recognizing the letter itself. Similar-looking things must be mapped to different categories. Hence a priori it would appear that features that are good for categorization...

chapter

Wolf search algorithm for attribute reduction in classification

Waleed Yamany, E. Emary, Aboul Ella Hassanien

2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM) > 351 - 358

2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)

Data sets ordinarily includes a huge number of attributes, with irrelevant and redundant attributes. Redundant and irrelevant attributes might minimize the classification accuracy because of the huge search space. The main goal of attribute reduction is choose a subset of relevant attributes from a huge number of available attributes to obtain comparable or even better classification accuracy than...

chapter

Chromatic SSVEP BCI paradigm targeting the higher frequency EEG responses

Daiki Aminaka, Shoji Makino, Tomasz M. Rutkowski

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific > 1 - 7

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

A novel approach to steady-state visual evoked potential (SSVEP) based brain-computer interface (BCI) is presented in the paper. To minimize possible side-effects of the monochromatic light SSVEP-based BCI we propose to utilize chromatic green-blue flicker stimuli in higher, comparing to the traditionally used, frequencies. The developed safer SSVEP responses are processed an classified with features...

chapter

Template-Based Multiple Codebooks Generation for Fine-Grained Shopping Classification and Retrieval

Hui Liu, Zhuo Su

2014 5th International Conference on Digital Home > 293 - 298

2014 5th International Conference on Digital Home (ICDH)

Visual codebook based quantization of robust appearance descriptors extracted from local image patches is an effective means of capturing image statistics for object classification. A codebook is usually constructed by using a cluster method such as k-means at object level or image level. The codebook is global. For fine-grained categorization and recognition problems, however, the global object-level...

chapter

A hardware accelerated multilevel visual classifier for embedded visual-assist systems

Matthew Cotter, Siddharth Advani, Jack Sampson, Kevin Irick, more

2014 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 96 - 100

2014 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Embedded visual assist systems are emerging as increasingly viable tools for aiding visually impaired persons in their day-to-day life activities. Novel wearable devices with imaging capabilities will be uniquely positioned to assist visually impaired in activities such as grocery shopping. However, supporting such time-sensitive applications on embedded platforms requires an intelligent trade-off...

INFONA - science communication portal

Search results

Contextual superpixel description for remote sensing image classification

Dictionary based pooling for object categorization

Human action recognition using an improved string edit distance

Improving bag of visual words representations with genetic programming

Effective semantic pixel labelling with convolutional networks and Conditional Random Fields

Learning to count with deep object features

Beyond Bag-of-Words: Fast video classification with Fisher Kernel Vector of Locally Aggregated Descriptors

Recipe recognition with large multimodal food dataset

PET: An eye-tracking dataset for animal-centric Pascal object classes

Comparing the effect of concurrent and delayed visual feedback on consolidating motor memory in force control

Fine-grained classification of identity document types with only one example

Discriminative learning of apparel features

Model of Human Visual Cortex Inspired Computational Models for Visual Recognition

Face recognition for great apes: Identification of primates in videos

How to Transfer? Zero-Shot Object Recognition via Hierarchical Transfer of Semantic Attributes

Bikers Are Like Tobacco Shops, Formal Dressers Are Like Suits: Recognizing Urban Tribes with Caffe

Wolf search algorithm for attribute reduction in classification

Chromatic SSVEP BCI paradigm targeting the higher frequency EEG responses

Template-Based Multiple Codebooks Generation for Fine-Grained Shopping Classification and Retrieval

A hardware accelerated multilevel visual classifier for embedded visual-assist systems

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options