Search results

Items from 101 to 120 out of 1,080 results

1 ...
3
4
5
6
7
8
9

chapter

Multi-task Curriculum Transfer Deep Learning of Clothing Attributes

Qi Dong, Shaogang Gong, Xiatian Zhu

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 520 - 529

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

Recognising detailed clothing characteristics (finegrained attributes) in unconstrained images of people inthe-wild is a challenging task for computer vision, especially when there is only limited training data from the wild whilst most data available for model learning are captured in well-controlled environments using fashion models (well lit, no background clutter, frontal view, high-resolution)...

chapter

Complex Event Recognition from Images with Few Training Examples

Unaiza Ahsan, Chen Sun, James Hays, Irfan Essa

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 669 - 678

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

We propose to leverage concept-level representations for complex event recognition in photographs given limited training examples. We introduce a novel framework to discover event concept attributes from the web and use that to extract semantic features from images and classify them into social event categories with few training examples. Discovered concepts include a variety of objects, scenes, actions...

chapter

First-Person Action Decomposition and Zero-Shot Learning

Yun C. Zhang, Yin Li, James M. Rehg

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 121 - 129

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

In this work, we decompose a first-person action into verb and noun. We then study how the coupling of an action's constituent verb and noun affects the learners' ability to learn them separately and to combine them to perform recognition. We compare different information fusion methods on conventional action recognition and zero-shot learning, of which the latter is a strong indication of the feature's...

chapter

Image augmentation by blocky artifact in Deep Convolutional Neural Network for handwritten digit recognition

Md Shopon, Nabeel Mohammed, Md Anowarul Abedin

2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR) > 1 - 6

2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR)

Deep Convolutional Neural Networks - also known as DCNN - are powerful models for different visual pattern classification problems. Many works in this field use image augmentation at the training phase to achieve better accuracy. This paper presents blocky artifact as an augmentation technique to increase the accuracy of DCNN for handwritten digit recognition, both English and Bangla digits, i.e.,...

chapter

Action Recognition in Still Images Using Word Embeddings from Natural Language Descriptions

Karan Sharma, Arun CS Kumar, Suchendra M. Bhandarkar

2017 IEEE Winter Applications of Computer Vision Workshops (WACVW) > 58 - 66

2017 IEEE Winter Applications of Computer Vision Workshops (WACVW)

Detecting actions or verbs in still images is a challenging problem for a variety of reasons such as the absence of temporal information and polysemy of verbs which lead to difficulty in generating large verb datasets. In this paper, we propose to first detect the prominent objects in the image and then infer the relevant actions or verbs using Natural Language Processing (NLP)-based techniques. The...

chapter

Recognition of handwritten bilingual Characters-Numerals using shape context

Ranjana S. Zinjore, R. J. Ramteke

2016 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE) > 265 - 268

2016 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE)

This paper presents a methodology for recognition of handwritten Marathi and English Characters-Numerals using shape context descriptor. During pre-processing an algorithm is developed to extract the Marathi and English Characters-Numerals form grid formatted datasheets. The corresponding sample points around the boundary of a character are computed. This is followed by obtaining the centroid of the...

chapter

Fine-grained vehicle recognition using hierarchical fine-tuning strategy for Urban Surveillance Videos

Qiang Zhang, Li Zhuo, Xiaochen Hu, Jing Zhang

2016 International Conference on Progress in Informatics and Computing (PIC) > 233 - 236

2016 International Conference on Progress in Informatics and Computing (PIC)

The Fine-grained Vehicle recognition is easily affected by small visual changes. The existing recognition methods have less robustness to these conditions (such as illumination, weather changes, etc.) and the accuracy of vehicle recognition in complex environments cannot achieve a satisfying result. In this paper, a high-accuracy fine-grained vehicle recognition method using Convolutional Neural Network...

chapter

Machine learning framework for image classification

Sehla Loussaief, Afef Abdelkrim

2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT) > 58 - 61

2016 7th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)

Hereby in this paper, we are interested to extraction methods and classification in case of image classification and recognition application. We expose the performance of training models on varying classifier algorithms on Caltech 101 images categories. For feature extraction functions we evaluate the use of the classical SURF technique against global color feature extraction. The purpose of our work...

chapter

Mutually incoherent pose bases for Action recognition

Yinzhong Qian, Wenbin Chen, I-fan Shen

2016 23rd International Conference on Pattern Recognition (ICPR) > 823 - 828

2016 23rd International Conference on Pattern Recognition (ICPR)

We propose mutually incoherent pose bases for action recognition in static image, each of which implicitly represents co-occurrence of poselets. First of all, action specific poselets are trained. To suppress the ambiguity of detection, we cluster poselet activations by the overlap of predicted torso bound of each poselet. Then pose feature of an action person can be extracted which is a vector composed...

chapter

Person re-identification using CNN features learned from combination of attributes

Tetsu Matsukawa, Einoshin Suzuki

2016 23rd International Conference on Pattern Recognition (ICPR) > 2428 - 2433

2016 23rd International Conference on Pattern Recognition (ICPR)

This paper presents fine-tuned CNN features for person re-identification. Recently, features extracted from top layers of pre-trained Convolutional Neural Network (CNN) on a large annotated dataset, e.g., ImageNet, have been proven to be strong off-the-shelf descriptors for various recognition tasks. However, large disparity among the pre-trained task, i.e., ImageNet classification, and the target...

chapter

Effect of injected noise in deep neural networks

Naresh Nagabushan, Nishank Satish, S Raghuram

2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC) > 1 - 5

2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Deep Neural Networks have become increasingly popular due to their efficient realization in GPU hardware. Problems that were once considered computationally intensive to implement using Neural networks have now become possible due to the vast amount of flexibility and capability offered by the GPU and Deep networks combination. In this work, we attempt to improve the recognition rate for images, using...

chapter

Learning face recognition from limited training data using deep neural networks

Xi Peng, Nalini Ratha, Sharathchandra Pankanti

2016 23rd International Conference on Pattern Recognition (ICPR) > 1442 - 1447

2016 23rd International Conference on Pattern Recognition (ICPR)

Often deep learning methods are associated with huge amounts of training data. The deeper the network gets, the larger is the need for training data. A large amount of labeled data helps the network learn about the variations it needs to handle in the prediction stage. It is not easy for everyone to get access to huge amounts of labeled data leaving a few to have the luxury to design very deep networks...

chapter

Dominant plane recognition in interior scenes from a single image

J. A. de Jesus Osuna-Coutino, Jose Martinez-Carranza, Miguel Arias-Estrada, Walterio Mayol-Cuevas

2016 23rd International Conference on Pattern Recognition (ICPR) > 1923 - 1928

2016 23rd International Conference on Pattern Recognition (ICPR)

Recognition of dominant planes is an important task used in areas such as robot navigation, augmented reality, 3D reconstruction, among others. There are several approaches for recognizing planar structures, however, most of these approaches are based on processing two or more images captured from different camera views or on processing 3D data in the form of point clouds associated with the camera...

chapter

A hierarchical approach to event discovery from single images using MIL framework

Kashif Ahmad, Francesco De Natale, Giulia Boato, Andrea Rosani

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 1223 - 1227

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

In this paper we propose to face the problem of event detection from single images, by exploiting both background information often containing revealing contextual clues and details, which are salient for recognizing the event. Such details are visual objects critical to understand the underlying event depicted in the image and were recently defined in the literature as “event-saliency”. Adopting...

chapter

Hybrid hypergraph construction for facial expression recognition

Yuchi Huang, Hanqing Lu

2016 23rd International Conference on Pattern Recognition (ICPR) > 4142 - 4147

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we proposed a novel framework for facial expression recognition, in which face images were taken as vertices in a hypergraph and the task of expression recognition was formulated as the problem of hypergraph based inference. A hybrid strategy was developed to construct hyperedges: we generated probabilities of facial action units by deep convolutional networks and took each action unit...

chapter

Exploiting supervised learning for finetuning deep CNNs in content based image retrieval

Maria Tzelepi, Anastasios Tefas

2016 23rd International Conference on Pattern Recognition (ICPR) > 2918 - 2923

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper a novel CNN-based approach in the Content Based Image Retrieval domain that exploits supervised learning is proposed. We employ a deep CNN model to derive feature representations from the activations of the deepest layers and we refine the weights of the utilized layers in order to produce better image descriptors using information obtained from the available data labels. To this end,...

chapter

Supervised dictionary learning in BoF framework for Scene Character recognition

Maroua Tounsi, Ikram Moalla, Adel M. Alimi

2016 23rd International Conference on Pattern Recognition (ICPR) > 3987 - 3992

2016 23rd International Conference on Pattern Recognition (ICPR)

In recent years, growing attention has been paid to recognizing text in natural scenes images. Scene Character recognition (SCR) is an important step in automatizing the process of reading text in natural scenes.

chapter

Simultaneous food localization and recognition

Marc Bolanos, Petia Radeva

2016 23rd International Conference on Pattern Recognition (ICPR) > 3140 - 3145

2016 23rd International Conference on Pattern Recognition (ICPR)

The development of automatic nutrition diaries, which would allow to keep track objectively of everything we eat, could enable a whole new world of possibilities for people concerned about their nutrition patterns. With this purpose, in this paper we propose the first method for simultaneous food localization and recognition. Our method is based on two main steps, which consist in, first, produce...

chapter

Scene text recognition with CNN classifier and WFST-based word labeling

Xinhao Liu, Takahito Kawanishi, Xiaomeng Wu, Kunio Kashino

2016 23rd International Conference on Pattern Recognition (ICPR) > 3999 - 4004

2016 23rd International Conference on Pattern Recognition (ICPR)

Natural scene text recognition has proved to be challenging due to the unconstrained wild conditions. In this paper, to solve this problem we propose a method which first detects and recognizes characters by utilizing the high performance Convolutional Neural Network (CNN). Then for post-processing, inspired by its success in speech recognition, we employ the efficient and flexible Weight Finite State...

chapter

User-generated content curation with deep convolutional neural networks

Ruben Tous, Otto Wust, Mauro Gomez, Jonatan Poveda, more

2016 IEEE International Conference on Big Data (Big Data) > 2535 - 2540

2016 IEEE International Conference on Big Data (Big Data)

In this paper, we report a work consisting in using deep convolutional neural networks (CNNs) for curating and filtering photos posted by social media users (Instagram and Twitter). The final goal is to facilitate searching and discovering user-generated content (UGC) with potential value for digital marketing tasks. The images are captured in real time and automatically annotated with multiple CNNs...

1 ...
3
4
5
6
7
8
9

Keywords:
TRAINING
IMAGE RECOGNITION

Publication date

Set your own date range

Content availability

Available (1,074)
None (6)

Keywords

FEATURE EXTRACTION (541)
FACE RECOGNITION (231)
DATABASES (209)
FACE (182)
ACCURACY (162)
ARTIFICIAL NEURAL NETWORKS (161)
SUPPORT VECTOR MACHINES (150)
PATTERN RECOGNITION (141)
PRINCIPAL COMPONENT ANALYSIS (137)
CLASSIFICATION ALGORITHMS (134)
IMAGE SEGMENTATION (133)
TESTING (124)
IMAGE CLASSIFICATION (116)
DATA MINING (115)
SHAPE (104)
COMPUTER VISION (99)
VISUALIZATION (98)
PIXEL (95)
OBJECT RECOGNITION (86)
CHARACTER RECOGNITION (84)
LEARNING (ARTIFICIAL INTELLIGENCE) (82)
IMAGE COLOR ANALYSIS (80)
NEURAL NETWORKS (79)
CAMERAS (75)
HIDDEN MARKOV MODELS (75)
NEURAL NETS (71)
COMPUTATIONAL MODELING (70)
IMAGE PROCESSING (70)
ALGORITHM DESIGN AND ANALYSIS (68)
MACHINE LEARNING (68)
ROBUSTNESS (68)
NEURONS (67)
HISTOGRAMS (65)
KERNEL (65)
LIGHTING (65)
HUMANS (60)
HANDWRITING RECOGNITION (59)
SIGNAL PROCESSING (55)
VECTORS (54)
IMAGE REPRESENTATION (53)
SUPPORT VECTOR MACHINE CLASSIFICATION (53)
IMAGE RESOLUTION (52)
TRAINING DATA (52)
TRANSFORMS (51)
MATHEMATICAL MODEL (50)
NOISE (50)
EQUATIONS (48)
WAVELET TRANSFORMS (44)
IMAGE RECONSTRUCTION (43)
SIGNAL PROCESSING ALGORITHMS (43)
SUPPORT VECTOR MACHINE (42)
BACKPROPAGATION (41)
BIOMETRICS (ACCESS CONTROL) (41)
CORRELATION (41)
DETECTORS (41)
EIGENVALUES AND EIGENFUNCTIONS (41)
IMAGE EDGE DETECTION (41)
OBJECT DETECTION (41)
ESTIMATION (39)
IMAGE SEQUENCES (39)
NEURAL NETWORK (37)
COVARIANCE MATRIX (35)
COMPUTERS (34)
CONFERENCES (34)
TARGET RECOGNITION (33)
IMAGE MOTION ANALYSIS (32)
VEHICLES (32)
DEEP LEARNING (31)
IMAGE TEXTURE (31)
IMAGE CODING (30)
REAL TIME SYSTEMS (30)
BIOLOGICAL NEURAL NETWORKS (29)
DATA MODELS (29)
PCA (29)
EDUCATIONAL INSTITUTIONS (28)
CONVOLUTION (27)
DICTIONARIES (27)
GABOR FILTERS (27)
PATTERN CLASSIFICATION (27)
HIDDEN MARKOV MODEL (26)
TEXT RECOGNITION (26)
THREE DIMENSIONAL DISPLAYS (26)
VISUAL DATABASES (26)
IMAGE MATCHING (25)
MEDICAL IMAGE PROCESSING (25)
OPTIMIZATION (25)
COMPUTER ARCHITECTURE (24)
ENCODING (24)
GENETIC ALGORITHMS (23)
GESTURE RECOGNITION (23)
LEGGED LOCOMOTION (23)
VIDEO SIGNAL PROCESSING (23)
BIOMEDICAL IMAGING (22)
DISTANCE MEASUREMENT (22)
ELECTRONIC MAIL (22)
IMAGE COLOUR ANALYSIS (22)
MATRIX DECOMPOSITION (22)
SVM (22)
more

INFONA - science communication portal

Search results

Multi-task Curriculum Transfer Deep Learning of Clothing Attributes

Complex Event Recognition from Images with Few Training Examples

First-Person Action Decomposition and Zero-Shot Learning

Image augmentation by blocky artifact in Deep Convolutional Neural Network for handwritten digit recognition

Action Recognition in Still Images Using Word Embeddings from Natural Language Descriptions

Recognition of handwritten bilingual Characters-Numerals using shape context

Fine-grained vehicle recognition using hierarchical fine-tuning strategy for Urban Surveillance Videos

Machine learning framework for image classification

Mutually incoherent pose bases for Action recognition

Person re-identification using CNN features learned from combination of attributes

Effect of injected noise in deep neural networks

Learning face recognition from limited training data using deep neural networks

Dominant plane recognition in interior scenes from a single image

A hierarchical approach to event discovery from single images using MIL framework

Hybrid hypergraph construction for facial expression recognition

Exploiting supervised learning for finetuning deep CNNs in content based image retrieval

Supervised dictionary learning in BoF framework for Scene Character recognition

Simultaneous food localization and recognition

Scene text recognition with CNN classifier and WFST-based word labeling

User-generated content curation with deep convolutional neural networks

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options