Advanced search

Advanced search in people

From:

To:

Items from 21 to 40 out of 1,154 results

chapter

Hairstyle pattern recognition based on CNNs

Chao Sun, Won-Sook Lee

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1840 - 1845

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Hairstyle recognition is a challenging task since hairstyles span a diverse range of appearances in real-world. However, it is possible to start from recognizing the most basic hairstyles then dealing with more complex hairstyles. In this paper, we present a novel hairstyle pattern recognition system based on CNNs. We first give the definitions of four basic hairstyles: straight hairstyle, curly hairstyle,...

chapter

Rare Chinese character recognition by Radical Extraction Network

Ziang Yan, Chengzhe Yan, Changshui Zhang

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 924 - 929

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Building a modern Optical Character Recognition (OCR) system for Chinese is hard due to the large Chinese vocabulary list. Training images for rare Chinese characters are extremely expensive to obtain. Radical-based OCR systems tackle this problem by first extracting and recognizing basic graphical components (i.e., radicals) of a Chinese character. However, how to reliably recognize radicals still...

chapter

A minimal convolutional neural network for handwritten digit recognition

Matthew Y. W. Teow

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 171 - 176

2017 7th IEEE International Conference on System Engineering and Technology (ICSET)

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network using a minimal model. The proposed minimal convolutional neural network is presented using a layering approach. This approach provides a clear understanding of the main mathematical operations in a convolutional neural network. Hence,...

chapter

Gender recognition from face images using a geometric descriptor

Marcos Vinicius Mussel Cirne, Helio Pedrini

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 2006 - 2011

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Gender recognition from face images is a challenging problem with applications in various knowledge domains, such as biometrics, security and surveillance, human-computer interaction, among others. In this work, we propose and evaluate a novel method for gender recognition based on a geometric descriptor constructed from a pre-defined face shape model. The proposed approach, tested on four different...

chapter

Deep features for breast cancer histopathological image classification

Fabio A. Spanhol, Luiz S. Oliveira, Paulo R. Cavalin, Caroline Petitjean, more

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1868 - 1873

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Breast cancer (BC) is a deadly disease, killing millions of people every year. Developing automated malignant BC detection system applied on patient's imagery can help dealing with this problem more efficiently, making diagnosis more scalable and less prone to errors. Not less importantly, such kind of research can be extended to other types of cancer, making even more impact to help saving lives...

chapter

DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

Lili Pan, Samira Pouyanfar, Hao Chen, Jiaohua Qin, more

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC) > 181 - 189

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)

Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of food image classification using deep learning techniques. However, very little work has been studied for the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood which not only extracts rich and effective features...

chapter

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks

Tanmay Gupta, Kevin Shih, Saurabh Singh, Derek Hoiem

2017 IEEE International Conference on Computer Vision (ICCV) > 4223 - 4232

2017 IEEE International Conference on Computer Vision (ICCV)

An important goal of computer vision is to build systems that learn visual representations over time that can be applied to many tasks. In this paper, we investigate a vision-language embedding as a core representation and show that it leads to better cross-task transfer than standard multitask learning. In particular, the task of visual recognition is aligned to the task of visual question answering...

chapter

Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework

Michal Busta, Lukas Neumann, Jiri Matas

2017 IEEE International Conference on Computer Vision (ICCV) > 2223 - 2231

2017 IEEE International Conference on Computer Vision (ICCV)

A method for scene text localization and recognition is proposed. The novelties include: training of both text detection and recognition in a single end-to-end pass, the structure of the recognition CNN and the geometry of its input layer that preserves the aspect of the text and adapts its resolution to the data.,,The proposed method achieves state-of-the-art accuracy in the end-to-end text recognition...

chapter

Recurrent Models for Situation Recognition

Arun Mallya, Svetlana Lazebnik

2017 IEEE International Conference on Computer Vision (ICCV) > 455 - 463

2017 IEEE International Conference on Computer Vision (ICCV)

This work proposes Recurrent Neural Network (RNN) models to predict structured ‘image situations’ – actions and noun entities fulfilling semantic roles related to the action. In contrast to prior work relying on Conditional Random Fields (CRFs), we use a specialized action prediction network followed by an RNN for noun prediction. Our system obtains state-of-the-art accuracy on the challenging recent...

chapter

A novel connectivity of deep convolutional neural networks

Zhixi Shen, Yong Liu

2017 Chinese Automation Congress (CAC) > 7779 - 7783

2017 Chinese Automation Congress (CAC)

Residual network(ResNet) is an effective instance and a significant extension of deep convolutional neural network. ResNet utilizes skip-connection between input layers and output layers to solve the vanishing gradient problem. Due to the powerfulness of skip-connection, the gradient can flow directly through the identity function from later layers to the earlier layers. However, skip-connection makes...

chapter

Lattice Long Short-Term Memory for Human Action Recognition

Lin Sun, Kui Jia, Kevin Chen, Dit Yan Yeung, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2166 - 2175

2017 IEEE International Conference on Computer Vision (ICCV)

Human actions captured in video sequences are threedimensional signals characterizing visual appearance and motion dynamics. To learn action patterns, existing methods adopt Convolutional and/or Recurrent Neural Networks (CNNs and RNNs). CNN based methods are effective in learning spatial appearances, but are limited in modeling long-term motion dynamics. RNNs, especially Long Short- Term Memory (LSTM),...

chapter

Learning Visual N-Grams from Web Data

Ang Li, Allan Jabri, Armand Joulin, Laurens van der Maaten

2017 IEEE International Conference on Computer Vision (ICCV) > 4193 - 4202

2017 IEEE International Conference on Computer Vision (ICCV)

Real-world image recognition systems need to recognize tens of thousands of classes that constitute a plethora of visual concepts. The traditional approach of annotating thousands of images per class for training is infeasible in such a scenario, prompting the use of webly supervised data. This paper explores the training of image-recognition systems on large numbers of images and associated user...

chapter

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach

Timnit Gebru, Judy Hoffman, Li Fei-Fei

2017 IEEE International Conference on Computer Vision (ICCV) > 1358 - 1367

2017 IEEE International Conference on Computer Vision (ICCV)

While fine-grained object recognition is an important problem in computer vision, current models are unlikely to accurately classify objects in the wild. These fully supervised models need additional annotated images to classify objects in every new scenario, a task that is infeasible. However, sources such as e-commerce websites and field guides provide annotated images for many classes. In this...

chapter

Large-Scale Image Retrieval with Attentive Deep Local Features

Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3476 - 3485

2017 IEEE International Conference on Computer Vision (ICCV)

We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELE (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for key point selection,...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

2017 IEEE International Conference on Computer Vision (ICCV)

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

chapter

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

2017 IEEE International Conference on Computer Vision (ICCV) > 589 - 598

2017 IEEE International Conference on Computer Vision (ICCV)

Recognizing how objects interact with each other is a crucial task in visual recognition. If we define the context of the interaction to be the objects involved, then most current methods can be categorized as either: (i) training a single classifier on the combination of the interaction and its context; or (ii) aiming to recognize the interaction independently of its explicit context. Both methods...

article

Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition

Sebastien C. Wong, Victor Stamatescu, Adam Gatt, David Kearney, more

IEEE Transactions on Image Processing > 2017 > 26 > 10 > 4669 - 4683

This paper addresses the problem of online tracking and classification of multiple objects in an image sequence. Our proposed solution is to first track all objects in the scene without relying on object-specific prior knowledge, which in other systems can take the form of hand-crafted features or user-based track initialization. We then classify the tracked objects with a fast-learning image classifier,...

chapter

Elevator button and floor number recognition through hybrid image classification approach for navigation of service robot in buildings

Kh Tohidul Islam, Ghulam Mujtaba, Ram Gopal Raj, Henry Friday Nweke

2017 International Conference on Engineering Technology and Technopreneurship (ICE2T) > 1 - 4

2017 International Conference on Engineering Technology and Technopreneurship (ICE2T)

To successfully move a robot into the building, the elevator button and elevator floor number detection and recognition can play an important role. It can help a robot move in the building, just as it also can help a visually impaired person who wants to move another floor in the building. Due to vision-based approach, the difference in lighting condition and the complex background are the main obstacles...

chapter

A layer-block-wise pipeline for memory and bandwidth reduction in distributed deep learning

Haruki Mori, Tetsuya Youkawa, Shintaro Izumi, Masahiko Yoshimoto, more

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

This paper describes a pipelined stochastic gradient descent (SGD) algorithm and its hardware architecture with a memory distributed structure. In the proposed architecture, a pipeline stage takes charge of multiple layers: a “layer block.” The layer-block-wise pipeline has much less weight parameters for network training than conventional multithreading because weight memory is distributed to workers...

chapter

Image invariant description based on local Fourier-Mellin transform

Yassine Lehiani, Madjid Maidi, Marius Preda, Faouzi Ghorbel

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 159 - 163

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)

In this paper, we present a novel approach for real-time object identification on a mobile platform. First, our system detects keypoints within a scaled pyramid-based FAST detector and then descriptors of the object of interest are computed using an Analytical Fourier-Mellin transform. The Fourier-Mellin is used in similarity studies due to its invariance property and discrimination power. In this...

Keywords:
TRAINING
IMAGE RECOGNITION

Publication date

Set your own date range

Content availability

Available (1,148)
None (6)

Publication type

book (1,080)
article (74)

Keywords

FEATURE EXTRACTION (575)
FACE RECOGNITION (246)
DATABASES (213)
FACE (190)
ACCURACY (168)
ARTIFICIAL NEURAL NETWORKS (162)
SUPPORT VECTOR MACHINES (158)
PATTERN RECOGNITION (144)
PRINCIPAL COMPONENT ANALYSIS (143)
IMAGE SEGMENTATION (138)
CLASSIFICATION ALGORITHMS (136)
TESTING (129)
IMAGE CLASSIFICATION (123)
DATA MINING (115)
VISUALIZATION (114)
SHAPE (108)
COMPUTER VISION (104)
PIXEL (98)
OBJECT RECOGNITION (92)
IMAGE COLOR ANALYSIS (87)
NEURAL NETWORKS (86)
CHARACTER RECOGNITION (84)
LEARNING (ARTIFICIAL INTELLIGENCE) (83)
CAMERAS (79)
HIDDEN MARKOV MODELS (77)
COMPUTATIONAL MODELING (76)
KERNEL (73)
ROBUSTNESS (73)
ALGORITHM DESIGN AND ANALYSIS (71)
HISTOGRAMS (71)
NEURAL NETS (71)
IMAGE PROCESSING (70)
LIGHTING (70)
MACHINE LEARNING (70)
NEURONS (69)
HUMANS (61)
VECTORS (60)
HANDWRITING RECOGNITION (59)
IMAGE REPRESENTATION (58)
TRAINING DATA (57)
SIGNAL PROCESSING (55)
IMAGE RESOLUTION (53)
MATHEMATICAL MODEL (53)
SUPPORT VECTOR MACHINE CLASSIFICATION (53)
TRANSFORMS (51)
NOISE (50)
EQUATIONS (48)
DETECTORS (46)
IMAGE RECONSTRUCTION (46)
WAVELET TRANSFORMS (45)
CORRELATION (44)
OBJECT DETECTION (44)
SIGNAL PROCESSING ALGORITHMS (44)
BIOMETRICS (ACCESS CONTROL) (42)
EIGENVALUES AND EIGENFUNCTIONS (42)
SUPPORT VECTOR MACHINE (42)
BACKPROPAGATION (41)
IMAGE EDGE DETECTION (41)
IMAGE SEQUENCES (41)
ESTIMATION (40)
TARGET RECOGNITION (39)
NEURAL NETWORK (38)
COVARIANCE MATRIX (36)
DEEP LEARNING (36)
COMPUTERS (34)
CONFERENCES (34)
VEHICLES (34)
DICTIONARIES (33)
IMAGE MOTION ANALYSIS (33)
IMAGE TEXTURE (33)
DATA MODELS (31)
IMAGE CODING (31)
REAL TIME SYSTEMS (30)
BIOLOGICAL NEURAL NETWORKS (29)
GABOR FILTERS (29)
PCA (29)
CONVOLUTION (28)
EDUCATIONAL INSTITUTIONS (28)
OPTIMIZATION (28)
THREE DIMENSIONAL DISPLAYS (28)
HIDDEN MARKOV MODEL (27)
IMAGE MATCHING (27)
PATTERN CLASSIFICATION (27)
VISUAL DATABASES (27)
COMPUTER ARCHITECTURE (26)
MEDICAL IMAGE PROCESSING (26)
TEXT RECOGNITION (26)
SEMANTICS (25)
CONVOLUTIONAL NEURAL NETWORK (24)
ENCODING (24)
GESTURE RECOGNITION (24)
LEGGED LOCOMOTION (24)
GENETIC ALGORITHMS (23)
IMAGE COLOUR ANALYSIS (23)
MATRIX DECOMPOSITION (23)
VIDEO SIGNAL PROCESSING (23)
BIOMEDICAL IMAGING (22)
DISTANCE MEASUREMENT (22)
more

Data set

ieee (1,153)
Springer (1)

INFONA - science communication portal

Advanced search

Advanced search in people

Hairstyle pattern recognition based on CNNs

Rare Chinese character recognition by Radical Extraction Network

A minimal convolutional neural network for handwritten digit recognition

Gender recognition from face images using a geometric descriptor

Deep features for breast cancer histopathological image classification

DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks

Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework

Recurrent Models for Situation Recognition

A novel connectivity of deep convolutional neural networks

Lattice Long Short-Term Memory for Human Action Recognition

Learning Visual N-Grams from Web Data

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach

Large-Scale Image Retrieval with Attentive Deep Local Features

DualNet: Learn Complementary Features for Image Recognition

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition

Elevator button and floor number recognition through hybrid image classification approach for navigation of service robot in buildings

A layer-block-wise pipeline for memory and bandwidth reduction in distributed deep learning

Image invariant description based on local Fourier-Mellin transform

Filter options

Publication date

Content availability

Publication type

Keywords

Data set

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options