Search results

Items from 21 to 40 out of 1,080 results

chapter

Deep features for breast cancer histopathological image classification

Fabio A. Spanhol, Luiz S. Oliveira, Paulo R. Cavalin, Caroline Petitjean, more

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1868 - 1873

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Breast cancer (BC) is a deadly disease, killing millions of people every year. Developing automated malignant BC detection system applied on patient's imagery can help dealing with this problem more efficiently, making diagnosis more scalable and less prone to errors. Not less importantly, such kind of research can be extended to other types of cancer, making even more impact to help saving lives...

chapter

DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

Lili Pan, Samira Pouyanfar, Hao Chen, Jiaohua Qin, more

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC) > 181 - 189

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)

Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of food image classification using deep learning techniques. However, very little work has been studied for the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood which not only extracts rich and effective features...

chapter

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks

Tanmay Gupta, Kevin Shih, Saurabh Singh, Derek Hoiem

2017 IEEE International Conference on Computer Vision (ICCV) > 4223 - 4232

2017 IEEE International Conference on Computer Vision (ICCV)

An important goal of computer vision is to build systems that learn visual representations over time that can be applied to many tasks. In this paper, we investigate a vision-language embedding as a core representation and show that it leads to better cross-task transfer than standard multitask learning. In particular, the task of visual recognition is aligned to the task of visual question answering...

chapter

Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework

Michal Busta, Lukas Neumann, Jiri Matas

2017 IEEE International Conference on Computer Vision (ICCV) > 2223 - 2231

2017 IEEE International Conference on Computer Vision (ICCV)

A method for scene text localization and recognition is proposed. The novelties include: training of both text detection and recognition in a single end-to-end pass, the structure of the recognition CNN and the geometry of its input layer that preserves the aspect of the text and adapts its resolution to the data.,,The proposed method achieves state-of-the-art accuracy in the end-to-end text recognition...

chapter

Recurrent Models for Situation Recognition

Arun Mallya, Svetlana Lazebnik

2017 IEEE International Conference on Computer Vision (ICCV) > 455 - 463

2017 IEEE International Conference on Computer Vision (ICCV)

This work proposes Recurrent Neural Network (RNN) models to predict structured ‘image situations’ – actions and noun entities fulfilling semantic roles related to the action. In contrast to prior work relying on Conditional Random Fields (CRFs), we use a specialized action prediction network followed by an RNN for noun prediction. Our system obtains state-of-the-art accuracy on the challenging recent...

chapter

A novel connectivity of deep convolutional neural networks

Zhixi Shen, Yong Liu

2017 Chinese Automation Congress (CAC) > 7779 - 7783

2017 Chinese Automation Congress (CAC)

Residual network(ResNet) is an effective instance and a significant extension of deep convolutional neural network. ResNet utilizes skip-connection between input layers and output layers to solve the vanishing gradient problem. Due to the powerfulness of skip-connection, the gradient can flow directly through the identity function from later layers to the earlier layers. However, skip-connection makes...

chapter

Lattice Long Short-Term Memory for Human Action Recognition

Lin Sun, Kui Jia, Kevin Chen, Dit Yan Yeung, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2166 - 2175

2017 IEEE International Conference on Computer Vision (ICCV)

Human actions captured in video sequences are threedimensional signals characterizing visual appearance and motion dynamics. To learn action patterns, existing methods adopt Convolutional and/or Recurrent Neural Networks (CNNs and RNNs). CNN based methods are effective in learning spatial appearances, but are limited in modeling long-term motion dynamics. RNNs, especially Long Short- Term Memory (LSTM),...

chapter

Learning Visual N-Grams from Web Data

Ang Li, Allan Jabri, Armand Joulin, Laurens van der Maaten

2017 IEEE International Conference on Computer Vision (ICCV) > 4193 - 4202

2017 IEEE International Conference on Computer Vision (ICCV)

Real-world image recognition systems need to recognize tens of thousands of classes that constitute a plethora of visual concepts. The traditional approach of annotating thousands of images per class for training is infeasible in such a scenario, prompting the use of webly supervised data. This paper explores the training of image-recognition systems on large numbers of images and associated user...

chapter

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach

Timnit Gebru, Judy Hoffman, Li Fei-Fei

2017 IEEE International Conference on Computer Vision (ICCV) > 1358 - 1367

2017 IEEE International Conference on Computer Vision (ICCV)

While fine-grained object recognition is an important problem in computer vision, current models are unlikely to accurately classify objects in the wild. These fully supervised models need additional annotated images to classify objects in every new scenario, a task that is infeasible. However, sources such as e-commerce websites and field guides provide annotated images for many classes. In this...

chapter

Large-Scale Image Retrieval with Attentive Deep Local Features

Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, more

2017 IEEE International Conference on Computer Vision (ICCV) > 3476 - 3485

2017 IEEE International Conference on Computer Vision (ICCV)

We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELE (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for key point selection,...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

2017 IEEE International Conference on Computer Vision (ICCV)

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

chapter

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid

2017 IEEE International Conference on Computer Vision (ICCV) > 589 - 598

2017 IEEE International Conference on Computer Vision (ICCV)

Recognizing how objects interact with each other is a crucial task in visual recognition. If we define the context of the interaction to be the objects involved, then most current methods can be categorized as either: (i) training a single classifier on the combination of the interaction and its context; or (ii) aiming to recognize the interaction independently of its explicit context. Both methods...

chapter

Elevator button and floor number recognition through hybrid image classification approach for navigation of service robot in buildings

Kh Tohidul Islam, Ghulam Mujtaba, Ram Gopal Raj, Henry Friday Nweke

2017 International Conference on Engineering Technology and Technopreneurship (ICE2T) > 1 - 4

2017 International Conference on Engineering Technology and Technopreneurship (ICE2T)

To successfully move a robot into the building, the elevator button and elevator floor number detection and recognition can play an important role. It can help a robot move in the building, just as it also can help a visually impaired person who wants to move another floor in the building. Due to vision-based approach, the difference in lighting condition and the complex background are the main obstacles...

chapter

A layer-block-wise pipeline for memory and bandwidth reduction in distributed deep learning

Haruki Mori, Tetsuya Youkawa, Shintaro Izumi, Masahiko Yoshimoto, more

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)

This paper describes a pipelined stochastic gradient descent (SGD) algorithm and its hardware architecture with a memory distributed structure. In the proposed architecture, a pipeline stage takes charge of multiple layers: a “layer block.” The layer-block-wise pipeline has much less weight parameters for network training than conventional multithreading because weight memory is distributed to workers...

chapter

Image invariant description based on local Fourier-Mellin transform

Yassine Lehiani, Madjid Maidi, Marius Preda, Faouzi Ghorbel

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA) > 159 - 163

2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)

In this paper, we present a novel approach for real-time object identification on a mobile platform. First, our system detects keypoints within a scaled pyramid-based FAST detector and then descriptors of the object of interest are computed using an Analytical Fourier-Mellin transform. The Fourier-Mellin is used in similarity studies due to its invariance property and discrimination power. In this...

chapter

Recognition of Indian license plate number from live stream videos

B. Sachin Prabhu, Subramaniam Kalambur, Dinkar Sitaram

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 2359 - 2365

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Automatic License Plate Recognition (ALPR) has been employed in many developed countries for traffic management, automatic speed control, tracking stolen cars and also in automatic toll systems for improving the traffic control. ALPR is a surveillance system that extracts the information from the vehicle license plate by capturing the images. Human intervention to recognize the license plates results...

chapter

Holistic recognition of low quality license plates by CNN using track annotated data

Jakub Spanhel, Jakub Sochor, Roman Juranek, Adam Herout, more

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

This work is focused on recognition of license plates in low resolution and low quality images. We present a methodology for collection of real world (non-synthetic) dataset of low quality license plate images with ground truth transcriptions. Our approach to the license plate recognition is based on a Convolutional Neural Network which holistically processes the whole image, avoiding segmentation...

chapter

Optical character recognition using KNN on custom image dataset

Tapan Kumar Hazra, Dhirendra Pratap Singh, Nikunj Daga

2017 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON) > 110 - 114

2017 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON)

The aim is to develop an efficient method which uses a custom image to train the classifier. This OCR extract distinct features from the input image for classifying its contents as characters specifically letters and digits. Input to the system is digital images containing the patterns to be classified. The analysis and recognition of the patterns in images are becoming more complex, yet easy with...

chapter

Study of image-based expression recognition techniques on three recent spontaneous databases

Hayfaa Hussein, Mohsen Naqvi, Jonathon Chambers

2017 22nd International Conference on Digital Signal Processing (DSP) > 1 - 5

2017 22nd International Conference on Digital Signal Processing (DSP)

Recent work in the recognition of naturalistic expressions, which is also known as spontaneous facial expressions recognition, has attracted researchers' attention due to its importance in different behavioural and clinical applications. The main design challenges in the area of emotion computing for automatic recognition of spontaneous facial expression are the face pose, capture distance, illumination...

chapter

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

Jose Lezama, Qiang Qiu, Guillermo Sapiro

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6807 - 6816

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Surveillance cameras today often capture NIR (near infrared) images in low-light environments. However, most face datasets accessible for training and verification are only collected in the VIS (visible light) spectrum. It remains a challenging problem to match NIR to VIS face images due to the different light spectrum. Recently, breakthroughs have been made for VIS face recognition by applying deep...

Keywords:
TRAINING
IMAGE RECOGNITION

Publication date

Set your own date range

Content availability

Available (1,074)
None (6)

Keywords

FEATURE EXTRACTION (541)
FACE RECOGNITION (231)
DATABASES (209)
FACE (182)
ACCURACY (162)
ARTIFICIAL NEURAL NETWORKS (161)
SUPPORT VECTOR MACHINES (150)
PATTERN RECOGNITION (141)
PRINCIPAL COMPONENT ANALYSIS (137)
CLASSIFICATION ALGORITHMS (134)
IMAGE SEGMENTATION (133)
TESTING (124)
IMAGE CLASSIFICATION (116)
DATA MINING (115)
SHAPE (104)
COMPUTER VISION (99)
VISUALIZATION (98)
PIXEL (95)
OBJECT RECOGNITION (86)
CHARACTER RECOGNITION (84)
LEARNING (ARTIFICIAL INTELLIGENCE) (82)
IMAGE COLOR ANALYSIS (80)
NEURAL NETWORKS (79)
CAMERAS (75)
HIDDEN MARKOV MODELS (75)
NEURAL NETS (71)
COMPUTATIONAL MODELING (70)
IMAGE PROCESSING (70)
ALGORITHM DESIGN AND ANALYSIS (68)
MACHINE LEARNING (68)
ROBUSTNESS (68)
NEURONS (67)
HISTOGRAMS (65)
KERNEL (65)
LIGHTING (65)
HUMANS (60)
HANDWRITING RECOGNITION (59)
SIGNAL PROCESSING (55)
VECTORS (54)
IMAGE REPRESENTATION (53)
SUPPORT VECTOR MACHINE CLASSIFICATION (53)
IMAGE RESOLUTION (52)
TRAINING DATA (52)
TRANSFORMS (51)
MATHEMATICAL MODEL (50)
NOISE (50)
EQUATIONS (48)
WAVELET TRANSFORMS (44)
IMAGE RECONSTRUCTION (43)
SIGNAL PROCESSING ALGORITHMS (43)
SUPPORT VECTOR MACHINE (42)
BACKPROPAGATION (41)
BIOMETRICS (ACCESS CONTROL) (41)
CORRELATION (41)
DETECTORS (41)
EIGENVALUES AND EIGENFUNCTIONS (41)
IMAGE EDGE DETECTION (41)
OBJECT DETECTION (41)
ESTIMATION (39)
IMAGE SEQUENCES (39)
NEURAL NETWORK (37)
COVARIANCE MATRIX (35)
COMPUTERS (34)
CONFERENCES (34)
TARGET RECOGNITION (33)
IMAGE MOTION ANALYSIS (32)
VEHICLES (32)
DEEP LEARNING (31)
IMAGE TEXTURE (31)
IMAGE CODING (30)
REAL TIME SYSTEMS (30)
BIOLOGICAL NEURAL NETWORKS (29)
DATA MODELS (29)
PCA (29)
EDUCATIONAL INSTITUTIONS (28)
CONVOLUTION (27)
DICTIONARIES (27)
GABOR FILTERS (27)
PATTERN CLASSIFICATION (27)
HIDDEN MARKOV MODEL (26)
TEXT RECOGNITION (26)
THREE DIMENSIONAL DISPLAYS (26)
VISUAL DATABASES (26)
IMAGE MATCHING (25)
MEDICAL IMAGE PROCESSING (25)
OPTIMIZATION (25)
COMPUTER ARCHITECTURE (24)
ENCODING (24)
GENETIC ALGORITHMS (23)
GESTURE RECOGNITION (23)
LEGGED LOCOMOTION (23)
VIDEO SIGNAL PROCESSING (23)
BIOMEDICAL IMAGING (22)
DISTANCE MEASUREMENT (22)
ELECTRONIC MAIL (22)
IMAGE COLOUR ANALYSIS (22)
MATRIX DECOMPOSITION (22)
SVM (22)
more

INFONA - science communication portal

Search results

Deep features for breast cancer histopathological image classification

DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks

Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework

Recurrent Models for Situation Recognition

A novel connectivity of deep convolutional neural networks

Lattice Long Short-Term Memory for Human Action Recognition

Learning Visual N-Grams from Web Data

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach

Large-Scale Image Retrieval with Attentive Deep Local Features

DualNet: Learn Complementary Features for Image Recognition

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

Elevator button and floor number recognition through hybrid image classification approach for navigation of service robot in buildings

A layer-block-wise pipeline for memory and bandwidth reduction in distributed deep learning

Image invariant description based on local Fourier-Mellin transform

Recognition of Indian license plate number from live stream videos

Holistic recognition of low quality license plates by CNN using track annotated data

Optical character recognition using KNN on custom image dataset

Study of image-based expression recognition techniques on three recent spontaneous databases

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options