2016 23rd International Conference on Pattern Recognition (ICPR)

chapter

Sequential vs. batch machine-learning with evolutionary hyperparameter optimization for segmenting aortic dissection thrombus

Cosmin Adrian Morariu, Malte Thomas, Josef Pauli, Daniel Sebastian Dohle, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 1189 - 1194

While delineation of aortic aneurysms has been subject of research in several publications, this represents the first contribution to address segmentation of thrombus in case of aortic dissections. The segmentation process ensues in multiplanar reformated slices (MPRs). In 3D CTA data, thrombus hardly differs from surrounding tissue outside the aorta. Segmentation is further complicated by the high...

chapter

A DBN-crf for spectral-spatial classification of hyperspectral data

Ping Zhong, Zhiqiang Gong, Carola-Bibiane Schonlieb

2016 23rd International Conference on Pattern Recognition (ICPR) > 1219 - 1224

2016 23rd International Conference on Pattern Recognition (ICPR)

This work shows how to improve hyperspectral image classification through using both a deep representation and contextual information. To implement this objective, this work proposes a new Conditional Random Field (CRF) model (named DBN-CRF) with potentials defined over deep features produced by the Deep Belief Networks (DBNs). The newly formulated DBN-CRF model takes advantage of strength of the...

chapter

What does scene text tell us?

Seiichi Uchida, Yuto Shinahara

2016 23rd International Conference on Pattern Recognition (ICPR) > 4047 - 4052

2016 23rd International Conference on Pattern Recognition (ICPR)

Scene text is one of the most important information sources for our daily life because it has particular functions such as disambiguation and navigation. In contrast, ordinary document text has no such function. Consequently, it is natural to have a hypothesis that scene text and document text have different characteristics. This paper tries to prove this hypothesis by semantic analysis of texts by...

chapter

anyOCR: A sequence learning based OCR system for unlabeled historical documents

Martin Jenckel, Syed Saqib Bukhari, Andreas Dengel

2016 23rd International Conference on Pattern Recognition (ICPR) > 4035 - 4040

2016 23rd International Conference on Pattern Recognition (ICPR)

Institutes and libraries around the globe are preserving the literary heritage by digitizing historical documents. However, to make this data easily accessible the scanned documents need to be transformed into search-able text. State of the art OCR systems using Long-Short-Term-Memory networks (LSTM) have been applied successfully to recognize text in both printed and handwritten form. Besides the...

chapter

HEp-2 specimen classification via deep CNNs and pattern histogram

Hongwei Li, Hao Huang, Wei-Shi Zheng, Xiaohua Xie, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 2145 - 2149

2016 23rd International Conference on Pattern Recognition (ICPR)

Automatic classification of Human Epithelial Type-2 (HEp-2) specimen patterns is an important yet challenging problem in medical image analysis. Most prior works have primarily focused on cells images classification problem which is one of the early essential steps in the system pipeline, while less attention has been paid to the classification of whole-specimen ones. In this work, a specimen pattern...

chapter

Hybrid hypergraph construction for facial expression recognition

Yuchi Huang, Hanqing Lu

2016 23rd International Conference on Pattern Recognition (ICPR) > 4142 - 4147

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we proposed a novel framework for facial expression recognition, in which face images were taken as vertices in a hypergraph and the task of expression recognition was formulated as the problem of hypergraph based inference. A hybrid strategy was developed to construct hyperedges: we generated probabilities of facial action units by deep convolutional networks and took each action unit...

chapter

Distinguishing text/non-text natural images with Multi-Dimensional Recurrent Neural Networks

Pengyuan Lyu, Baoguang Shi, Chengquan Zhang, Xiang Bai

2016 23rd International Conference on Pattern Recognition (ICPR) > 3981 - 3986

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we focus on the text/non-text classification problem: distinguishing images that contain text from a lot of natural images. To this end, we propose a novel neural network architecture, termed Convolutional Multi-Dimensional Recurrent Neural Network (CMDRNN), which distinguishes text/non-text images by classifying local image blocks, taking both region pixels and dependencies among blocks...

chapter

Beyond verbs: Understanding actions in videos with text

Shujon Naha, Yang Wang

2016 23rd International Conference on Pattern Recognition (ICPR) > 1833 - 1838

2016 23rd International Conference on Pattern Recognition (ICPR)

We consider the problem of joint modeling of videos and their corresponding textual descriptions (e.g. sentences or phrases). Our approach consists of three components: the video representation, the textual representation, and a joint model that links videos and text. Our video representation uses the state-of-the-art deep 3D ConvNet to capture the semantic information in the video. Our textual representation...

chapter

Adaptive hierarchical classification networks

Sai Prasad Nooka, Sumanth Chennupati, Karthik Veerabhadra, Shagan Sah, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 3578 - 3583

2016 23rd International Conference on Pattern Recognition (ICPR)

Hierarchical decomposition enables increased number of classes in a classification problem. Class similarities guide the creation of a family of course to fine classifiers which solve categorical problems more effectively than a single flat classifier. High accuracies require precise configurations for each of the family of classifiers. This paper proposes a method to adaptively select the configuration...

chapter

HEp-2 specimen classification with fully convolutional network

Yuexiang Li, Linlin Shen, Xiande Zhou, Shiqi Yu

2016 23rd International Conference on Pattern Recognition (ICPR) > 96 - 100

2016 23rd International Conference on Pattern Recognition (ICPR)

Reliable automatic system for Human Epithelial-2 (HEp-2) cell image classification can facilitate the diagnosis of systemic autoimmune diseases. In this paper, an automatic pattern recognition system using fully convolutional network (FCN) was proposed to address the HEp-2 specimen classification problem. The FCN in the proposed framework was adapted from VGG-16, which was trained with ICPR 2016 dataset...

chapter

Extracting a background image by a multi-modal scene background model

Lucia Maddalena, Alfredo Petrosino

2016 23rd International Conference on Pattern Recognition (ICPR) > 143 - 148

2016 23rd International Conference on Pattern Recognition (ICPR)

In scene analysis, the availability of an initial background model that describes the scene without foreground objects is at the basis of many computer vision applications. Multi-modal models of the scene background are frequently adopted in the applications, where each mode tries to keep track of the multiple background modes observed along the sequence. In this work we specifically address the problem...

chapter

A preliminary study of CNNs for iris and periocular verification in the visible spectrum

Karan Ahuja, Rahul Islam, Ferdous A. Barbhuiya, Kuntal Dey

2016 23rd International Conference on Pattern Recognition (ICPR) > 181 - 186

2016 23rd International Conference on Pattern Recognition (ICPR)

Ocular biometrics in the visible spectrum has emerged as an area of significant research activity. In this paper, we propose two convolution-based models for verifying a pair of periocular images containing the iris, and compare the two approaches amongst each other as well as with a baseline model. In the first approach, we perform deep learning in an unsupervised manner using a stacked convolutional...

chapter

Bi-modal regression for Apparent Personality trait Recognition

Nishant Rai

2016 23rd International Conference on Pattern Recognition (ICPR) > 55 - 60

2016 23rd International Conference on Pattern Recognition (ICPR)

The task of the ChaLearn Apparent Personality Analysis: First Impressions Challenge is to rate/quantify personality traits of users in short video sequences. Although the validity of personality judgments from short interactions is questionable, studies show the possibility of predicting attributed traits (First Impressions) using facial [15] and acoustic [13] features. The challenge introduces a...

chapter

Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks

Pichao Wang, Wanqing Li, Song Liu, Yuyao Zhang, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 13 - 18

2016 23rd International Conference on Pattern Recognition (ICPR)

This paper addresses the problem of continuous gesture recognition from sequences of depth maps using Convolutional Neural networks (ConvNets). The proposed method first segments individual gestures from a depth sequence based on quantity of movement (QOM). For each segmented gesture, an Improved Depth Motion Map (IDMM), which converts the depth sequence into one image, is constructed and fed to a...

chapter

Multi-script writer identification using dissimilarity

Diego Bertolini, Luiz S. Oliveira, Robert Sabourin

2016 23rd International Conference on Pattern Recognition (ICPR) > 3025 - 3030

2016 23rd International Conference on Pattern Recognition (ICPR)

Multi-script writer identification consists in identifying a person of a given text written in one script from the samples of the same person written in another script. The rationale behind this is that the writing style of an individual remains constant across different scripts. While this hypothesis may hold, recent results on a multi-script writer identification competition show that classical...

chapter

Coupled multiple dictionary learning based on edge sharpness for single-image super-resolution

Junaid Ahmed, Reinhard Klette

2016 23rd International Conference on Pattern Recognition (ICPR) > 3838 - 3843

2016 23rd International Conference on Pattern Recognition (ICPR)

In this article a new strategy for single-image super-resolution is proposed. A selective sparse coding strategy based on patch sharpness is assumed to be invariant for patch resolution. This sharpness criterion is used at training stage to classify image patches into different clusters. It is suggested that the use of coupled dictionary learning, with a mapping function can improve the representation...

chapter

BranchyNet: Fast inference via early exiting from deep neural networks

Surat Teerapittayanon, Bradley McDanel, H.T. Kung

2016 23rd International Conference on Pattern Recognition (ICPR) > 2464 - 2469

2016 23rd International Conference on Pattern Recognition (ICPR)

Deep neural networks are state of the art methods for many learning tasks due to their ability to extract increasingly better features at each network layer. However, the improved performance of additional layers in a deep network comes at the cost of added latency and energy usage in feedforward inference. As networks continue to get deeper and larger, these costs become more prohibitive for real-time...

chapter

Context-aware mathematical expression recognition: An end-to-end framework and a benchmark

Wenhao He, Yuxuan Luo, Fei Yin, Han Hu, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 3246 - 3251

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper we propose a novel end-to-end framework for mathematical expression (ME) recognition. The method uses a convolutional neural network (CNN) to perform mathematical symbol detection and recognition simultaneously incorporating spatial context, and can handle multi-part and touching symbols effectively. To evaluate the performance, we provide a benchmark that contains MEs both from real-life...

chapter

On looking at faces in an automobile: Issues, algorithms and evaluation on naturalistic driving dataset

Kevan Yuen, Sujitha Martin, Mohan M. Trivedi

2016 23rd International Conference on Pattern Recognition (ICPR) > 2777 - 2782

2016 23rd International Conference on Pattern Recognition (ICPR)

Face detection is a vital step in the process of extracting semantic information about the driver's state, such as distraction and fatigue, from pixel values in images looking at the driver. Therefore, in the context of time and safety critical situation like driving, efficient use of time and reliable detection of faces is essential. While challenges like lighting and occlusion are prevalent in the...

chapter

Deep structured-output regression learning for computational color constancy

Yanlin Qian, Ke Chen, Joni-Kristian Kamarainen, Jarno Nikkanen, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 1899 - 1904

2016 23rd International Conference on Pattern Recognition (ICPR)

The color constancy problem is addressed by structured-output regression on the values of the fully-connected layers of a convolutional neural network. The AlexNet and the VGG are considered and VGG slightly outperformed AlexNet. Best results were obtained with the first fully-connected “fc₆” layer and with multi-output support vector regression. Experiments on the SFU Color Checker and Indoor Dataset...

INFONA - science communication portal

2016 23rd International Conference on Pattern Recognition (ICPR)

Sequential vs. batch machine-learning with evolutionary hyperparameter optimization for segmenting aortic dissection thrombus

A DBN-crf for spectral-spatial classification of hyperspectral data

What does scene text tell us?

anyOCR: A sequence learning based OCR system for unlabeled historical documents

HEp-2 specimen classification via deep CNNs and pattern histogram

Hybrid hypergraph construction for facial expression recognition

Distinguishing text/non-text natural images with Multi-Dimensional Recurrent Neural Networks

Beyond verbs: Understanding actions in videos with text

Adaptive hierarchical classification networks

HEp-2 specimen classification with fully convolutional network

Extracting a background image by a multi-modal scene background model

A preliminary study of CNNs for iris and periocular verification in the visible spectrum

Bi-modal regression for Apparent Personality trait Recognition

Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks

Multi-script writer identification using dissimilarity

Coupled multiple dictionary learning based on edge sharpness for single-image super-resolution

BranchyNet: Fast inference via early exiting from deep neural networks

Context-aware mathematical expression recognition: An end-to-end framework and a benchmark

On looking at faces in an automobile: Issues, algorithms and evaluation on naturalistic driving dataset

Deep structured-output regression learning for computational color constancy

Filter options

Publication date

Keywords

INFONA - science communication portal

2016 23rd International Conference on Pattern Recognition (ICPR) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2016 23rd International Conference on Pattern Recognition (ICPR)