Search results

chapter

[POSTER] A Probabilistic Combination of CNN and RNN Estimates for Hand Gesture Based Interaction in Car

Aditya Tewari, Bertram Taetz, Frederic Grandidier, Didier Stricker

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 1 - 6

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

Hand Gesture Recognition is completed on top-view hand images observed by a Time of Flight(ToF) camera in a car. The work attempts to solve two important problems of touchless interactions inside a car. First, low latency identification of the gestures which are unobtrusive for the driver. Second, reducing the labelled data required to train learning based solutions, this is particularly important...

chapter

Lip-reading via a DNN-HMM hybrid system using combination of the image-based and model-based features

Mohammad Hasan Rahmani, Farshad Almasganj

2017 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA) > 195 - 199

2017 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Introducing features that better represent the visual information of speakers during the speech production is still an open issue that highly affects the quality of the lip-reading and Audio Visual Speech Recognition (AVSR) tasks. In this paper, three different types of visual features from both the image-based and model-based ones are investigated inside a professional lip reading task. The simple...

chapter

Human Activity Recognition using depth body part histograms and Hidden Markov Models

Md. Zia Uddin, Jim Torresen, Taskeed Jabid

2016 International Conference on Innovations in Science, Engineering and Technology (ICISET) > 1 - 4

2016 International Conference on Innovations in Science, Engineering and Technology (ICISET)

This paper proposes a novel approach for human activity recognition based on body part histograms and Hidden Markov Models. From a depth video frame, body parts are segmented first using a trained random forest. Then, a histogram for each body part is combined to represent histogram features for a depth image. The depth video activity features are then applied on hidden Markov models for training...

chapter

DNN-HMM for Large Vocabulary Mongolian Offline Handwriting Recognition

Fan Daoerji, Gao Guanglai

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 72 - 77

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)

In this paper, we propose a large vocabulary Mongolian offline handwriting recognition system, using hidden Markov models (HMMs)-deep neural networks (DNN) hybrid architectures which shows superior performance on auto speech recognize (ASR) tasks. We select 50 sub-characters from all shape of Mongolian letters as the smallest modeling unit. First, a set of intensity features are extracted from each...

chapter

Class-Based Contextual Modeling for Handwritten Arabic Text Recognition

Irfan Ahmad, Gernot A. Fink

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 554 - 559

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)

In this paper we will present our investigations related to contextual modeling for HMM-based handwritten Arabic text recognition. We will, first, discuss the justifications and the need for contextual modeling for handwritten Arabic text recognition. Next, we will discuss the issues related to contextual modeling for Arabic text recognition. Finally, we will present our novel class-based contextual...

chapter

Frequency count based two stage classification for online handwritten character recognition

Subhasis Mandal, Himakshi Choudhury, S. R. Mahadeva Prasanna, Suresh Sundaram

2016 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2016 International Conference on Signal Processing and Communications (SPCOM)

A frequency count based two stage classification approach is proposed by combining generative and discriminative modeling principles for online handwritten character recognition. The first stage classifier based on Hidden Markov Model (HMM) returns top-K ranking characters out of the total N classes. In the second stage, pairwise classifiers for K(K − 1)/2 unique combinations of top-K characters using...

chapter

Deep Learning of Mouth Shapes for Sign Language

Oscar Koller, Hermann Ney, Richard Bowden

2015 IEEE International Conference on Computer Vision Workshop (ICCVW) > 477 - 483

2015 IEEE International Conference on Computer Vision Workshop (ICCVW)

This paper deals with robust modelling of mouth shapes in the context of sign language recognition using deep convolutional neural networks. Sign language mouth shapes are difficult to annotate and thus hardly any publicly available annotations exist. As such, this work exploits related information sources as weak supervision. Humans mainly look at the face during sign language communication, where...

chapter

A non-Gaussian approach for biosignal classification based on the Johnson SU translation system

Hideaki Hayashi, Yuichi Kurita, Toshio Tsuji

2015 IEEE 8th International Workshop on Computational Intelligence and Applications (IWCIA) > 115 - 120

2015 IEEE 8th International Workshop on Computational Intelligence and Applications (IWCIA)

This paper proposes a non-Gaussian approach for biosignal classification based on the Johnson SU translation system. The Johnson system is a normalizing translation that transforms data without normality to normal distribution using four parameters, thereby enabling the representation of a wide range of shapes for marginal distribution with skewness and kurtosis. In this study, a discriminative model...

chapter

Fisher's discriminant and relevant component analysis for static facial expression classification

M. Sorci, G. Antonini, Jean-Philippe Thiran

2007 15th European Signal Processing Conference > 115 - 119

2007 15th European Signal Processing Conference

This paper addresses the issue of automatic classification of the six universal emotional categories (joy, surprise, fear, anger, disgust, sadness) in the case of static images. Appearance parameters are extracted by an active appearance model(AAM) representing the input for the classification step. We show how Relevant Component Analysis (RCA) in combination with Fisher's Linear Discriminant (FLD)...

chapter

Shape and Motion Features Approach for Activity Tracking and Recognition from Kinect Video Camera

Ahmad Jalal, Shaharyar Kamal, Daijin Kim

2015 IEEE 29th International Conference on Advanced Information Networking and Applications Workshops > 445 - 450

2015 IEEE 29th International Conference on Advanced Information Networking and Applications Workshops (WAINA)

Recent development in depth sensors opens up new challenging task in the field of computer vision research areas, including human-computer interaction, computer games and surveillance systems. This paper addresses shape and motion features approach to observe, track and recognize human silhouettes using a sequence of RGB-D images. Under our proposed activity recognition framework, the required procedure...

chapter

Curvature point based HMM state prediction for online handwritten assamese strokes recognition

Subhasis Mandal, S. R. Mahadeva Prasanna, Suresh Sundaram

2015 Twenty First National Conference on Communications (NCC) > 1 - 6

2015 Twenty First National Conference on Communications (NCC)

Hidden Markov Models (HMM) are used in handwritten strokes recognition task. The two design parameters of HMM are the number of states and number of mixtures in each state. There are two approaches for finding the number of states, namely, equal number of states and variable number of states. Since the shape of strokes will be different, variable number of states approach should be beneficial. This...

chapter

Improvement of Context Dependent Modeling for Arabic Handwriting Recognition

Mahdi Hamdani, Patrick Doetsch, Hermann Ney

2014 14th International Conference on Frontiers in Handwriting Recognition > 494 - 499

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

This paper proposes the improvement of context dependent modeling for Arabic handwriting recognition. Since the number of parameters in context dependent models is huge, CART trees are used for state tying. This work is based on a new set of questions for the CART tree construction based on a "lossy mapping" categorization of the Arabic shapes. The used system is a combination of Hidden...

chapter

Improvements in Sub-character HMM Model Based Arabic Text Recognition

Irfan Ahmad, Gernot A. Fink, Sabri A. Mahmoud

2014 14th International Conference on Frontiers in Handwriting Recognition > 537 - 542

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

Sub-character HMM models for Arabic text recognition allow sharing of common patterns between different position-dependent shape forms of an Arabic character as well as between different characters. The number of HMMs gets reduced considerably while still capturing the variations in shape patterns. This results in a compact, efficient, and robust recognizer with reduced model set. In the current paper...

chapter

Word Spotting in Handwritten Text Using Contour-Based Models

Angelos P. Giotis, Demetrios P. Gerogiannis, Christophoros Nikou

2014 14th International Conference on Frontiers in Handwriting Recognition > 399 - 404

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

In this paper, we propose a method for spotting keywords in images of handwritten text. Relying on an object detection system in real images, local contour features are extracted from segmented word images in order to obtain a representative shape of a word-class. Thus, word spotting is cast following a query-by-word-class scenario where class models are generated using a random subset of the images...

chapter

Segmenting Handwritten Math Symbols Using AdaBoost and Multi-scale Shape Context Features

Lei Hu, Richard Zanibbi

2013 12th International Conference on Document Analysis and Recognition > 1180 - 1184

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

This paper presents a new symbol segmentation method based on AdaBoost with confidence weighted predictions for online handwritten mathematical expressions. The handwritten mathematical expression is preprocessed and rendered to an image. Then for each stroke, we compute three kinds of shape context features (stroke pair, local neighborhood and global shape contexts) with different scales, 21 stroke...

chapter

A Novel Baseline-independent Feature Set for Arabic Handwriting Recognition

Bing Su, Xiaoqing Ding, Liangrui Peng, Changsong Liu

2013 12th International Conference on Document Analysis and Recognition > 1250 - 1254

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

HMM-based analytical methods have been widely used for Arabic handwriting recognition. A key factor influencing the performance of HMM-based systems is the features extracted from a sliding window. In this paper, we propose a novel baseline-independent feature set extracted from a wider sliding window to directly capture the contextual information. This feature set is a combination of center of mass...

chapter

A simple and effective pitch re-estimation method for rich prosody and speaking styles in HMM-based speech synthesis

Cheng-Yuan Lin, Chien-Hung Huang, Chih-Chung Kuo

2012 8th International Symposium on Chinese Spoken Language Processing > 286 - 290

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper proposes a novel way of controllable pitch re-estimation that can produce better pitch contour or provide diverse speaking styles for text-to-speech (TTS) systems. The method is composed of a pitch re-estimation model and a set of control parameters. The pitch re-estimation model is employed to reduce over-smoothing effects which is usually introduced by TTS training. The control parameters...

chapter

On-line learning of temporal state models for flexible objects

N. Bergstrom, C. H. Ek, D. Kragic, Y. Yamakawa, more

2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012) > 712 - 718

2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012)

State estimation and control are intimately related processes in robot handling of flexible and articulated objects. While for rigid objects, we can generate a CAD model before-hand and a state estimation boils down to estimation of pose or velocity of the object, in case of flexible and articulated objects, such as a cloth, the representation of the object's state is heavily dependent on the task...

chapter

Combining local and non-local information with dual decomposition for named entity recognition from text

Hai Leong Chieu, Loo-Nin Teow

2012 15th International Conference on Information Fusion > 231 - 238

2012 15th International Conference on Information Fusion (FUSION)

Named entity recognition (NER) is the task of segmenting and classifying occurrences of names in text. In NER, local contextual cues provide important evidence, but non-local information from the whole document could also prove useful: for example, it is useful to know that “Mary Kay Inc.” has been mentioned in a document to classify subsequent mentions of “Mary Kay” as an organization and not as...

chapter

A discriminative prototype selection approach for graph embedding in human action recognition

Ehsan Zare Borzeshi, Massimo Piccardi, Richard Yi Da Xu

2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops) > 1295 - 1301

2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops)

This paper proposes a novel graph-based method for representing a human's shape during the performance of an action. Despite their strong representational power, graphs are computationally cumbersome for pattern analysis. One way of circumventing this problem is that of transforming the graphs into a vector space by means of graph embedding. Such an embedding can be conveniently obtained by way of...

INFONA - science communication portal

Search results

[POSTER] A Probabilistic Combination of CNN and RNN Estimates for Hand Gesture Based Interaction in Car

Lip-reading via a DNN-HMM hybrid system using combination of the image-based and model-based features

Human Activity Recognition using depth body part histograms and Hidden Markov Models

DNN-HMM for Large Vocabulary Mongolian Offline Handwriting Recognition

Class-Based Contextual Modeling for Handwritten Arabic Text Recognition

Frequency count based two stage classification for online handwritten character recognition

Deep Learning of Mouth Shapes for Sign Language

A non-Gaussian approach for biosignal classification based on the Johnson SU translation system

Fisher's discriminant and relevant component analysis for static facial expression classification

Shape and Motion Features Approach for Activity Tracking and Recognition from Kinect Video Camera

Curvature point based HMM state prediction for online handwritten assamese strokes recognition

Improvement of Context Dependent Modeling for Arabic Handwriting Recognition

Improvements in Sub-character HMM Model Based Arabic Text Recognition

Word Spotting in Handwritten Text Using Contour-Based Models

Segmenting Handwritten Math Symbols Using AdaBoost and Multi-scale Shape Context Features

A Novel Baseline-independent Feature Set for Arabic Handwriting Recognition

A simple and effective pitch re-estimation method for rich prosody and speaking styles in HMM-based speech synthesis

On-line learning of temporal state models for flexible objects

Combining local and non-local information with dual decomposition for named entity recognition from text

A discriminative prototype selection approach for graph embedding in human action recognition

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options