Search results

chapter

Research on multi-base depth neural network speech recognition

Cai Jun, Li Fei, Zhang Yi, Liu Yu

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1540 - 1544

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

In speech recognition system, an improved multi-base neural network speech recognition model is proposed to solve the problem of long learning time and slow convergence rate of deep neural network. However, the improved model introduces a large number of parameters in the training process to make the model over-fitted in the test set, resulting in the deterioration of generalization ability and the...

chapter

Face recognition system using HMM-PSO for feature selection

Mai Mohamed Mahmoud Farag, Tarek Elghazaly, Hesham Ahmed Hefny

2016 12th International Computer Engineering Conference (ICENCO) > 105 - 110

2016 12th International Computer Engineering Conference (ICENCO)

In this paper we apply particle swarm optimization (PSO) feature selection to enhance Hidden Markov Model (HMM) states and parameters for face recognition systems. Ideal Feature selection for face images based on the idea of collaborative behavior of bird flocking to reduce the feature size and hence recognition time complicity. The framework has been inspected on 400 face pictures of the Olivetti...

chapter

Automatic speech recognition models: A characteristic and performance review

U. G. Patil, S. D. Shirbahadurkar, A. N. Paithane

2016 International Conference on Computing Communication Control and automation (ICCUBEA) > 1 - 7

2016 International Conference on Computing Communication Control and automation (ICCUBEA)

This paper presents a review on few notable speech recognition models that are reported in the last decade. Firstly, the models are categorized into sparse models, learning models and domain - specific models. Subsequently, the characteristics of the models have been observed using speech constraints, algorithmic constraints and performance constraints. The performance of these models reported in...

chapter

Automatic speech annotation based on enhanced wavelet Packets Best Tree Encoding (EWPBTE) feature

Mohamed Hassan Mohamed, Ashraf Mohamed Ali Hassan, N.M. Hussein Hassan

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 2611 - 2616

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)

This paper aimed at introducing a completely automated Arabic phone recognition system based on Enhanced Wavelet Packets Best Tree Encoding (EWPBTE) 15-point speech feature. The process of enhancing of WPBTE is provided by adding energy component to WPBTE, which is implemented in Matlab software and makes an enhancement of 65 % to recognizer accuracy which is the most contribution in this paper. EWPBTE...

chapter

Experimental study in emotion recognition using prosodie features

Ioan Pavaloi, Elena Musca

2015 E-Health and Bioengineering Conference (EHB) > 1 - 4

2015 E-Health and Bioengineering Conference (EHB)

The paper describes an experimental study on emotion recognition using a collection of emotional recordings from SRoL corpus. Its goal is to study and to obtain a simple tool that can be used in recordings validation in the process of building large voice corpora. The tools can help or even replace the human validation. In this study we used two classifiers, k-NN (k — Nearest Neighborhood) and SVM...

chapter

Spatial information in classification of activity videos

Shreeya Sengupta, Hui Wang, William Blackburn, Piyush Ojha

2015 Federated Conference on Computer Science and Information Systems (FedCSIS) > 145 - 153

2015 Federated Conference on Computer Science and Information Systems (FedCSIS)

Spatial information describes the relative spatial position of an object in a video. Such information may aid several video analysis tasks such as object, scene, event and activity recognition. This paper studies the effect of spatial information on video activity recognition. The paper firstly performs activity recognition on KTH and Weizmann videos using Hidden Markov Model and k-Nearest Neighbour...

chapter

Kinect based people identification system using fusion of clustering and classification

Aniruddha Sinha, Diptesh Das, Kingshuk Chakravarty, Amit Konar, more

2014 International Conference on Computer Vision Theory and Applications (VISAPP) > 3 > 171 - 179

2014 International Conference on Computer Vision Theory and Applications (VISAPP)

The demand of human identification in a non-intrusive manner has risen increasingly in recent years. Several works have already been done in this context using gait-cycle detection from human skeleton data using Microsoft Kinect as a data capture sensor. In this paper we have proposed a novel method for automatic human identification in real time using the fusion of both supervised and unsupervised...

chapter

Profiling and identifying users' activities with network traffic analysis

Ma Tao, Ye Chun Ming, Chen Juan

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 503 - 506

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

Traffic identification technique is used for classification of different network protocols and applications even with detection of users' network activities. In this paper, we conduct our study on some typical users' network activities and present a traffic identification method to describe the feature about users' behaviors. We convert users' network activities information into different sequences...

chapter

On the use of EMD for automatic newborn cry segmentation

Lina Abou-Abbas, Leila Montazeri, Christian Gargour, Chakib Tadj

2015 International Conference on Advances in Biomedical Engineering (ICABME) > 262 - 265

2015 International Conference on Advances in Biomedical Engineering (ICABME)

Cry segmentation is an essential preprocessing step in any infant crying diagnosis system. Besides crying sounds consisting of expiration phases followed by short periods of inspiration episodes, each recording of newborn cries also includes silence sections as well as other sounds such as speech of caregivers, noise and sound of medical equipments. This paper is devoted to a newly developed Empirical...

chapter

Automatic Language Identification for Romance Languages Using Stop Words and Diacritics

Ciprian-Octavian Truica, Julien Velcin, Alexandru Boicea

2015 17th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC) > 243 - 246

2015 17th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)

Automatic language identification is a natural language processing problem that tries to determine the natural language of a given content. In this paper we present a statistical method for automatic language identification of written text using dictionaries containing stop words and diacritics. We propose different approaches that combine the two dictionaries to accurately determine the language...

chapter

Human action recognition using an improved string edit distance

Pasquale Foggia, Benoit Gauzere, Alessia Saggese, Mario Vento

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

In this paper we propose an improvement of a human action recognition method that uses a string-based representation and a string edit distance to compare the observed action with reference actions in the training set. In particular, the original improvement is based on a specific formulation of the string edit distance that is more suited to take into account the problems related to noise and to...

chapter

An Anomaly Detection System Based on Ensemble of Detectors with Effective Pruning Techniques

Amirreza Soudi, Wael Khreich, Abdelwahab Hamou-Lhadj

2015 IEEE International Conference on Software Quality, Reliability and Security > 109 - 118

2015 IEEE International Conference on Software Quality, Reliability and Security (QRS)

Anomaly detection systems rely on machine learning techniques to model the normal behavior of the system. This model is used during operation to detect anomalies due to attacks or design faults. Ensemble methods have been used to improve the overall detection accuracy by combining the outputs of several accurate and diverse models. Existing Boolean combination techniques either require an exponential...

chapter

O-MAP: A per-component online anomaly predicting method for Cloud infrastructure

Bin Hong, Fuyang Peng, Bo Deng, Yuchao Zhang

2015 IEEE International Conference on Information and Automation > 3026 - 3031

2015 IEEE International Conference on Information and Automation (ICIA)

Virtualized cloud systems are prone to performance anomalies due to various reasons such as resource contentions, software bugs, and hardware failures. It will be a daunting task for system administrators to manually keep track of the execution status of a large number of virtual machines all the time. Anomaly prediction is an effective approach to enhancing availability and reliability of Cloud infrastructures...

chapter

Rapid recognition of dynamic hand gestures using leap motion

Yanmei Chen, Zeyu Ding, Yen-Lun Chen, Xinyu Wu

2015 IEEE International Conference on Information and Automation > 1419 - 1424

2015 IEEE International Conference on Information and Automation (ICIA)

Human Computer Interaction would be much more smooth with the implementation of rapid recognition, the aim of which is to recognize the hand gesture before it is completed. In this paper, a rapid recognition for dynamic hand gestures using leap motion is proposed. The database contains the three-dimensional motion trajectory of the numbers and the alphabet (36 gestures in total) which captured by...

chapter

A human motion prediction algorithm for Non-binding Lower Extremity Exoskeleton

Min Wang, Xinyu Wu, Duxin Liu, Can Wang, more

2015 IEEE International Conference on Information and Automation > 369 - 374

2015 IEEE International Conference on Information and Automation (ICIA)

This paper introduces a novel approach to predict human motion for the Non-binding Lower Extremity Exoskeleton (NBLEX). Most of the exoskeletons must be attached to the pilot, which exists potential security problems. In order to solve these problems, the NBLEX is studied and designed to free pilots from the exoskeletons. Rather than applying Electromyography (EMG) and Ground Reaction Force (GFR)...

chapter

A real-time dynamic gesture recognition based on 3D trajectories in distinguishing similar gestures

Zeyu Ding, Zexiong Zhang, Yanmei Chen, Yen-Lun Chen, more

2015 IEEE International Conference on Information and Automation > 250 - 255

2015 IEEE International Conference on Information and Automation (ICIA)

There are many shape-similar gestures which cause errors in the process of hand gesture recognition. In this paper, a new method which can distinguish the similar gestures was proposed. The information of motion trajectory is captured by a leap motion in three-dimension space, and the orientation characteristics are quantified and coded as the feature. Then the Hidden Markov Model (HMM) algorithm...

chapter

A hybrid Parts Of Speech tagger for Malayalam language

Anisha Aziz T, Sunitha C

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1502 - 1507

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Parts of speech tagging is an important research topic in Natural Language Processing research are. Since it is one among the first steps of any natural language processing (NLP) techniques such as machine translation, if any error happens for tagging the same will repeat in the whole NLP process. So far works had been done on POS tagging based on SVM, MBLP, HMM, Ngram. All of these methods were not...

chapter

A new system for Chinese sign language recognition

Jihai Zhang, Wengang Zhou, Houqiang Li

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 534 - 538

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In this paper, we propose a new system for isolated sign language recognition (SLR) and continuous SLR. In isolated SLR, Histogram of Oriented Displacement is used to describe the trajectories, and multi-SVM is adopted for classification. In continuous SLR, we propose a Dynamic Programming method with warping templates obtained by Dynamic Time Warping (DTW) algorithm. We evaluate our approach with...

chapter

Reducing morpho-phonetic confusion in sub-word based Uyghur ASR

Mijit Ablimit, Askar Hamdulla, Akbar Pattar

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 348 - 352

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Sub-word units like morphemes are selected as the lexicon for highly inflectional languages, as they can provide better coverage and a smaller vocabulary size. However, short units shrink the context of statistical models, prone to morpho-phonetic changes, and not always outperform the word based model. When sequence of units are merged or split, unit boundaries are phonetically harmonized in the...

chapter

On statistical machine translation method for lexicon refinement in speech recognition

Haihua Xu, Xiong Xiao, Eng-Siong Chng, Haizhou Li

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 25 - 29

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In low resource Automatic Speech Recognition (ASR), one usually resorts to the Statistical Machine Translation (SMT) technique to learn transform rules to refine grapheme lexicon. To do this, we face two challenges. One is to generate grapheme sequences from the training data as the targets, which is paired with the original transcripts to train SMT models; the other is to effectively prune the learned...

INFONA - science communication portal

Search results

Research on multi-base depth neural network speech recognition

Face recognition system using HMM-PSO for feature selection

Automatic speech recognition models: A characteristic and performance review

Automatic speech annotation based on enhanced wavelet Packets Best Tree Encoding (EWPBTE) feature

Experimental study in emotion recognition using prosodie features

Spatial information in classification of activity videos

Kinect based people identification system using fusion of clustering and classification

Profiling and identifying users' activities with network traffic analysis

On the use of EMD for automatic newborn cry segmentation

Automatic Language Identification for Romance Languages Using Stop Words and Diacritics

Human action recognition using an improved string edit distance

An Anomaly Detection System Based on Ensemble of Detectors with Effective Pruning Techniques

O-MAP: A per-component online anomaly predicting method for Cloud infrastructure

Rapid recognition of dynamic hand gestures using leap motion

A human motion prediction algorithm for Non-binding Lower Extremity Exoskeleton

A real-time dynamic gesture recognition based on 3D trajectories in distinguishing similar gestures

A hybrid Parts Of Speech tagger for Malayalam language

A new system for Chinese sign language recognition

Reducing morpho-phonetic confusion in sub-word based Uyghur ASR

On statistical machine translation method for lexicon refinement in speech recognition

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options