Search results

Items from 1 to 20 out of 312 results

chapter

Research on multi-base depth neural network speech recognition

Cai Jun, Li Fei, Zhang Yi, Liu Yu

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1540 - 1544

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

In speech recognition systems, an improved multi-base neural network speech recognition model is proposed to solve the problem of long learning time and slow convergence rate of deep neural networks. However, the improved model introduces a large number of parameters in the training process, which makes the model over-fitted on the test set, resulting in the deterioration of generalization ability and the...

chapter

Face recognition system using HMM-PSO for feature selection

Mai Mohamed Mahmoud Farag, Tarek Elghazaly, Hesham Ahmed Hefny

2016 12th International Computer Engineering Conference (ICENCO) > 105 - 110

2016 12th International Computer Engineering Conference (ICENCO)

In this paper we apply particle swarm optimization (PSO) feature selection to enhance Hidden Markov Model (HMM) states and parameters for face recognition systems. Ideal feature selection for face images is based on the collaborative behavior of bird flocking and reduces the feature size and hence the recognition time complexity. The framework has been inspected on 400 face pictures of the Olivetti...

chapter

Automatic speech recognition models: A characteristic and performance review

U. G. Patil, S. D. Shirbahadurkar, A. N. Paithane

2016 International Conference on Computing Communication Control and automation (ICCUBEA) > 1 - 7

2016 International Conference on Computing Communication Control and automation (ICCUBEA)

This paper presents a review of a few notable speech recognition models that were reported in the last decade. Firstly, the models are categorized into sparse models, learning models and domain-specific models. Subsequently, the characteristics of the models have been observed using speech constraints, algorithmic constraints and performance constraints. The performance of these models reported in...

chapter

Kinect based people identification system using fusion of clustering and classification

Aniruddha Sinha, Diptesh Das, Kingshuk Chakravarty, Amit Konar, et al.

2014 International Conference on Computer Vision Theory and Applications (VISAPP) > 3 > 171 - 179

2014 International Conference on Computer Vision Theory and Applications (VISAPP)

The demand for human identification in a non-intrusive manner has risen steadily in recent years. Several works have already addressed this using gait-cycle detection from human skeleton data, with Microsoft Kinect as the data capture sensor. In this paper we propose a novel method for automatic human identification in real time using the fusion of both supervised and unsupervised...

chapter

Human action recognition using an improved string edit distance

Pasquale Foggia, Benoit Gauzere, Alessia Saggese, Mario Vento

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

In this paper we propose an improvement of a human action recognition method that uses a string-based representation and a string edit distance to compare the observed action with reference actions in the training set. In particular, the original improvement is based on a specific formulation of the string edit distance that is more suited to take into account the problems related to noise and to...

chapter

An Anomaly Detection System Based on Ensemble of Detectors with Effective Pruning Techniques

Amirreza Soudi, Wael Khreich, Abdelwahab Hamou-Lhadj

2015 IEEE International Conference on Software Quality, Reliability and Security > 109 - 118

2015 IEEE International Conference on Software Quality, Reliability and Security (QRS)

Anomaly detection systems rely on machine learning techniques to model the normal behavior of the system. This model is used during operation to detect anomalies due to attacks or design faults. Ensemble methods have been used to improve the overall detection accuracy by combining the outputs of several accurate and diverse models. Existing Boolean combination techniques either require an exponential...

chapter

O-MAP: A per-component online anomaly predicting method for Cloud infrastructure

Bin Hong, Fuyang Peng, Bo Deng, Yuchao Zhang

2015 IEEE International Conference on Information and Automation > 3026 - 3031

2015 IEEE International Conference on Information and Automation (ICIA)

Virtualized cloud systems are prone to performance anomalies due to various reasons such as resource contentions, software bugs, and hardware failures. It will be a daunting task for system administrators to manually keep track of the execution status of a large number of virtual machines all the time. Anomaly prediction is an effective approach to enhancing availability and reliability of Cloud infrastructures...

chapter

A human motion prediction algorithm for Non-binding Lower Extremity Exoskeleton

Min Wang, Xinyu Wu, Duxin Liu, Can Wang, et al.

2015 IEEE International Conference on Information and Automation > 369 - 374

2015 IEEE International Conference on Information and Automation (ICIA)

This paper introduces a novel approach to predict human motion for the Non-binding Lower Extremity Exoskeleton (NBLEX). Most exoskeletons must be attached to the pilot, which poses potential security problems. In order to solve these problems, the NBLEX is studied and designed to free pilots from the exoskeletons. Rather than applying Electromyography (EMG) and Ground Reaction Force (GRF)...

chapter

A hybrid Parts Of Speech tagger for Malayalam language

Anisha Aziz T, Sunitha C

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1502 - 1507

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Parts of speech tagging is an important research topic in the Natural Language Processing research area. Since it is among the first steps of any natural language processing (NLP) technique such as machine translation, any tagging error will propagate through the whole NLP process. So far, work has been done on POS tagging based on SVM, MBLP, HMM, Ngram. All of these methods were not...

chapter

Reducing morpho-phonetic confusion in sub-word based Uyghur ASR

Mijit Ablimit, Askar Hamdulla, Akbar Pattar

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 348 - 352

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Sub-word units like morphemes are selected as the lexicon for highly inflectional languages, as they can provide better coverage and a smaller vocabulary size. However, short units shrink the context of statistical models, are prone to morpho-phonetic changes, and do not always outperform the word based model. When sequences of units are merged or split, unit boundaries are phonetically harmonized in the...

chapter

Evaluation of wains as a classifier for automatic speech recognition

Rosemary T. Salaja, Ronan Flynn, Michael Russell

2015 26th Irish Signals and Systems Conference (ISSC) > 1 - 6

2015 26th Irish Signals and Systems Conference (ISSC)

This paper introduces a new back-end classifier for a speech recognition system that is based on artificial life (ALife). The ALife species being used for classification purposes are called wains, which were developed using the Créatúr framework. The speech recognition task used in the evaluation of the new classifier is that of isolated digit recognition. Performance of the proposed back-end classifier...

chapter

Instructive video retrieval for surgical skill coaching using attribute learning

Lin Chen, Qiang Zhang, Peng Zhang, Baoxin Li

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

Video-based coaching systems have seen increasing adoption in various applications including dance, sports, and surgery training. Most existing systems are either passive (for data capture only) or barely active (with limited automated feedback to a trainee). In this paper, we present a video-based skill coaching system for simulation-based surgical training by exploring a newly proposed problem of...

chapter

Multi-modal learning for gesture recognition

Congqi Cao, Yifan Zhang, Hanqing Lu

2015 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2015 IEEE International Conference on Multimedia and Expo (ICME)

With the development of sensing equipment, data from different modalities is available for gesture recognition. In this paper, we propose a novel multi-modal learning framework. A coupled hidden Markov model (CHMM) is employed to discover the correlation and complementary information across different modalities. In this framework, we use two configurations: one is multi-modal learning and multi-modal...

chapter

A trace abstraction approach for host-based anomaly detection

Syed Shariyar Murtaza, Wael Khreich, Abdelwahab Hamou-Lhadj, Stephane Gagnon

2015 IEEE Symposium on Computational Intelligence for Security and Defense Applications (CISDA) > 1 - 8

2015 IEEE Symposium on Computational Intelligence for Security and Defense Applications (CISDA)

High false alarm rates and execution times are among the key issues in host-based anomaly detection systems. In this paper, we investigate the use of trace abstraction techniques for reducing the execution time of anomaly detectors while keeping the same accuracy. The key idea is to represent system call traces as traces of kernel module interactions and use the resulting abstract traces as input...

chapter

Speech event detection by non negative matrix deconvolution

Carla Lopes, Fernando Perdigao

2007 15th European Signal Processing Conference > 1280 - 1284

2007 15th European Signal Processing Conference

Support Vector Machines (SVM) are applied to the problem of detecting and classifying broad acoustic-phonetic classes (events). In this paper an approach based on Non-Negative Matrix Deconvolution (NMD) is proposed to merge frame-based SVM predictions into segmental events. To turn the SVM outputs, which are frame-based, into a signal segmented in terms of events, two different event merger methods...

chapter

Automatic identification of bird species: A comparison between kNN and SOM classifiers

Dorota Kaminska, Artur Gmerek

2012 Joint Conference New Trends In Audio & Video And Signal Processing: Algorithms, Architectures, Arrangements And Applications (NTAV/SPA) > 77 - 82

2012 Joint Conference New Trends in Audio & Video and Signal Processing: Algorithms, Architectures, Arrangements, and Applications (NTAV/SPA)

This paper presents a system for automatic bird identification, which uses audio input. The experiments have been conducted on three groups of birds, which were created basing finishing on classification, the system is fully automated. The main problem in automatic bird recognition (ABR) is the choice of proper features and classifiers. Identification has been made using two classifiers: kNN (k Nearest...

chapter

Speech recognition with prediction-adaptation-correction recurrent neural networks

Yu Zhang, Dong Yu, Michael L. Seltzer, Jasha Droppo

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5004 - 5008

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose the prediction-adaptation-correction RNN (PAC-RNN), in which a correction DNN estimates the state posterior probability based on both the current frame and the prediction made on the past frames by a prediction DNN. The result from the main DNN is fed back to the prediction DNN to make better predictions for the future frames. In the PAC-RNN, we can consider that, given the new, current...

chapter

Discriminative spectral learning of hidden markov models for human activity recognition

Alfredo Nazabal, Antonio Artes-Rodriguez

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1966 - 1970

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Hidden Markov Models (HMMs) are one of the most important techniques to model and classify sequential data. Maximum Likelihood (ML) and (parametric and non-parametric) Bayesian estimation of the HMM parameters suffer from local maxima, and on massive datasets they can be especially time-consuming. In this paper, we extend the spectral learning of HMMs, a moment matching learning technique free from...

chapter

Brandt's GLR method & refined HMM segmentation for TTS synthesis application

Safaa Jarifi, Dominique Pastor, Olivier Rosec

2005 13th European Signal Processing Conference > 1 - 4

2005 13th European Signal Processing Conference

In comparison with standard HMM (Hidden Markov Model) with forced alignment, this paper discusses two automatic segmentation algorithms from different points of view: the probabilities of insertion and omission, and the accuracy. The first algorithm, hereafter named the refined HMM algorithm, aims at refining the segmentation performed by standard HMM via a GMM (Gaussian Mixture Model) of each boundary...

chapter

Video classification based on HMM using text and faces

Nevenka Dimitrova, Lalitha Agnihotri, Gang Wei

2000 10th European Signal Processing Conference > 1 - 4

2000 10th European Signal Processing Conference

Video content classification and retrieval is a necessary tool in the current merging of entertainment and information media. With the advent of broadband networking, every consumer will have video programs available on-line as well as in the traditional distribution channels. Systems that help in content management have to discern between different categories of video in order to provide for fast...

Keywords:
TRAINING
ACCURACY
HIDDEN MARKOV MODELS

Content availability

Available (311)
None (1)

Keywords

SPEECH RECOGNITION (107)
SPEECH (99)
FEATURE EXTRACTION (76)
HIDDEN MARKOV MODEL (45)
ACOUSTICS (44)
SUPPORT VECTOR MACHINES (43)
NATURAL LANGUAGE PROCESSING (35)
TESTING (32)
DATABASES (31)
COMPUTATIONAL MODELING (30)
HANDWRITING RECOGNITION (27)
HMM (26)
DATA MINING (25)
DATA MODELS (25)
TAGGING (23)
TRAINING DATA (23)
CLASSIFICATION ALGORITHMS (21)
VECTORS (21)
LEARNING (ARTIFICIAL INTELLIGENCE) (20)
SPEECH PROCESSING (19)
AUTOMATIC SPEECH RECOGNITION (18)
CHARACTER RECOGNITION (18)
MATHEMATICAL MODEL (18)
ARTIFICIAL NEURAL NETWORKS (17)
IMAGE RECOGNITION (15)
MACHINE LEARNING (14)
DICTIONARIES (13)
GAUSSIAN PROCESSES (13)
MEL FREQUENCY CEPSTRAL COEFFICIENT (13)
PATTERN CLASSIFICATION (13)
SUPPORT VECTOR MACHINE (13)
HANDWRITTEN CHARACTER RECOGNITION (12)
HUMANS (12)
IMAGE SEGMENTATION (12)
VOCABULARY (12)
GESTURE RECOGNITION (11)
OPTIMIZATION (11)
PREDICTIVE MODELS (11)
SHAPE (11)
SPEAKER RECOGNITION (11)
EMOTION RECOGNITION (10)
IMAGE CLASSIFICATION (10)
KERNEL (10)
NOISE (10)
PATTERN RECOGNITION (10)
ROBUSTNESS (10)
SENSORS (10)
ACOUSTIC MODELING (9)
ADAPTATION MODEL (9)
CLASSIFICATION (9)
DECODING (9)
FACE RECOGNITION (9)
PROBABILITY (9)
SIGNAL PROCESSING (9)
COMPUTERS (8)
CONTEXT (8)
INFORMATION RETRIEVAL (8)
MONITORING (8)
NEURAL NETS (8)
NEURAL NETWORKS (8)
PREDICTION ALGORITHMS (8)
SIGNAL CLASSIFICATION (8)
STATISTICAL ANALYSIS (8)
TEXT ANALYSIS (8)
VISUALIZATION (8)
VITERBI ALGORITHM (8)
ALGORITHM DESIGN AND ANALYSIS (7)
CLUSTERING ALGORITHMS (7)
CONFERENCES (7)
CRF (7)
DISCRIMINATIVE TRAINING (7)
ERROR ANALYSIS (7)
ESTIMATION (7)
JOINTS (7)
MAXIMUM LIKELIHOOD ESTIMATION (7)
NATURAL LANGUAGES (7)
SEMANTICS (7)
SPEECH SYNTHESIS (7)
SUPPORT VECTOR MACHINE CLASSIFICATION (7)
ACOUSTIC SIGNAL PROCESSING (6)
ANALYTICAL MODELS (6)
CAMERAS (6)
CONDITIONAL RANDOM FIELDS (6)
EQUATIONS (6)
GENETIC ALGORITHMS (6)
HISTOGRAMS (6)
IMAGE SEQUENCES (6)
INTERNET (6)
LABELING (6)
LANGUAGE MODEL (6)
MUSIC (6)
PRINCIPAL COMPONENT ANALYSIS (6)
ADAPTATION MODELS (5)
BRAIN MODELING (5)
CEPSTRAL ANALYSIS (5)
COMPLEXITY THEORY (5)
COMPUTER VISION (5)

INFONA - science communication portal
