This paper describes an implementation of speech recognition that recognizes and suppresses ten (10) defined profane and vulgar Filipino words. The adapted speech recognition architecture was that of the Oregon Graduate Institute's (OGI) Center for Spoken Language Understanding (CSLU). It utilizes a hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature...
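The CSLU-based spotter itself is too large to reproduce from an abstract, but the suppression step it implies can be sketched. This is a minimal illustration with assumed names (`suppress_keywords`, frame-aligned posteriors), not the paper's actual implementation: once an HMM/ANN spotter emits a per-frame posterior for a flagged word, the corresponding audio frames are muted.

```python
import numpy as np

def suppress_keywords(samples, posteriors, frame_len, threshold=0.5):
    """Mute audio wherever the spotter's per-frame keyword posterior
    exceeds `threshold`. posteriors[i] covers samples
    [i*frame_len, (i+1)*frame_len)."""
    out = samples.copy()
    for i, p in enumerate(posteriors):
        if p > threshold:
            out[i * frame_len:(i + 1) * frame_len] = 0.0
    return out

# Toy example: 4 frames of 3 samples; frames 1 and 2 flagged as profane.
audio = np.ones(12)
post = np.array([0.1, 0.9, 0.8, 0.2])
clean = suppress_keywords(audio, post, frame_len=3)
```

In practice the muted span would be padded or replaced by a tone rather than hard zeros, but the frame-to-sample bookkeeping is the same.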
Automatic speech recognition (ASR) can be very helpful for speakers who suffer from dysarthria, a neurological disability that damages the control of motor speech articulators. Although a few attempts have been made to apply ASR technologies to sufferers of dysarthria, previous studies show that such ASR systems have not attained an adequate level of performance. In this study, a dysarthric multi-networks...
Emotion recognition from speech has evolved into one of the most significant research areas in the field of affective computing. In this paper, two emotional speech datasets have been analyzed, based on gender distinction (male and female speech). This paper introduces a new approach to speech-emotion recognition based on the AdaBoost classification algorithm. An artificial neural network has been...
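The abstract does not specify the AdaBoost configuration; as a hedged illustration of the general technique only, a minimal AdaBoost with one-level decision stumps (all names and the toy features below are assumptions, not from the paper) looks like this:

```python
import numpy as np

def train_adaboost(X, y, n_rounds=10):
    """Minimal AdaBoost with one-level decision stumps; y in {-1, +1}."""
    n, d = X.shape
    w = np.full(n, 1.0 / n)                     # sample weights
    stumps = []
    for _ in range(n_rounds):
        best = None
        for j in range(d):                      # feature index
            for thr in np.unique(X[:, j]):      # candidate threshold
                for pol in (1, -1):             # polarity of the split
                    pred = pol * np.where(X[:, j] > thr, 1, -1)
                    err = w[pred != y].sum()    # weighted error
                    if best is None or err < best[0]:
                        best = (err, j, thr, pol)
        err, j, thr, pol = best
        err = max(err, 1e-10)                   # avoid log(0)
        alpha = 0.5 * np.log((1.0 - err) / err)
        pred = pol * np.where(X[:, j] > thr, 1, -1)
        w *= np.exp(-alpha * y * pred)          # up-weight mistakes
        w /= w.sum()
        stumps.append((alpha, j, thr, pol))
    return stumps

def adaboost_predict(stumps, X):
    score = sum(a * p * np.where(X[:, j] > t, 1, -1)
                for a, j, t, p in stumps)
    return np.sign(score)
```

In an emotion-recognition setting, the rows of `X` would be per-utterance acoustic features and `y` a binarized emotion label.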
In this paper, we present a model for Turkish speech recognition. The model is syllable-based, where the recognition is performed through syllables as speech recognition units. The main goal of the model is to recognize as much as possible of a given continuous speech by identifying only a small set of syllables in the language. For that purpose, only the syllable types with a higher frequency are...
To overcome problems inherent in conventional speech recognition (e.g., noise interference and private-data loss), many researchers have investigated alternative approaches. Electromyography (EMG) signals from the muscles producing speech have been used to replace the voiced signal. Similarly, we aim to develop EMG-based speech recognition for the Thai language. Tone is an important characteristic of this...
Speech signals are one of the most important means of communication among human beings. In this paper, a comparative study of two feature extraction techniques is carried out for recognizing speaker-independent spoken isolated words. The first is a hybrid approach with Linear Predictive Coding (LPC) and Artificial Neural Networks (ANN), and the second method uses a combination of Wavelet Packet...
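As background for the first technique, LPC coefficients are conventionally obtained by the autocorrelation method with the Levinson-Durbin recursion. A minimal sketch (the function name and framing are assumptions, not the paper's code):

```python
import numpy as np

def lpc(signal, order):
    """LPC coefficients via the autocorrelation method and the
    Levinson-Durbin recursion. Returns a with a[0] = 1, so the model
    predicts s[n] ~ -sum_{k=1..order} a[k] * s[n-k]."""
    n = len(signal)
    # Autocorrelation for lags 0..order.
    r = np.array([np.dot(signal[:n - k], signal[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]                                   # prediction-error energy
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                           # reflection coefficient
        a[1:i] = a[1:i] + k * a[i - 1:0:-1]      # update previous coeffs
        a[i] = k
        err *= (1.0 - k * k)
    return a

# A geometric sequence s[n] = 0.5**n is (almost exactly) an AR(1)
# process, so a first-order LPC fit recovers a ~ [1, -0.5].
s = 0.5 ** np.arange(20)
a = lpc(s, 1)
```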
A classification system that accurately categorizes caller behavior within Interactive Voice Response systems would assist in developing good automated self-service applications. This paper details the implementation of such a classification system for a pay-beneficiary application. Adaptive Neuro-Fuzzy Inference System (ANFIS), feed-forward Artificial Neural Network (ANN) and Support Vector Machine...
The aim of the spoken term detection task is to find occurrences of user-entered keywords in an archive of audio recordings. The techniques typically used are vocabulary-independent, relying only on the acoustic information available. In this scenario, however, we rely exclusively on the acoustic model, which is a drawback when it is unreliable; for example, when the input is noisy. In...
The context-independent deep belief network (DBN) hidden Markov model (HMM) hybrid architecture has recently achieved promising results for phone recognition. In this work, we propose a context-dependent DBN-HMM system that dramatically outperforms strong Gaussian mixture model (GMM)-HMM baselines on a challenging, large vocabulary, spontaneous speech recognition dataset from the Bing mobile voice...
This paper discusses and evaluates the effect of Voice Activity Detection (VAD) in an isolated Yoruba word recognition system (IYWRS). The word database used in this paper was collected from 22 speakers, each repeating the numbers 1 to 9 three times. A hybrid configuration of Mel-Frequency Cepstral Coefficients (MFCC) and Linear Predictive Coding (LPC) has been used to extract the features of the...
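The abstract evaluates VAD as a front end; a common baseline (not necessarily the detector used in the paper) is short-time-energy thresholding, sketched here with assumed parameter names:

```python
import numpy as np

def energy_vad(samples, frame_len=160, threshold_ratio=0.1):
    """Label each frame as speech (True) or silence (False) by
    comparing its short-time energy against a fraction of the
    maximum frame energy in the utterance."""
    n_frames = len(samples) // frame_len
    frames = samples[:n_frames * frame_len].reshape(n_frames, frame_len)
    energy = np.sum(frames ** 2, axis=1)
    return energy > threshold_ratio * energy.max()

# Toy utterance: one silent frame followed by one active frame
# (frame_len=160 corresponds to 20 ms at an 8 kHz sampling rate).
x = np.concatenate([np.zeros(160), np.ones(160)])
flags = energy_vad(x, frame_len=160)
```

Frames flagged as silence are dropped before MFCC/LPC extraction, which is how VAD affects the recognizer's input.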
This paper describes an application of the Orthogonal Least Squares (OLS) algorithm for feature selection of spoken letters. Traditionally used for system identification purposes, the OLS method was used to select important Mel-Frequency Cepstrum Coefficients (MFCC) for classification of two spoken letters - `A' and `S' using Multi-Layer Perceptron (MLP) neural network. We evaluated several network...
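A minimal sketch of greedy OLS forward selection, ranking each candidate column by the share of the target's energy its orthogonalized component explains (the function name and the simplified Gram-Schmidt bookkeeping are assumptions; the paper's exact formulation may differ):

```python
import numpy as np

def ols_select(X, y, n_select):
    """Greedy Orthogonal Least Squares feature selection: at each step
    pick the column whose component orthogonal to the already-selected
    columns best explains the target y."""
    selected = []
    residual_cols = X.astype(float).copy()
    for _ in range(n_select):
        norms = np.sum(residual_cols ** 2, axis=0)
        norms[norms < 1e-12] = np.inf          # skip degenerate columns
        err = (residual_cols.T @ y) ** 2 / norms   # error-reduction score
        err[selected] = -np.inf                # never re-pick a column
        best = int(np.argmax(err))
        selected.append(best)
        # Orthogonalize remaining columns against the chosen one.
        w = residual_cols[:, best]
        proj = (residual_cols.T @ w) / (w @ w)
        residual_cols = residual_cols - np.outer(w, proj)
    return selected

# Toy case with orthonormal columns: y loads on columns 0 and 2,
# so those are selected in order of their contribution.
X = np.eye(4)[:, :3]
y = np.array([1.0, 0.0, 0.5, 0.0])
picked = ols_select(X, y, 2)
```

In the paper's setting, the columns of `X` would be candidate MFCC dimensions and `y` a class indicator, with the selected subset then fed to the MLP.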
The aim of the spoken term detection task is to find the occurrence of user-entered keywords in an archive of audio recordings. In this area, besides the accuracy of hits returned, the speed of search is also very important, for which an intermediate representation of recordings is normally used. In this paper we evaluate a spoken term detection method which represents the speech signals by their...
This paper describes a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of two stages: i) a multilayer neural network (MLN), which converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities and ii) the phoneme probabilities obtained from the first stage and corresponding Δ and ΔΔ are inserted into another MLN to...
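The Δ and ΔΔ features mentioned above are conventionally computed with a regression window over neighboring frames; a sketch under that assumption (the paper may use a different window width):

```python
import numpy as np

def deltas(feats, width=2):
    """Standard regression-based delta features:
    d[t] = sum_{k=1..W} k * (f[t+k] - f[t-k]) / (2 * sum_{k=1..W} k^2),
    with edge frames clamped to the first/last frame."""
    T = len(feats)
    denom = 2 * sum(k * k for k in range(1, width + 1))
    out = np.zeros_like(feats, dtype=float)
    for t in range(T):
        for k in range(1, width + 1):
            plus = feats[min(t + k, T - 1)]    # clamp at the right edge
            minus = feats[max(t - k, 0)]       # clamp at the left edge
            out[t] += k * (plus - minus)
    return out / denom

# A linearly increasing trajectory has slope 1, so interior deltas are 1.
f = np.arange(10.0)
d = deltas(f)
```

ΔΔ is then simply `deltas(deltas(f))`, applied here to the phoneme-probability trajectories rather than to MFCCs.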
This paper presents a Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of three stages: i) a multilayer neural network (MLN), which converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities, ii) the phoneme probabilities obtained from the first stage and corresponding Δ and ΔΔ are inserted into another MLN to improve...
Several problems remain to be resolved in speech emotion recognition; for example, the dimensionality of feature sets is usually too high, and the redundancy among various features is relatively strong. Considering these problems, a speech emotion recognition method based on factor analysis and majority voting is proposed. How to extract emotional factors from global statistical features and GMM supervectors was...
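The majority-voting stage over several classifiers' outputs is straightforward; a minimal stdlib sketch (names assumed, not from the paper):

```python
from collections import Counter

def majority_vote(labels):
    """Return the label predicted by the most classifiers; ties are
    broken by first occurrence in the input order."""
    counts = Counter(labels)
    best = max(counts.values())
    for lab in labels:                 # preserves input order on ties
        if counts[lab] == best:
            return lab
```

Each element of `labels` would be one classifier's predicted emotion for the same utterance.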
This paper describes an isolated word recognition method based on distinctive phonetic features (DPFs). The method comprises two multilayer neural networks (MLNs). The first MLN, MLNLF-DPF, maps local features (LFs) of an input speech signal into discrete DPFs and the second MLN, MLNDyn, restricts dynamics of outputted DPFs by the MLNLF-DPF. In the experiments on Tohokudai Isolated Spoken-Word Database...
In RBF neural network design, the number of hidden neurons and their parameters influence the performance of the network. This paper discusses the influence of pruning hidden neurons, using different criteria and parameters, on the speech recognition rate of a modified RBF neural network. First, we introduce three hidden-neuron pruning criteria; then we propose a modified RBF neural network; finally, recognition results before and...
This paper describes a methodology to recognize Thai speech words by integrating two approaches, i.e., double filter banks and Euclidean distance, in the feature extraction and recognition processes, respectively. Firstly, the speech signals are transformed into a 3-dimensional representation, the spectrogram, which displays energy information along both time and frequency axes. Secondly, the...
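For the recognition stage, Euclidean-distance matching against labelled reference templates can be sketched as follows (a nearest-template toy with assumed names; in the paper the vectors would be spectrogram-derived features, and real systems usually add time alignment such as DTW):

```python
import numpy as np

def nearest_template(features, templates):
    """Classify a feature vector by Euclidean distance to labelled
    reference templates, given as {label: template_vector}."""
    best_label, best_dist = None, float('inf')
    for label, tmpl in templates.items():
        dist = np.linalg.norm(features - tmpl)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

# Toy 2-D feature space with two word templates.
templates = {'ka': np.array([0.0, 0.0]), 'kha': np.array([1.0, 1.0])}
word = nearest_template(np.array([0.1, 0.0]), templates)
```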
Most research works in Information Extraction focus only on written language processing; only a few are devoted to the study of Spoken Language Information Extraction. This paper discusses a novel technique for recognition of isolated question words from a Malayalam (one of the south Indian languages) speech query. We have created and analyzed a database consisting of 250 isolated question...
This paper introduces Partially Connected Locally Recurrent Probabilistic Neural Networks (PC-LRPNN) as an extension of the well-known Probabilistic Neural Networks (PNN) and Locally Recurrent Probabilistic Neural Networks (LRPNN). Besides the definition of the PC-LRPNN architecture a fast four-step training method is proposed. The first two steps are identical to the training of traditional PNNs,...