Search results

Items 1 to 20 of 24 results

chapter

Malay speaker identification using Neural Networks

J D Tan, H N Ting

International Conference on Information Science and Technology > 476 - 479

2011 International Conference on Information Science and Technology (ICIST 2011)

This paper investigates Malay speaker identification using Neural Networks. A speech database was developed with five speakers as trainers and five speakers as imposters. The speech training set included 30 vowel sounds of five trainer speakers. The test set included 30 vowel sounds from the five trainers and 30 vowel sounds from five imposters. The speech sounds were sampled at 20 kHz with 16 bit...

chapter

Binaural Speaker Recognition for humanoid robots

Bastien Breteau, Sylvain Argentieri, Jean-Luc Zarader, Zefeng Wang, et al.

2010 IEEE International Conference on Robotics and Biomimetics > 1405 - 1410

2010 IEEE International Conference on Robotics and Biomimetics (ROBIO)

This paper deals with Automatic Speaker Recognition in a binaural context. This problem, not widely addressed within the speech processing community, has potential applications in humanoid robots, where speech can be used as the most natural interface between humans and robots. The proposed recognition system is based on parallel Predictive Neural Networks exploiting MFCCs (Mel Frequency...

chapter

Robust speech recognition by improvement missing features using Bidirectional Neural Network

Hojat Mohammadnejad, Mansoor Vali

2010 17th Iranian Conference of Biomedical Engineering (ICBME) > 1 - 4

2010 17th Iranian Conference Of Biomedical Engineering (ICBME 2010)

In this paper we present a new method for nonlinear compensation of mismatches, e.g. additive noise, in clean and noisy speech recognition. We were inspired by the human recognition system in the development and implementation of a new Bidirectional Neural Network (BNN). This procedure results in improved input features and consequently increases the overall recognition accuracy. The feedforward...

chapter

Classifier fusion for speech emotion recognition

Liqin Fu, Changjiang Wang, Yongmei Zhang

2010 IEEE International Conference on Intelligent Computing and Intelligent Systems > 3 > 407 - 410

2010 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2010)

According to the multidimensional emotion space model, an improved queuing voting algorithm was proposed to fuse multiple emotion classifiers for a good emotion recognition result. First, three kinds of classifiers were designed based on the hidden Markov model (HMM) and the artificial neural network (ANN). Then, the improved queuing voting algorithm was used to fuse them. Experimental...

chapter

Improved Malay Vowel Feature Extraction Method Based on First and Second Formants

Azmi M Y Shahrul, F Siraj, S Yaacob, M P Paulraj, et al.

2010 Second International Conference on Computational Intelligence, Modelling and Simulation > 339 - 344

2010 Second International Conference on Computational Intelligence, Modelling and Simulation (CIMSiM 2010)

There are many speech recognition applications that use vowel phonemes. Among them are speech therapy systems that improve word pronunciation, especially for children. There are also systems that teach hearing-impaired persons to speak properly by pronouncing words with a good degree of intelligibility. All of these systems require a high degree of vowel recognition capability. This paper...

chapter

A ANN Based High Quality Method for Voice Conversion

Z Chen, L H Zhang

2010 6th International Conference on Wireless Communications Networking and Mobile Computing (WiCOM) > 1 - 4

2010 6th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM)

In this paper, we describe a novel conversion method for voice conversion (VC). An Artificial Neural Network (ANN) model is employed to perform joint spectrum and pitch conversion between speakers. The conventional method converts spectral parameters and pitch independently. These separate transformations lead to unsatisfactory speech quality. The main reason may be that F₀ sequences are usually...

chapter

Isolated question words recognition from speech queries by using Artificial Neural Networks

A R Sukumar, A F Shah, P B Anto

2010 Second International conference on Computing, Communication and Networking Technologies > 1 - 4

2010 International Conference on Computing, Communication and Networking Technologies (ICCCNT'10)

Most research in Information Extraction focuses only on written language processing, and only a few works are devoted to the study of Spoken Language Information Extraction. This paper discusses a novel technique for recognizing isolated question words from Malayalam (one of the south Indian languages) speech queries. We have created and analyzed a database consisting of 250 isolated question...

chapter

Key-Word Based Query Recognition in a Speech Corpus by Using Artificial Neural Networks

A Raji Sukumar, A Sarin Sukumar, A Firoz Shah, Babu Anto P

2010 2nd International Conference on Computational Intelligence, Communication Systems and Networks > 33 - 36

2010 2nd International Conference on Computational Intelligence, Communication Systems and Networks (CICSyN 2010)

Information Retrieval deals with the easy access to the information based on the user's request, which will be presented in the form of a query. A dialog system that understands spoken natural language queries asks for further information if necessary and produces an answer to the speaker's query. Most of the research works in Information Extraction focus only on written language processing, in which...

chapter

Improving ANN performance for imbalanced data sets by means of the NTIL technique

Carlos Vivaracho-Pascual, Arancha Simon-Hurtado

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 6

2010 International Joint Conference on Neural Networks (IJCNN 2010)

This paper deals with the problem of training an Artificial Neural Network (ANN) when the data sets are very imbalanced. Most learning algorithms, including ANN, are designed for well-balanced data and do not work properly on imbalanced ones. Of the approaches proposed for dealing with this problem, we are interested in the re-sampling ones, since they are algorithm-independent. We have recently proposed...

chapter

Emotion recognition using LP residual

Arun Chauhan, Shashidhar G Koolagudi, Sabin Kafley, K Sreenivasa Rao

2010 IEEE Students Technology Symposium (TechSym) > 255 - 261

2010 IEEE Students' Technology Symposium (TechSym 2010)

This paper explores the Linear Prediction (LP) residual of the speech signal for characterizing the basic emotions. The emotions used in this study are anger, compassion, disgust, fear, happy, neutral, sarcastic and surprise. The LP residual is derived by inverse filtering of the speech signal, and the process is known as LP analysis. The LP residual mainly contains higher order relations among the samples. For...

chapter

Emotional speech analysis using artificial neural networks

Jana Tuckova, Martin Sramka

Proceedings of the International Multiconference on Computer Science and Information Technology > 141 - 147

2010 International Multiconference on Computer Science and Information Technology (IMCSIT 2010)

In the present text, we deal with the problem of classification of speech emotion. Problems of speech processing are addressed through the use of artificial neural networks (ANN). The results can be used for two research projects: prosody modelling and analysis of disordered speech. The first ANN topology discussed is the multilayer neural network (MLNN) with the BPG learning algorithm, while...

chapter

An Automatic Grading Model for Learning Assessment

Yang Liu

2010 International Conference on e-Education, e-Business, e-Management and e-Learning > 217 - 220

2010 International Conference on e-Education, e-Business, e-Management, and e-Learning, (IC4E)

Particle swarm optimization (PSO) is an algorithm modelled on swarm intelligence that finds a solution to an optimization problem in a search space. In this paper, a PSO-based artificial neural network algorithm is proposed for automatically grading learning results. Basically, the PSO algorithm is utilized to adjust the connection weights of the selected ANN topology. Taking Mandarin learning as...

chapter

Speaker and text dependent automatic emotion recognition from female speech by using artificial neural networks

S.A. Firoz, S.A. Raji, A.P. Babu

2009 World Congress on Nature & Biologically Inspired Computing (NaBIC) > 1411 - 1413

2009 World Congress on Nature & Biologically Inspired Computing (NaBIC 2009)

We have created and analyzed an elicited emotional database consisting of 340 emotional speech samples under four different emotions: neutral, happy, sad and anger. Malayalam (one of the south Indian languages) was used for the experiment. The Daubechies8 wavelet was used for feature extraction and an artificial neural network was used for pattern recognition. An overall recognition accuracy of 72.055% obtained...

chapter

Speaker segmentation using parallel fusion between three classifiers

S. Ouamour, H. Sayoud, M. Guerti

2009 3rd International Conference on Signals, Circuits and Systems (SCS) > 1 - 4

2009 3rd International Conference on Signals, Circuits and Systems (SCS 2009)

In this paper, we deal with the problem of speaker segmentation. This task consists of splitting the audio document into homogeneous areas, each attributed to one speaker. Speaker segmentation (or speaker change detection) consists of detecting the points where the speaker identity changes in a multi-speaker audio stream. These points or times are called "Break Points".

chapter

Speaker Independent Automatic Emotion Recognition from Speech: A Comparison of MFCCs and Discrete Wavelet Transforms

A. Firoz Shah, V.R. Vimal Krishnan, A. Raji Sukumar, A. Jayakumar, et al.

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 528 - 531

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

Automatic Emotion Recognition (AER) from speech is one of the most interesting research domains for the scientific world. AER simply means enabling a machine to recognize different emotions from speech. We have created and analyzed an elicited database consisting of 700 utterances under four different emotional classes: neutral, happy, sad and anger. Malayalam (One of the south Indian...

chapter

Feature analysis for quality assessment of reverberated speech

A.A. de Lima, T.M. de Prego, S.L. Netto, B. Lee, et al.

2009 IEEE International Workshop on Multimedia Signal Processing > 1 - 5

2009 IEEE International Workshop on Multimedia Signal Processing (MMSP)

This paper analyzes the ability of several measurements to quantify the reverberation effect in speech signals. We consider an intrusive scheme, in which the clean and reverberated signals are available, allowing one to estimate the corresponding room impulse response (RIR) signal. An artificial neural network (ANN) is trained for all features and used in a regression approach to estimate the human...

chapter

Emotion Recognition in Spontaneous Speech within Work and Family Environments

Ling He, M. Lech, N. Maddage, S. Memon, et al.

2009 3rd International Conference on Bioinformatics and Biomedical Engineering > 1 - 4

2009 3rd International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2009)

The speech signal is an important tool for conveying information between humans; at the same time, it is an indicator of a speaker's emotions. In this paper, the automatic identification of affect from speech containing spontaneously expressed (not acted) emotions within different environments was investigated. The Teager energy operator-perceptual wavelet packet (TEO-PWP) features as well as the...

chapter

A Comparative Study to Evaluate a Text-Independent Speaker Identification Engine for Arabic Speakers Using a CHMM-Based Approach

H. Tolba

2009 16th International Conference on Systems, Signals and Image Processing > 1 - 4

2009 16th International Conference on Systems, Signals and Image Processing

This paper reports a comparative study between two identification engines to identify speakers automatically from their voices when speaking spontaneously in Arabic. The first engine is based on the continuous hidden Markov models (CHMMs) while the second one is based on the artificial neural networks (ANNs). The Mel frequency cepstral coefficients (MFCCs) were selected to describe the speech signal...

chapter

Relative Speech Emotion Recognition Based Artificial Neural Network

Liqin Fu, Xia Mao, Lijiang Chen

2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application > 2 > 140 - 144

2008 Pacific-Asia Workshop on Computational Intelligence and Industrial Application. PACIIA 2008

Artificial neural network (ANN) models based on static feature vectors as well as normalized temporal feature vectors were used to recognize emotional state from speech. Moreover, relative features obtained by computing the changes of acoustic features of emotional speech relative to those of neutral speech were adopted to weaken the influence of individual differences. The methods to relativize...

chapter

Global syllable set for building speech synthesis in Indian languages

E.V. Raghavendra, S. Desai, B. Yegnanarayana, A.W. Black, et al.

2008 IEEE Spoken Language Technology Workshop > 49 - 52

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

Indian languages are syllabic in nature, and many syllables are common across them. This motivates us to build a global syllable set by combining syllables from multiple languages, so that a synthesizer can borrow units from a different language when the required syllable is not found. Such a synthesizer makes use of speech databases in different languages spoken by different speakers,...

Filter options

Keywords:
SPEECH
NEURAL NETS

Content availability

Available (23)
Unavailable (1)

Keywords

SPEECH RECOGNITION (17)
ARTIFICIAL NEURAL NETWORKS (16)
ARTIFICIAL NEURAL NETWORK (10)
FEATURE EXTRACTION (9)
TRAINING (9)
EMOTION RECOGNITION (7)
SPEECH PROCESSING (7)
ACCURACY (6)
GAUSSIAN PROCESSES (5)
SIGNAL CLASSIFICATION (4)
SPEAKER RECOGNITION (4)
ACOUSTICS (3)
ANN (3)
CEPSTRAL ANALYSIS (3)
DISCRETE WAVELET TRANSFORM (3)
DISCRETE WAVELET TRANSFORMS (3)
HIDDEN MARKOV MODELS (3)
MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
MEL FREQUENCY CEPSTRAL COEFFICIENTS (3)
NEURAL NETWORKS (3)
ANN TOPOLOGY (2)
ARTIFICIAL INTELLIGENCE (2)
BANDWIDTH (2)
DATA MINING (2)
ESTIMATION (2)
GAUSSIAN MIXTURE MODEL (2)
GAUSSIAN MIXTURE MODELS (2)
INDIAN LANGUAGES (2)
ISOLATED QUESTION WORD RECOGNITION (2)
MATHEMATICAL MODEL (2)
MULTI LAYER PERCEPTRON (2)
MULTILAYER PERCEPTRON (2)
MULTILAYER PERCEPTRONS (2)
NATURAL LANGUAGE PROCESSING (2)
NEURONS (2)
PROPOSALS (2)
QUERY PROCESSING (2)
REGRESSION ANALYSIS (2)
REVERBERATION (2)
SPEECH DATABASE (2)
SPEECH EMOTION RECOGNITION (2)
SPOKEN LANGUAGE INFORMATION EXTRACTION (2)
SUPPORT VECTOR MACHINES (2)
ACOUSTIC NOISE (1)
ACOUSTIC SIGNAL PROCESSING (1)
ADDITIVE NOISE (1)
AFFECTIVE COMPUTING (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANALYTICAL MODELS (1)
ANN REGRESSION (1)
ARABIC SPEAKERS (1)
ARTIFICIAL NEURAL NETWORK MODEL (1)
AUDIO DATABASES (1)
AUDIO DOCUMENT (1)
AUTOASSOCIATIVE NEURAL NETWORK (1)
AUTOMATIC EMOTION RECOGNITION (1)
AUTOMATIC GRADING MODEL (1)
AUTOMATIC SPEAKER RECOGNITION (1)
AUTOMATIC SPEECH RECOGNITION (1)
BACKPROPAGATION (1)
BANDWIDTH 3.1 KHZ (1)
BANDWIDTH 8 KHZ (1)
BASIS FILTERS (1)
BEIHANG UNIVERSITY (1)
BERLIN DATABASE (1)
BERLIN DATABASE OF EMOTIONAL SPEECH (1)
BIDIRECTIONAL NEURAL NETWORK (1)
BINAURAL SPEAKER RECOGNITION (1)
BIOLOGICAL SYSTEM MODELING (1)
BNN (1)
BPG LEARNING ALGORITHM (1)
BROADCAST NEWS (1)
BROADCAST NEWS DOMAIN (1)
BROADCASTING (1)
BUILDINGS (1)
CEPSTRAL COEFFICIENTS (1)
CEPSTRUM (1)
CHANNEL EQUALIZATION (1)
CHMM-BASED APPROACH (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHMS (1)
CLASSIFIER FUSION (1)
CLEAN SPEECH RECOGNITION (1)
COMMUNICATIONS TECHNOLOGY (1)
COMPENSATION (1)
COMPUTER ARCHITECTURE (1)
COMPUTER LANGUAGES (1)
COMPUTERS (1)
CONTINUOUS HIDDEN MARKOV MODELS (1)
CONVEX COMBINATION (1)
CORRELATION (1)
DATA ACQUISITION (1)
DATABASE (1)
DATABASE MANAGEMENT SYSTEMS (1)
DAUBECHIES8 WAVELET (1)
DISCRETE COSINE TRANSFORM (1)
DISCRETE COSINE TRANSFORMS (1)

INFONA - science communication portal