Search results

Items from 1 to 20 out of 28 results

chapter

Feature selection experiments on emotional speech classification

Piyawat Sukhummek, Sawit Kasuriya, Thanaruk Theeramunkong, Chai Wutiwiwatchai, more

2015 12th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON) > 1 - 4

2015 12th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

This paper presents the experiments on feature selection for emotional speech classification. There are 152 features used in this experiment. The minimum redundancy maximum relevance (mRMR) feature selection is applied as the features selection. The experiments are constructed from two corpora; Interactive Emotional Dyadic Motion Capture (IEMOCAP) and Emotional Tagged Corpus on Lakorn (EMOLA) which...

chapter

Automatic identification of bird species: A comparison between kNN and SOM classifiers

Dorota Kaminska, Artur Gmerek

2012 Joint Conference New Trends In Audio & Video And Signal Processing: Algorithms, Architectures, Arrangements And Applications (NTAV/SPA) > 77 - 82

2012 Joint Conference New Trends in Audio & Video and Signal Processing: Algorithms, Architectures, Arrangements, and Applications (NTAV/SPA)

This paper presents a system for automatic bird identification, which uses audio input. The experiments have been conducted on three groups of birds, which were created basing finishing on classification, the system is fully automated. The main problem in automatic bird recognition (ABR) is the choice of proper features and classifiers. Identification has been made using two classifiers-kNN (k Nearest...

chapter

Akshara transcription of mrudangam strokes in Carnatic music

Jom Kuriakose, J Chaitanya Kumar, Padi Sarala, Hema A Murthy, more

2015 Twenty First National Conference on Communications (NCC) > 1 - 6

2015 Twenty First National Conference on Communications (NCC)

Percussion instruments play a significant role in Carnatic music concerts. The percussion artist enjoys a great degree of freedom in improvising within the defined tāla structure of a composition. The objective of this paper is to transcribe the improvisations, treating the percussion strokes as syllables or aksharas.

chapter

Speaker based Language Independent Isolated Speech Recognition System

Shanthi Therese S., Chelpa Lingam

2015 International Conference on Communication, Information & Computing Technology (ICCICT) > 1 - 7

2015 International Conference on Communication, Information & Computing Technology (ICCICT)

This paper presents a speaker based Language Independent Isolated Speech Recognition System (LIISRS). The most popular feature extraction technique Mel Frequency Cepstral Coefficients (MFCC) is used for training the system. Representative specific features are identified using K-Means algorithm. Distortion measure is calculated using Euclidian distance function. Pitch contour characteristics are used...

chapter

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

Khan Suhail Ahmad, Anil S. Thosar, Jagannath H. Nirmal, Vinay S. Pande

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 6

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)

This paper motivates the use of combination of mel frequency cepstral coefficients (MFCC) and its delta derivatives (DMFCC and DDMFCC) calculated using mel spaced Gaussian filter banks for text independent speaker recognition. MFCC modeled on the human auditory system shows robustness against noise and session changes and hence has become synonymous with speaker recognition. Our main aim is to test...

chapter

Proposed combination of PCA and MFCC feature extraction in speech recognition system

Hoang Trang, Tran Hoang Loc, Huynh Bui Hoang Nam

2014 International Conference on Advanced Technologies for Communications (ATC 2014) > 697 - 702

2014 International Conference on Advanced Technologies for Communications (ATC)

In speech recognition system, the Mel Frequency Cepstrum Coefficients (i.e. MFCC) feature extraction is an important process. It has also been wildly used in many applications. In this paper, we present the conventional MFCC feature extraction method and propose two novel versions of MFCC method that will combine the PCA technique and conventional MFCC feature extraction method. Finally, these three...

chapter

Devnagari Phoneme Recognition System

Priyanka Pratapsinh Patil, Sanjay Arjunsinh Pardeshi

2014 Fourth International Conference on Advances in Computing and Communications > 5 - 8

2014 Fourth International Conference on Advances in Computing and Communications (ICACC)

Devnagari (Marathi) is an Indo-Aryan language and has a number of speakers all around the world. Marathi language has gained acceptability in the media & communication and therefore deserves to have a place in the growing field of automatic speech recognition. This manuscript describes the automatic speech recognition system that recognizes Marathi phoneme using Continuous Density Hidden Markov...

chapter

Auditory scene analysis and recognition with LDA topic model

Feng Su

2014 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2014 IEEE International Conference on Multimedia and Expo (ICME)

Analysis and recognition of auditory scenes play an important role in content-based multimedia processing and context-aware applications. In this paper, we propose an auditory scene recognition scheme that integrates the analysis of the audio data of scene with LDA topic model to discover latent structures (i.e. contextual correlations) of audio words, and generation of intermediate contextual descriptions...

chapter

Impact of gender and emotion type in dialogue emotion recognition

Farah Chenchah, Zied Lachiri

2014 1st International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 464 - 467

2014 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

This paper presents a dialogue emotion recognition system using Hidden Markov Model (HMM). We have compared accuracy of Mel-frequency cepstral coefficients (MFCC), Energy, and wavelet sub-band energies and their first derivative and all possible combination. Based on our experiment, MFCC show better performance in comparison with the other studied features. We have also evaluated the impact of gender...

chapter

Hidden Markov model neurons classification based on Mel-frequency cepstral coefficients

Sherif Haggag, Shady Mohamed, Hussein Haggag, Saeid Nahavandi

2014 9th International Conference on System of Systems Engineering (SOSE) > 166 - 170

2014 9th International Conference on System of Systems Engineering (SOSE)

In neuroscience, the extracellular actions potentials of neurons are the most important signals, which are called spikes. However, a single extracellular electrode can capture spikes from more than one neuron. Spike sorting is an important task to diagnose various neural activities. The more we can understand neurons the more we can cure more neural diseases. The process of sorting these spikes is...

chapter

Modification of widely used feature vectors for real-time acoustic events detection

Martin Lojka, Matus Pleva, Jozef Juhar, Eva Kiktova

Proceedings ELMAR-2013 > 199 - 202

2013 55th International Symposium ELMAR

Besides video surveillance system for monitoring large urban areas also the acoustic events detection system can be used. The acoustic detection system is monitoring potentially dangerous sounds and in case of detection an alarm is produced. We developed our own approach to the acoustic events detection system with modified Viterbi decoder operating over HMM (Hidden Markov Models) especially adapted...

chapter

Bangla ASR design by suppressing gender factor with gender-independent and gender-based HMM classifiers

Foyzul Hassan, Mohammed Rokibul Alam Kotwal, Mohammad Nurul Huda

2011 World Congress on Information and Communication Technologies > 1276 - 1281

2011 World Congress on Information and Communication Technologies (WICT)

Hidden factor such as gender characteristic plays an important role on the performance of Bangla (widely used as Bengali) automatic speech recognition (ASR). If there is a suppression process that represses the decrease of differences in acoustic-likelihood among categories resulted from gender factors, a robust ASR system can be realized. In our previous paper, we proposed a technique of gender effects...

chapter

Hybridization of two stage Multilayer Neural Networks based Bangla ASR incorporating dynamic parameters

Mohammed Rokibul Alam Kotwal, Md. Abdur Razzaque, Arif Hossen, Mohammad Nurul Huda

2011 11th International Conference on Hybrid Intelligent Systems (HIS) > 167 - 172

2011 11th International Conference on Hybrid Intelligent Systems (HIS 2011)

This paper presents a hybridization of Multilayer Neural Network-based Bangla phoneme recognition method for Automatic Speech Recognition (ASR) incorporating dynamic parameters. The method consists of four stages: at first stage, a multilayer neural network (MLN) converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities. Phoneme probabilities from the first...

chapter

Gender Effects Suppression in Bangla ASR by Designing Multiple HMM-Based Classifiers

Mohammed Rokibul Alam Kotwal, Foyzul Hassan, Md. Shafiul Alam, Shakib Ibn Daud, more

2011 International Conference on Computational Intelligence and Communication Networks > 390 - 394

2011 International Conference on Computational Intelligence and Communication Networks (CICN)

Speaker-specific characteristics play an important role on the performance of Bangla (widely used as Bengali) automatic speech recognition (ASR). It is difficult to recognize speech affected by gender factors, especially when an ASR system contains only a single acoustic model. If there exists any suppression process that represses the decrease of differences in acoustic-likelihood among categories...

chapter

Hybrid Features for Neural Network-Based Bangla ASR Incorporrating Velocity Coefficients (?)

Mohammed Rokibul Alam Kotwal, Foyzul Hassan, Shakib Ibn Daud, Md. Shafiul Alam, more

2011 International Conference on Computational Intelligence and Communication Networks > 416 - 420

2011 International Conference on Computational Intelligence and Communication Networks (CICN)

This paper presents a Neural Network-based Bangla phoneme recognition method for Automatic Speech Recognition (ASR). The method consists of three stages: at first stage, a multilayer neural network (MLN) converts acoustic features, mel frequency cepstral coefficients (MFCCs), into phoneme probabilities, where the second stage computes velocity (?) coefficients from the phoneme probabilities by using...

chapter

Acoustic features for detection of aspirated stops

V Patil, P Rao

2011 National Conference on Communications (NCC) > 1 - 5

2011 National Conference on Communications (NCC)

Aspiration is an important phonemic feature in several Indian languages. Unlike English, languages such as Marathi have lexicons in which words with different meanings differ only in the aspiration feature of the initial voiced or unvoiced stop. Thus the reliable discrimination of aspirated stops from their unaspirated counterparts is important in automatic speech recognition for such languages. The...

chapter

Automatic detection for some common pronunciation mistakes applied to chosen Quran sounds

M S Abdo, A H Kandil, A M El-Bialy, S A Fawzy

2010 5th Cairo International Biomedical Engineering Conference > 219 - 222

2010 5th Cairo International Biomedical Engineering Conference (CIBEC 2010)

This paper describes an automatic system for the detection of some common pronunciation mistakes occurring in Quran recitation. It addresses the application of the Arabic language pronunciation rules. The system is a basic step towards a complete automatic teaching system of the Holy Quran recitation rules. The focus of this study is to detect the non proper pronunciation of a chosen set of emphatic...

chapter

Multi-layered features with SVM for Chinese accent identification

Jue Hou, Yi Liu, T F Zheng, J Olsen, more

2010 International Conference on Audio, Language and Image Processing > 25 - 30

2010 International Conference on Audio, Language and Image Processing (ICALIP)

In this paper, we propose an approach of multi-layered feature combination associated with support vector machine (SVM) for Chinese accent identification. The multi-layered features include both segmental and suprasegmental information, such as MFCC and pitch contour, to capture the diversity of variations in Chinese accented speech. The pitch contour is estimated using cubic polynomial method to...

chapter

Auditory Features Revisited for Robust Speech Recognition

F Kelly, N Harte

2010 20th International Conference on Pattern Recognition > 4456 - 4459

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Auditory based front-ends for speech recognition have been compared before, but this paper focuses on two of the most promising algorithms for noise robustness in automatic speech recognition (ASR). The feature sets are Zero-Crossings with Peak Amplitudes (ZCPA) and the recently introduced Power-Law Nonlinearity and Power-Bias Subtraction (PNCC). Standard Mel-Frequency Cepstral Coefficients (MFCC)...

chapter

Environment Recognition from Audio Using MPEG-7 Features

G. Muhammad, K. Alghathbar

2009 Fourth International Conference on Embedded and Multimedia Computing > 1 - 6

2009 Fourth International Conference on Embedded and Multimedia Computing (EM-Com 2009)

In this paper, we introduce a full use of MPEG-7 audio features for environment recognition from audio for different multimedia applications. Environment recognition from audio files is a growing area of interest, however, compared to other branches of multimedia it is a less researched one. To recognize environment, we utilize total of 17 temporal and spectral MPEG-7 audio low level descriptors as...

Data set:
ieee
Keywords:
FEATURE EXTRACTION
ACCURACY
MEL FREQUENCY CEPSTRAL COEFFICIENT
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

SPEECH (17)
SPEECH RECOGNITION (15)
MFCC (7)
TRAINING (7)
AUTOMATIC SPEECH RECOGNITION (6)
SPEECH PROCESSING (6)
COMPUTATIONAL MODELING (5)
HIDDEN MARKOV MODEL (5)
SPEAKER RECOGNITION (5)
CORRELATION (4)
DATABASES (4)
PRINCIPAL COMPONENT ANALYSIS (4)
ROBUSTNESS (4)
SUPPORT VECTOR MACHINES (4)
ACOUSTIC MODEL (3)
AUDIO SIGNAL PROCESSING (3)
CEPSTRAL ANALYSIS (3)
CLASSIFICATION ALGORITHMS (3)
CLUSTERING ALGORITHMS (3)
COMPUTERS (3)
ELECTRONIC MAIL (3)
HMM (3)
NOISE (3)
PATTERN RECOGNITION (3)
SIGNAL PROCESSING (3)
ACOUSTIC MEASUREMENTS (2)
ACOUSTICS (2)
ALGORITHM DESIGN AND ANALYSIS (2)
ARTIFICIAL NEURAL NETWORKS (2)
CLASSIFICATION (2)
CONFERENCES (2)
DATA MINING (2)
DATA MODELS (2)
MATHEMATICAL MODEL (2)
MEL FREQUENCY CEPSTRAL COEFFICIENTS (2)
MEL-FREQUENCY CEPSTRAL COEFFICIENTS (2)
MULTILAYER NEURAL NETWORK (2)
MULTIMEDIA COMMUNICATION (2)
NOISE ROBUSTNESS (2)
PITCH CONTOUR (2)
POLYNOMIALS (2)
SIGNAL PROCESSING ALGORITHMS (2)
TESTING (2)
TRAINING DATA (2)
VECTORS (2)
WHITE NOISE (2)
WRITING (2)
ACCELERATION (1)
ACOUSTIC EVENTS (1)
ACOUSTIC FEATURES (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTIC SURVEILLANCE (1)
ADDITIVE NOISE (1)
ANALYTICAL MODELS (1)
APPROXIMATION ALGORITHMS (1)
ARABIC LANGUAGE PRONUNCIATION RULES (1)
ARENAL VOLCANO (1)
ASPIRATED STOP DETECTION (1)
ASPIRATION (1)
ASPIRATION FEATURE DETECTION (1)
AUDIO ENVIRONMENT RECOGNITION (1)
AUDIO FEATURE REPRESENTATION (1)
AUDIO FILES (1)
AUDITORY BASED FRONT-ENDS (1)
AUDITORY CONTEXT RECOGNITION (1)
AUDITORY FEATURES (1)
AUDITORY SCENE (1)
AUDITORY SYSTEM (1)
AUTOMATIC CLASSIFICATION (1)
AUTOMATIC DETECTION (1)
AUTOMATIC TEACHING SYSTEM (1)
AWGN (1)
BACKGROUND NOISE (1)
BIRDS (1)
BOOKS (1)
BUILDINGS (1)
CHAOS (1)
CHAOTIC SIGNAL PROCESSING TECHNIQUE (1)
CHINESE ACCENTED SPEECH (1)
CHINESE ACCENTED SPEECH IDENTIFICATION (1)
CLUSTERING (1)
CLUSTERING ALGORITHM (1)
CLUSTERING METHODS (1)
COLIMA VOLCANO (1)
COMPANIES (1)
COMPLEXITY THEORY (1)
CONTEXT (1)
CONTEXT MODELING (1)
CONTINUOUS DENSITY HIDDEN MARKOV MODEL(CDHMM) (1)
CONVOLUTION (1)
COSTA RICA (1)
COVARIANCE MATRIX (1)
CUBIC POLYNOMIAL METHOD (1)
CULTURAL DIFFERENCES (1)
DECODING (1)
DELTA DERIVATIVES (1)
more

INFONA - science communication portal

Search results

Feature selection experiments on emotional speech classification

Automatic identification of bird species: A comparison between kNN and SOM classifiers

Akshara transcription of mrudangam strokes in Carnatic music

Speaker based Language Independent Isolated Speech Recognition System

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

Proposed combination of PCA and MFCC feature extraction in speech recognition system

Devnagari Phoneme Recognition System

Auditory scene analysis and recognition with LDA topic model

Impact of gender and emotion type in dialogue emotion recognition

Hidden Markov model neurons classification based on Mel-frequency cepstral coefficients

Modification of widely used feature vectors for real-time acoustic events detection

Bangla ASR design by suppressing gender factor with gender-independent and gender-based HMM classifiers

Hybridization of two stage Multilayer Neural Networks based Bangla ASR incorporating dynamic parameters

Gender Effects Suppression in Bangla ASR by Designing Multiple HMM-Based Classifiers

Hybrid Features for Neural Network-Based Bangla ASR Incorporrating Velocity Coefficients (?)

Acoustic features for detection of aspirated stops

Automatic detection for some common pronunciation mistakes applied to chosen Quran sounds

Multi-layered features with SVM for Chinese accent identification

Auditory Features Revisited for Robust Speech Recognition

Environment Recognition from Audio Using MPEG-7 Features

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options