One of the major challenges in human emotion recognition is the extraction of features containing maximum prosodic information. The accuracy of the entire emotion detection system ultimately relies on the efficiency of the selected features. When it comes to identifying emotions from voice, ambiguity in detection can never be completely avoided, for several reasons. Exclusion of redundant information to...
The performance of speech classification tasks can be improved by accurate acoustic modeling, which is responsible for establishing the relationship between the speech signal and the phonetic units produced by the speaker. In this paper, Acoustic Modeling (AM) is performed using the Reservoir Computing (RC) technique, with which the input speech signal frames can be identified and classified...
Nowadays, speech-based biometric systems such as automatic speaker verification (ASV) are highly prone to spoofing attacks by an impostor. With recent developments in various voice conversion (VC) and speech synthesis (SS) algorithms, these spoofing attacks pose a serious potential threat to current state-of-the-art ASV systems. To impede such attacks and enhance the security of the ASV...
Home automation with voice recognition can achieve a high level of performance in real-world environments. However, such performance drops significantly in mismatched noisy conditions. To solve this problem, we propose an improved method of extracting Mel Frequency Cepstral Coefficients (MFCC) that increases accuracy by up to 20% over the traditional method. This paper describes an approach to speech recognition...
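The standard MFCC pipeline that this abstract builds on can be sketched as follows. This is an illustrative baseline implementation, not the paper's improved method; the frame length, hop size, filter count, and coefficient count below are common defaults, assumed here for concreteness.

```python
import numpy as np

def mel_filterbank(n_filters, n_fft, sr):
    """Triangular filters spaced evenly on the mel scale."""
    mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    inv_mel = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    mel_pts = np.linspace(mel(0.0), mel(sr / 2), n_filters + 2)
    bins = np.floor((n_fft + 1) * inv_mel(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        l, c, r = bins[i - 1], bins[i], bins[i + 1]
        for k in range(l, c):
            fb[i - 1, k] = (k - l) / max(c - l, 1)  # rising edge
        for k in range(c, r):
            fb[i - 1, k] = (r - k) / max(r - c, 1)  # falling edge
    return fb

def mfcc(signal, sr=16000, frame_len=400, hop=160, n_fft=512,
         n_filters=26, n_coeffs=13):
    """Frame -> window -> power spectrum -> mel filterbank -> log -> DCT-II."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)
    fb = mel_filterbank(n_filters, n_fft, sr)
    coeffs = []
    for t in range(n_frames):
        frame = signal[t * hop : t * hop + frame_len] * window
        spec = np.abs(np.fft.rfft(frame, n_fft)) ** 2
        energies = np.log(fb @ spec + 1e-10)
        # DCT-II of the log filterbank energies, keeping the first n_coeffs
        n = np.arange(n_filters)
        dct = np.array([np.sum(energies * np.cos(np.pi * k * (2 * n + 1)
                        / (2 * n_filters))) for k in range(n_coeffs)])
        coeffs.append(dct)
    return np.array(coeffs)

# Example: one second of a 440 Hz tone at 16 kHz
sig = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
feats = mfcc(sig)
print(feats.shape)  # (n_frames, 13)
```

A noise-robust variant like the one proposed would typically modify a stage of this pipeline (e.g. the filterbank or the spectral estimate) while keeping the overall structure.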
With information processing and retrieval of spoken documents becoming an important topic, there is a need for systems that perform automatic segmentation of audio streams. Among such algorithms, spoken term discovery allows the extraction of word-like units (terms) directly from the continuous speech signal, in an unsupervised manner and without any knowledge of the language at hand. Since the performance...
Classification of speech signals is one of the most vital problems in speech perception and spoken word recognition. Although there have been many studies on the classification of speech signals, the results are still limited. In this paper, we propose an image-based approach to speech signal classification based on the combination of Local Naïve Bayes Nearest Neighbor (LNBNN) and Scale-invariant...
Mispronunciation is commonly observed in children from age 2 to 8 years. Some of the common mispronunciations are stopping, fronting, backing and affrication. These processes are known as phonological processes. Identification of these processes is crucial in studying the vocal tract development pattern and treating the phonological disorders in children. The features that clearly discriminate correctly...
This paper presents experiments on feature selection for emotional speech classification. There are 152 features used in this experiment. The minimum redundancy maximum relevance (mRMR) method is applied for feature selection. The experiments are constructed from two corpora: the Interactive Emotional Dyadic Motion Capture (IEMOCAP) corpus and the Emotional Tagged Corpus on Lakorn (EMOLA), which...
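The greedy mRMR selection loop can be sketched as below. This is an illustrative simplification, not the paper's exact setup: the canonical criterion uses mutual information, whereas this sketch approximates relevance and redundancy with absolute Pearson correlation, and the data is synthetic.

```python
import numpy as np

def mrmr_select(X, y, k):
    """Greedy minimum-redundancy maximum-relevance selection.
    Relevance and redundancy are approximated with absolute Pearson
    correlation (a common simplification of the MI-based criterion)."""
    n_feat = X.shape[1]
    # relevance of each feature: |corr(feature, label)|
    rel = np.array([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(n_feat)])
    selected = [int(np.argmax(rel))]          # start with the most relevant
    while len(selected) < k:
        best, best_score = None, -np.inf
        for j in range(n_feat):
            if j in selected:
                continue
            # redundancy: mean correlation with already-selected features
            red = np.mean([abs(np.corrcoef(X[:, j], X[:, s])[0, 1])
                           for s in selected])
            score = rel[j] - red              # MID criterion: relevance - redundancy
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = X[:, 3] + 0.1 * rng.normal(size=200)          # feature 3 drives the label
X[:, 7] = X[:, 3] + 0.3 * rng.normal(size=200)    # noisier redundant copy
picks = mrmr_select(X, y, 3)
print(picks[0])  # feature 3 is selected first
```

The redundancy term is what distinguishes mRMR from plain relevance ranking: the near-duplicate feature 7 is penalized even though its correlation with the label is high.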
Tree-based context clustering processes reduce the sizes of acoustic models in Hidden Markov Model (HMM) speech synthesis systems and eliminate problems arising from unseen sound units. Representations of speech units in speech synthesis systems are often LPC or MCEP features, whose characteristics promote speech reconstruction rather than discrimination among different sound units. In this...
This manuscript investigates the problem of classifying continuous general-purpose audio data for content-based retrieval. The paper presents a scheme for classifying audio data; segmentation is also performed on the same data so that the processing rate is faster. The audio data can be classified into eight categories: simple speech, noise, silence, music, single speech with music, double speech with...
In this paper, we discuss efficient implementation of machine learning algorithms on DSPs. Specifically, we implement OCR and speech recognition on DSP and show how they can be optimized using fixed point routines. We illustrate the optimal usage of DSP resources like MAC units, shifters and software pipelining through assembly code structuring which massively reduces the MIPS consumed by the processor...
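The fixed-point routines the abstract refers to can be illustrated with a Q15 multiply-accumulate, the operation a DSP MAC unit performs in hardware. This is a conceptual sketch in Python, not the authors' DSP assembly; the Q15/Q30 format choice is an assumption, though it is the usual one on 16-bit DSPs.

```python
def float_to_q15(x):
    """Quantize a float in [-1, 1) to a 16-bit Q15 integer."""
    return max(-32768, min(32767, int(round(x * 32768))))

def q15_mac(acc, a, b):
    """Multiply-accumulate: the Q30 product of two Q15 values is
    added to a wide accumulator, as a DSP MAC unit does in one cycle."""
    return acc + a * b

def q15_dot(xs, ys):
    """Fixed-point dot product of two Q15 vectors, result as float."""
    acc = 0
    for a, b in zip(xs, ys):
        acc = q15_mac(acc, a, b)
    return acc / (1 << 30)  # rescale the Q30 accumulator back to float

xs = [float_to_q15(v) for v in (0.5, -0.25, 0.125)]
ys = [float_to_q15(v) for v in (0.5, 0.5, 0.5)]
print(q15_dot(xs, ys))  # 0.5*0.5 - 0.25*0.5 + 0.125*0.5 = 0.1875
```

Replacing floating-point multiplies with integer MACs like this is the core of the MIPS reduction on DSPs, since the MAC executes in a single cycle and pipelines well.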
Voice is the most prominent and primary mode of communication among human beings. Through speech, humans can communicate with machines; thus this technique is used in the education, military, and medical sectors. Though this is not a new area, for the last few decades researchers have been working on improving the accuracy of voice recognition systems. The design of such a system concerns major issues...
The objective of this paper is to study the effect of speaking mode on a spoken term detection (STD) system. The experiments are conducted with respect to query words recorded in an isolated manner and words cut out from continuous speech. Durations of phonemes in query words vary greatly between these two modes. Hence the pattern matching stage, which takes care of temporal variations, plays a crucial role...
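Temporal variation between isolated and continuous-speech query words is classically handled with dynamic time warping (DTW); the abstract does not name its matcher, so the following is a generic sketch of that standard technique, here on scalar sequences rather than real feature vectors.

```python
def dtw_distance(a, b, dist=lambda x, y: abs(x - y)):
    """Dynamic time warping cost between two sequences.
    Aligns sequences of different durations by letting each point
    match one or more points of the other sequence."""
    inf = float("inf")
    n, m = len(a), len(b)
    D = [[inf] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = dist(a[i - 1], b[j - 1])
            D[i][j] = cost + min(D[i - 1][j],      # stretch a
                                 D[i][j - 1],      # stretch b
                                 D[i - 1][j - 1])  # one-to-one match
    return D[n][m]

# A query and a time-stretched version of it align at zero cost...
print(dtw_distance([1, 2, 3, 4], [1, 2, 2, 3, 3, 4]))  # 0.0
# ...while a genuinely different pattern accumulates cost.
print(dtw_distance([1, 2, 3, 4], [4, 3, 2, 1]))
```

In a real STD system the sequences would be per-frame feature vectors (e.g. MFCCs) and `dist` a vector distance, but the alignment recurrence is identical.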
This paper presents a speaker-based Language Independent Isolated Speech Recognition System (LIISRS). The most popular feature extraction technique, Mel Frequency Cepstral Coefficients (MFCC), is used for training the system. Representative specific features are identified using the K-Means algorithm. The distortion measure is calculated using the Euclidean distance function. Pitch contour characteristics are used...
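The K-Means step with a Euclidean distortion measure can be sketched as follows. This is a generic illustration on toy 2-D vectors, not the paper's configuration; the naive first-k initialization is an assumption made to keep the sketch deterministic.

```python
import math

def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def kmeans(vectors, k, iters=50):
    """Plain K-Means: find k representative vectors (centroids)
    and report the total Euclidean distortion of the codebook."""
    # naive initialization: the first k vectors (fine for a sketch)
    centroids = [list(v) for v in vectors[:k]]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in vectors:  # assign each vector to its nearest centroid
            idx = min(range(k), key=lambda i: euclidean(v, centroids[i]))
            clusters[idx].append(v)
        for i, cl in enumerate(clusters):  # recompute centroids
            if cl:
                centroids[i] = [sum(col) / len(cl) for col in zip(*cl)]
    distortion = sum(min(euclidean(v, c) for c in centroids) for v in vectors)
    return centroids, distortion

# Two well-separated groups of 2-D "feature vectors"
data = [[0.0, 0.1], [0.1, 0.0], [0.1, 0.1],
        [5.0, 5.1], [5.1, 5.0], [5.1, 5.1]]
centroids, distortion = kmeans(data, 2)
print(round(distortion, 3))  # small: each point sits near its centroid
```

In the described system the vectors would be per-frame MFCCs, and the distortion of a test utterance against each speaker's codebook drives the recognition decision.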
This paper motivates the use of a combination of mel frequency cepstral coefficients (MFCC) and their delta derivatives (DMFCC and DDMFCC), calculated using mel-spaced Gaussian filter banks, for text-independent speaker recognition. MFCC, modeled on the human auditory system, shows robustness against noise and session changes and has hence become synonymous with speaker recognition. Our main aim is to test...
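The delta (DMFCC) and delta-delta (DDMFCC) derivatives mentioned here are conventionally computed with a regression over neighboring frames. The sketch below uses the standard formula with a window of N = 2 and a random stand-in for the MFCC matrix; it illustrates the feature construction only, not the paper's Gaussian filterbank variant.

```python
import numpy as np

def deltas(feats, N=2):
    """Delta coefficients via the standard regression formula:
    d_t = sum_{n=1..N} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..N} n^2),
    with edge frames padded by repetition."""
    denom = 2 * sum(n * n for n in range(1, N + 1))
    padded = np.pad(feats, ((N, N), (0, 0)), mode="edge")
    out = np.zeros_like(feats, dtype=float)
    for t in range(feats.shape[0]):
        for n in range(1, N + 1):
            out[t] += n * (padded[t + N + n] - padded[t + N - n])
    return out / denom

mfcc = np.random.default_rng(0).normal(size=(98, 13))  # stand-in MFCC matrix
d = deltas(mfcc)     # DMFCC: frame-to-frame velocity
dd = deltas(d)       # DDMFCC: acceleration
combined = np.hstack([mfcc, d, dd])
print(combined.shape)  # (98, 39)
```

Stacking the statics with both derivative orders yields the familiar 39-dimensional feature vector per frame.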
Speech is the standard means of communication among people. Automatic Speech Recognition (ASR) applications allow users to interact with machines through speech and perform their tasks effortlessly. Speech recognition applications in native languages will enable illiterate and semi-literate people to use computer services with little or no knowledge of operating computers, and to lead better...
This paper formulates a novel approach to spoken document information retrieval for spontaneous speech corpora. The conventional method for this problem is to use an Automatic Speech Recognizer (ASR) integrated with a typical information retrieval method. However, ASRs tend to produce transcripts of spontaneous speech with significant word error rates, which is a drawback of standard...
Though emotional speech recognition has gained increasing interest in the field of Human-Computer Interaction, it is still a challenge to automatically determine the emotion type and the boundaries of each emotionally salient segment in continuous speech, a task named Automatic Emotion Variation Detection (AEVD). In this task, the input utterances are not pre-segmented and may contain emotion...
This paper is aimed at morphing speech uttered by a source speaker so that it seems to be spoken by another target speaker: a new identity is given while the original content is preserved. The proposed method transforms the vocal tract parameters and glottal excitation of the source speaker into the target speaker's acoustic characteristics. It relates to the development of appropriate vocal...
We propose new features for language recognition using Gaussian computations. The new features are derived from traditional features such as Mel frequency cepstral coefficients (MFCC) using the fuzzy c-means clustering algorithm. MFCC feature vectors derived from a large corpus of all languages under consideration are grouped into c clusters using the fuzzy c-means algorithm, and one Gaussian distribution...
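The fuzzy c-means step that groups MFCC vectors into c clusters can be sketched as follows. This is the standard Bezdek algorithm on toy 2-D data, shown for illustration; the fuzzifier m = 2 and the synthetic vectors are assumptions, not values from the paper.

```python
import numpy as np

def fuzzy_cmeans(X, c, m=2.0, iters=100, seed=0):
    """Fuzzy c-means: soft memberships u[i, k] of vector i in cluster k.
    m > 1 controls fuzziness; returns cluster centers and memberships."""
    rng = np.random.default_rng(seed)
    u = rng.random((len(X), c))
    u /= u.sum(axis=1, keepdims=True)  # each row sums to 1
    for _ in range(iters):
        um = u ** m
        # centers: membership-weighted means of the data
        centers = (um.T @ X) / um.sum(axis=0)[:, None]
        # distances from every vector to every center
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-10
        # membership update: u_ik proportional to d_ik^(-2/(m-1))
        inv = d ** (-2.0 / (m - 1.0))
        u = inv / inv.sum(axis=1, keepdims=True)
    return centers, u

# Toy stand-in for MFCC vectors: two well-separated groups
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
centers, u = fuzzy_cmeans(X, c=2)
print(np.round(u, 3))  # memberships near 0/1 for well-separated data
```

Unlike hard K-Means, every vector contributes to every cluster with a graded membership, which is what allows a Gaussian to be fitted per cluster from soft assignments in the described feature derivation.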