Speech processing is today one of the most demanding application areas. This article highlights two important aspects of speech processing, namely which feature representations should be employed and by what criteria they should be selected. Different application areas require different sets of features and different techniques to extract them. At the same time it is necessary to choose...
The present era is full of speech-recognition-based services and products. Machine learning paradigms are at the centre stage of speech recognition methodology. Automatic speech recognition (ASR) technology has evolved rapidly in recent years, with emerging applications in mobile computing, natural user interfaces, and man-machine assistive technology. In this paper, for the first time, we present...
The current work presents a multilingual speech-to-text conversion system. Conversion is based on information in the speech signal. Speech is the natural and most important form of communication for human beings. A Speech-To-Text (STT) system takes a human speech utterance as input and produces a string of words as output. The objective of this system is to extract, characterize and recognize the information...
With the advent of technology, speech recognition is no longer a capability exclusive to humans. Voice-based interfaces become most effective for human-computer interaction when computers respond according to their users' emotional state. Emotion recognition from speech is a challenging problem, as the system has to handle diverse user utterances. This paper presents an age-driven speech emotion...
Speech emotion recognition is one aspect of equipping robots with human capabilities. The need to trade off computational load against recognition accuracy is the main challenge of real-time processing. The application domain of this paper is robotics; therefore both factors are important. Selecting distinguishing features with low dimensionality and high resolution is the optimal solution for...
Emotions in speech are key to fluent human communication, and their investigation has been reported in many different studies. The scope of this article is therefore dedicated to emotion recognition from the speech signal. To determine the best recognition performance of the system used, different cepstral coefficients were extracted from the emotional recordings of two female and one...
This paper proposes a technique based on MFCC analysis of audio signals, with a speech classification application. The proposed work uses multi-resolution (wavelet) analysis and spectral-analysis-based features for feature extraction. The approach combines a number of features, such as Mel Frequency Cepstral Coefficients (MFCC) and FFT coefficients, with wavelet-based features. In addition, accuracy...
Automatic speech recognition of the different languages spoken in different regions of a country is one of the major research areas in the field of signal processing. This paper presents an improved MFCC algorithm for Bundelkhandi digit speech recognition. Here, spoken-digit features are extracted using a modified Mel Frequency Cepstral Coefficient (MFCC) algorithm. In this modified MFCC algorithm, one...
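The standard MFCC pipeline that the abstracts above build on (framing, windowing, power spectrum, mel filterbank, log compression, DCT) can be sketched in plain numpy. This is a minimal illustration, not any of the cited papers' implementations; the frame length, hop, filter count and coefficient count below are common but arbitrary assumptions.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters spaced evenly on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        l, c, r = bins[i - 1], bins[i], bins[i + 1]
        for k in range(l, c):
            fb[i - 1, k] = (k - l) / max(c - l, 1)
        for k in range(c, r):
            fb[i - 1, k] = (r - k) / max(r - c, 1)
    return fb

def mfcc(signal, sr=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_ceps=13):
    # 1. Frame the signal and apply a Hamming window.
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    # 2. Power spectrum of each frame.
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    # 3. Mel filterbank energies, then log compression.
    log_e = np.log(power @ mel_filterbank(n_filters, n_fft, sr).T + 1e-10)
    # 4. DCT-II to decorrelate -> cepstral coefficients.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), (2 * n + 1) / (2.0 * n_filters)))
    return log_e @ basis.T

# Example: one second of a 440 Hz tone at 16 kHz.
sig = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
feats = mfcc(sig)
print(feats.shape)  # (98, 13)
```

Modified-MFCC variants such as the one above typically change one of these stages (e.g. the filterbank shape or spacing) while keeping the rest of the pipeline intact.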
In this paper, we focus mainly on the automated analysis of infant cries. For this implementation we use LFCC for feature extraction and a VQ codebook, built with the LBG algorithm, for matching samples. The newborn crying samples were collected from various crying babies aged 0–6 months. There are 27 babies' recordings as training data, comprising 7 hungry infant cries, 4 sleepy infant cries, 10 in...
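The VQ codebook training mentioned above follows the classic LBG (Linde-Buzo-Gray) splitting scheme: start from the global centroid, split every codeword into a perturbed pair, then refine with nearest-neighbour reassignment. A minimal numpy sketch, with illustrative sizes rather than the paper's actual data:

```python
import numpy as np

def lbg_codebook(vectors, size, eps=0.01, n_iter=20):
    """Grow a VQ codebook by repeated splitting (LBG algorithm)."""
    codebook = vectors.mean(axis=0, keepdims=True)
    while len(codebook) < size:
        # Split every codeword into a slightly perturbed pair.
        codebook = np.vstack([codebook * (1 + eps), codebook * (1 - eps)])
        for _ in range(n_iter):
            # Assign each vector to its nearest codeword (Euclidean distance).
            d = np.linalg.norm(vectors[:, None, :] - codebook[None, :, :], axis=2)
            nearest = d.argmin(axis=1)
            # Move each codeword to the centroid of its cell.
            for j in range(len(codebook)):
                members = vectors[nearest == j]
                if len(members):
                    codebook[j] = members.mean(axis=0)
    return codebook

rng = np.random.default_rng(0)
frames = rng.normal(size=(200, 12))  # stand-in for 12-dim LFCC frames
cb = lbg_codebook(frames, size=8)
print(cb.shape)  # (8, 12)
```

Classification then amounts to quantizing a test utterance against each class codebook and picking the one with the smallest total distortion.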
Speech processing is one of the most interesting and challenging topics in man-machine communication. Emotion detection is the process of determining the psychological state of the speaker. Pitch, formant frequencies, duration, timbre, MFCCs and energy are some of the efficient parameters through which a wealth of information can be retrieved from the speech signal. These parameters have provided good accuracy...
Speech processing has emerged as one of the important application areas of digital signal processing. Power Normalized Cepstral Coefficients (PNCC) and Mel Frequency Cepstral Coefficients (MFCC) are mainly used for feature extraction from speech signals. Real-time speaker segmentation is a hard problem in speech processing, in which no prior knowledge about the number of speakers and their identities...
This paper deals with a new automatic stressed-speech recognition system based on kernel classification. We extract advanced acoustic features from the stressed signals and employ multi-class Support Vector Machines with different kernels to recognize speech utterances under stress. Gammatone Frequency Cepstral Coefficients are also employed. The implemented system is tested using isolated words...
An issue that deserves consideration in emotional speaker recognition is the context in which the speech databases used to develop and evaluate the system were produced. We therefore propose and assess an emotional speaker recognition system based on different feature extraction methods, focusing on the differences between simulated and natural emotional speech databases (BERLIN and IEMOCAP)...
In this paper, a new combination of features and normalization methods is investigated for robust biometric speaker identification. Mel Frequency Cepstral Coefficients (MFCC) are efficient for speaker identification in clean speech, while Power Normalized Cepstral Coefficient (PNCC) features are robust in noisy environments. Therefore, combining both feature sets is better than using each one...
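A common way to combine complementary cepstral feature streams like MFCC and PNCC is frame-level concatenation after per-stream mean/variance normalization, so neither stream dominates by scale. A small numpy sketch under that assumption (the actual fusion and normalization choices in the paper may differ; the arrays below are placeholders):

```python
import numpy as np

def mvn(feats):
    # Per-coefficient mean/variance normalization over the utterance.
    return (feats - feats.mean(axis=0)) / (feats.std(axis=0) + 1e-10)

rng = np.random.default_rng(1)
mfcc_feats = rng.normal(size=(300, 13))  # placeholder: 300 frames of 13 MFCCs
pncc_feats = rng.normal(size=(300, 13))  # placeholder: 300 frames of 13 PNCCs

# Normalize each stream separately, then concatenate frame by frame.
combined = np.hstack([mvn(mfcc_feats), mvn(pncc_feats)])
print(combined.shape)  # (300, 26)
```

The combined 26-dimensional vectors can then be fed to any back-end classifier in place of a single feature stream.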
Speaker voice characteristics are an important aspect of forensic phonetics. Previous studies have suggested that not all features present in the speech signal are equally important for speaker discrimination, and it is well known that some subsets of phonemes are more informative than others. However, most of these studies have considered a whole group of speakers, without taking into account the...
This paper presents a comparison of three feature extraction techniques for ASR systems. Compared with the most commonly used technique, MFCC (Mel Frequency Cepstral Coefficients), PNCC (Power Normalized Cepstral Coefficients) achieves an impressive improvement in noisy speech recognition due to its suppression of high-frequency spectral components of the human voice. The techniques differ in that MFCC uses traditional...
The task of developing an automatic speaker verification (ASV) system for security applications is of considerable importance. This paper aims at developing a fusion strategy which combines both magnitude and phase information of the speech signal, yielding better performance than conventional individual features. The paper employs Mel frequency cepstral coefficients (MFCC) and modified...
Gaussian mixture models (GMM) are an efficient model broadly used in most speaker recognition applications. This study introduces a novel method for the speaker verification task. We propose a reduced feature vector employing new information extracted from the speaker's voice for text-independent speaker verification using GMM. We use the power spectral density...
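In GMM-based verification, a claimed speaker is accepted when the average per-frame log-likelihood of the test features under the speaker's model (usually relative to a background model) exceeds a threshold. A minimal diagonal-covariance scoring sketch in numpy, with toy parameters rather than trained models:

```python
import numpy as np

def gmm_loglik(feats, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    feats: (T, D) frames; weights: (K,); means, variances: (K, D).
    """
    diff = feats[:, None, :] - means[None, :, :]                 # (T, K, D)
    exponent = -0.5 * (diff ** 2 / variances).sum(axis=2)        # (T, K)
    log_norm = -0.5 * np.log(2 * np.pi * variances).sum(axis=1)  # (K,)
    log_comp = np.log(weights) + log_norm + exponent             # (T, K)
    # Numerically stable log-sum-exp over the mixture components.
    m = log_comp.max(axis=1, keepdims=True)
    return float((m[:, 0] + np.log(np.exp(log_comp - m).sum(axis=1))).mean())

rng = np.random.default_rng(2)
feats = rng.normal(size=(100, 13))            # toy 13-dim cepstral frames
weights = np.array([0.5, 0.5])                # 2-component toy model
means = np.zeros((2, 13)); means[1] += 2.0
variances = np.ones((2, 13))
score = gmm_loglik(feats, weights, means, variances)
print(score < 0)  # log-likelihoods of continuous densities are negative here
```

A verification decision then compares `score` for the claimed-speaker GMM against the same score under a universal background model.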
Emotion detection is currently found to be an important and interesting part of speech analysis. The analysis can be performed by selecting one effective parameter or by combining a number of parameters to reach a higher accuracy level. Selecting a number of parameters together will certainly provide a more reliable solution, with a higher level of accuracy than that of a single parameter...
Speech signals carry valuable information about the speaker, including age, gender, and emotional state. Gender information can act as a vital preprocessing ingredient for enhancing speech analysis applications such as adaptive human-machine interfaces, multi-modal security applications, and sophisticated forensic systems based on intent and context analysis. In uncontrolled environments like telephone speech...