Dialect can be defined as a variety of a language that is distinguished from other varieties of the same language by pronunciation, grammar and vocabulary. The process of recognizing such dialects is called Dialect Identification. Kamrupi, although a dialect of the Assamese language, is spoken both in Assam (Kamrup district) and North Bengal. In this paper, we describe a method to identify not just...
This paper aims to bring to light non-intrusive speech quality assessment using Teager-Kaiser energy computation. Based on this energy computation technique, features in the form of cepstral coefficients are calculated and then compared with the classical Mel-frequency cepstral coefficients. The energy computation technique is widely used in the automatic speech recognition area. The...
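The discrete Teager-Kaiser energy operator at the heart of this approach is compact enough to sketch directly; the following minimal Python version (plain lists, no audio I/O, illustrative function name) applies the operator ψ[x(n)] = x(n)² − x(n−1)·x(n+1) at each interior sample:

```python
def teager_kaiser_energy(x):
    """Discrete Teager-Kaiser energy operator:
    psi[x(n)] = x(n)^2 - x(n-1) * x(n+1).
    Returns the operator applied at the interior samples of x."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]
```

For a pure tone A·cos(ωn) the operator evaluates exactly to A²·sin²(ω), so it couples amplitude and frequency into a single instantaneous energy value — which is what makes it attractive as a front end for cepstral features.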
Automatic spoken digit recognition is one of the important areas in speech recognition. Recognition of digits spoken in local languages is the next stage in this technological advancement. This paper presents a new approach to Pashto digit recognition using spectral and prosodic feature extraction. Very little or almost no work has been done on Pashto spoken digit recognition. That is why no standard...
Emotions exhibited by a speaker can be detected by analyzing his/her speech, facial expressions and gestures, or by combining these properties. This paper concentrates on determining the emotional state from speech signals. Various acoustic features such as energy, zero crossing rate (ZCR), fundamental frequency, Mel Frequency Cepstral Coefficients (MFCCs), etc., are extracted for short-term, overlapping...
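Two of the features named above, short-term energy and zero crossing rate, can be sketched over overlapping frames as follows (plain Python; the frame length and hop size are illustrative defaults, not values from the paper):

```python
def short_term_features(signal, frame_len=256, hop=128):
    """Slide an overlapping window over the signal and compute, per frame,
    the average energy and the zero crossing rate."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        # mean squared amplitude of the frame
        energy = sum(s * s for s in frame) / frame_len
        # fraction of adjacent sample pairs that change sign
        zcr = sum(
            1 for i in range(1, frame_len) if frame[i - 1] * frame[i] < 0
        ) / frame_len
        feats.append((energy, zcr))
    return feats
```

The overlap (hop smaller than frame length) is what makes the feature track smooth enough to follow prosodic changes between frames.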
Detecting emotion by analysis of speech is important for identifying the emotional state of a person. This can be done using linear predictive coding (LPC) techniques, which yield parameters such as pitch, vocal tract spectrum, formant frequencies, duration, and MFCCs, used for extracting features from speech. TEO-CB-Auto-Env is a non-linear method of feature extraction...
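LPC analysis itself can be sketched with the autocorrelation method and the Levinson-Durbin recursion; the version below is a minimal pure-Python illustration (no windowing or pre-emphasis, which a real front end would add):

```python
def lpc(signal, order):
    """Autocorrelation-method LPC via Levinson-Durbin.
    Returns (a, e): the prediction polynomial [1, a1, ..., a_order]
    and the final prediction error energy."""
    n = len(signal)
    # autocorrelation r[0..order]
    r = [sum(signal[i] * signal[i + k] for i in range(n - k))
         for k in range(order + 1)]
    a = [1.0] + [0.0] * order
    e = r[0]
    for m in range(1, order + 1):
        # reflection coefficient for this recursion step
        k = -sum(a[i] * r[m - i] for i in range(m)) / e
        # update polynomial coefficients, keeping a[0] == 1
        a = [a[i] + k * a[m - i] for i in range(m + 1)] + a[m + 1:]
        e *= 1.0 - k * k
    return a, e
```

The resulting polynomial models the vocal tract as an all-pole filter; formant frequencies can then be read off its roots, which is how LPC connects to the parameters listed above.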
This paper presents an approach that aims to recognize stressed speech utterances. Our work consists of extracting features using Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Frequency Cepstral Coefficients (GFCC). These features are then classified with One-Class Support Vector Machines (OC-SVM). The results of the proposed method are obtained on speech samples of four stressed...
Speech recognition is the process by which a computer perceives human speech and produces string output in written form. A model is learned from a set of audio recordings and their corresponding transcripts, by taking recordings of speech as audio along with their text transcriptions and using software to create a statistical representation of the sounds that make up each word. Speech based applications...
The term gender identification refers to finding out the gender of a person from his or her voice. Gender identification has been implemented in several Automatic Speaker Recognition (ASR) systems and has proved to be of great significance. The use of gender identification in today's technology makes user authentication and identification in high-security systems easier. In this paper, we...
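One simple acoustic cue for gender identification is fundamental frequency (F0). The sketch below pairs a basic autocorrelation-based F0 estimate with a crude threshold rule; the search band and the 165 Hz decision threshold are illustrative assumptions, not the method of the paper above:

```python
import math

def estimate_f0(frame, sample_rate, fmin=70.0, fmax=300.0):
    """Pick the lag with maximum autocorrelation inside the plausible
    pitch-period range and convert it to a frequency in Hz."""
    best_lag, best_corr = 0, 0.0
    for lag in range(int(sample_rate / fmax), int(sample_rate / fmin) + 1):
        corr = sum(frame[i] * frame[i + lag]
                   for i in range(len(frame) - lag))
        if corr > best_corr:
            best_corr, best_lag = corr, lag
    return sample_rate / best_lag if best_lag else 0.0

def guess_gender(f0, threshold_hz=165.0):
    """Crude threshold rule: typical adult male F0 sits below ~165 Hz."""
    return "female" if f0 > threshold_hz else "male"
```

Real systems combine F0 with spectral features and a trained classifier, since pitch ranges overlap between speakers; this is only the intuition in runnable form.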
The amount of audio data on public networks such as the Internet is increasing in huge volumes daily, so to access these media we need to index and annotate them efficiently. Due to the non-stationary nature of and discontinuities in the audio signal, segmentation and classification of audio signals has become a challenging task. Automatic music classification and annotation is also one of the challenging...
Recently, studies have been performed on spectral features such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictor Cepstral Coefficients (LPCC) for speech emotion recognition. It was found in our study that the Fourier transform of MFCC time trajectories also plays an important role in speech emotion recognition. In addition, a new hierarchical classification method was proposed based on...
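The Fourier transform of an MFCC time trajectory is simply a DFT taken along the frame axis rather than the frequency axis, yielding a modulation spectrum per coefficient. A minimal sketch (the input list stands in for one cepstral coefficient tracked over consecutive frames):

```python
import cmath

def trajectory_spectrum(trajectory):
    """Magnitude DFT of one feature trajectory across frames,
    i.e. its modulation spectrum."""
    N = len(trajectory)
    return [
        abs(sum(trajectory[n] * cmath.exp(-2j * cmath.pi * k * n / N)
                for n in range(N)))
        for k in range(N)
    ]
```

A slowly varying trajectory concentrates its energy in the low modulation bins, which is the kind of temporal-dynamics information a frame-level MFCC alone does not capture.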
This article proposes a gender and geographical origin recognition system for Arabic speakers based on dialect and accent characteristics. We demonstrate that the speaker's gender and nationality can be determined from colloquial Arabic speech and suggest that this system can be integrated into more complex biometric applications. The acoustic features of our proposed dataset used to identify the...
Automatic prediction of continuous level emotional state requires selection of suitable affective features to develop a regression system based on supervised machine learning. This paper investigates the performance of low-level dynamic features for predicting two common dimensions of emotional state, namely, valence and arousal instantaneously. Low-complexity features are extracted from audio and...
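Supervised regression from an affective feature to a continuous label such as valence or arousal can be illustrated, in the simplest one-dimensional case, by ordinary least squares; this is a toy stand-in for whatever regressor the paper actually trains, with made-up feature and label values:

```python
def fit_linear(xs, ys):
    """Ordinary least squares fit y = w*x + b for one feature dimension."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    # slope from centered cross- and auto-covariance
    w = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    b = my - w * mx
    return w, b
```

Instantaneous (per-frame) prediction then just applies `w * x + b` to each frame's feature value; real systems extend this to many feature dimensions and to regressors that model temporal context.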
The i-vector space feature has recently proved to be very efficient in the speaker recognition field. In this paper, we assess the use of the i-vector approach for emotional speaker recognition, in order to boost performance that is otherwise deteriorated by emotions. The key idea of the i-vector algorithm is to represent each speaker by a fixed-length, low-dimensional feature vector. The concatenation of these...
Speech is the natural, vocalized, and primary means of communication. It is easy, hands-free, fast, and does not require any technical knowledge. Communicating with a computer using speech is a simple and comfortable way for human beings, and speech recognition systems have made this possible. The acoustic and language models for such systems are available, but mostly for the English language. In India there are so many people...
This paper presents an alternate representation of phase information in speech signals using the Hartley transform. The Hartley Group Delay Function (HGDF) is computed along similar lines to the Fourier group delay function. Cepstral smoothing is applied so as to reduce the spiky nature of the group delay functions. The smoothed HGDF (SHGDF) is reported to have better resolution in the group delay spectrum. A speaker...
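The discrete Hartley transform underlying the HGDF replaces the complex Fourier kernel with the real-valued cas function, cas θ = cos θ + sin θ. A minimal direct-form sketch (O(N²), for illustration only; practical implementations use fast algorithms):

```python
import math

def dht(x):
    """Discrete Hartley transform: H[k] = sum_n x[n] * cas(2*pi*k*n/N),
    where cas(t) = cos(t) + sin(t). Real input gives real output, and it
    relates to the DFT by H[k] = Re(X[k]) - Im(X[k])."""
    N = len(x)
    return [
        sum(x[n] * (math.cos(2 * math.pi * k * n / N)
                    + math.sin(2 * math.pi * k * n / N))
            for n in range(N))
        for k in range(N)
    ]
```

Because the transform is real-valued, phase-derived quantities like a group delay function can be computed without complex arithmetic, which is part of its appeal here.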
Speech is the simplest modality to consider for a unimodal biometric system. The accuracy, however, depends on the quality of the signal, which tends to be compromised by the conditions under which it is spoken or recorded. To implement a robust system, the challenge lies in enhancing the quality and intelligibility of the noisy speech signal. Various speech enhancement techniques can be applied...
Articulatory features are used as a universal set of speech attributes shared across many different languages. Some multilingual and cross-language speech recognition systems using articulatory features have been shown to improve performance. The existing articulatory features are defined by phoneticians as a set of articulatory descriptions of phones, which represent some semantic information...
Peer-Led Team Learning (PLTL) is a structured learning model in which a team leader is appointed to facilitate collaborative problem solving among students in Science, Technology, Engineering and Mathematics (STEM) courses. This paper presents an informed HMM-based speaker diarization system. The minimum duration of short conversational turns and the number of participating students were fed as side information...
Speech is one of the most popular modalities for emotion recognition. This work uses Mel- and Bark-scale-dependent perceptual auditory features for recognizing seven emotions from the Berlin speech corpus. A combination of Mel Frequency Cepstral Coefficients (MFCCs), Perceptual Linear Predictive Cepstrum (PLPC), Mel Frequency Perceptual Linear Predictive Cepstrum (MFPLPC) and linear predictive coefficients...
In this paper, we investigate the effect of the G.723.1 codec (6.3 kbps) on speaker recognition systems. To improve robustness to codec mismatch, we use Power Normalized Cepstral Coefficients (PNCC), a new robust acoustic feature, to improve the performance of the speaker verification system. A modified SCF speech feature is also proposed to improve robustness under codec mismatch...