Search results for: Mousmita Sarma

Items from 1 to 13 out of 13 results

chapter

Dialect Identification from Assamese speech using prosodic features and a neuro fuzzy classifier

Mousmita Sarma, Kandarpa Kumar Sarma

2016 3rd International Conference on Signal Processing and Integrated Networks (SPIN) > 127 - 132

2016 3rd International Conference on Signal Processing and Integrated Networks (SPIN)

Dialect identification is the task of classifying speech on the basis of dialect, which comes under the Automatic Language Identification problem. In this work a neuro fuzzy classifier is used to identify dialect of speech from vowel sound. Vowel sounds occur in an acoustic speech signal more frequently and with higher energy. Therefore, prosodic feature of vowel sounds can be used to search dialect...

chapter

Speaker change detection using excitation source and vocal tract system information

Mousmita Sarma, Sree Nilendra Gadre, Biswajit Dev Sarma, S. R. Mahadeva Prasanna

2015 Twenty First National Conference on Communications (NCC) > 1 - 6

2015 Twenty First National Conference on Communications (NCC)

The speaker change information in speech is due to both vocal tract and excitation source information. In this work, the excitation source information is extracted by computing cepstral features from the zero frequency filtered speech (ZFFS) signal. The vocal tract system information is extracted by computing cepstral features from the speech signal. The speaker change evidences obtained from these...

chapter

RNN and SOM based classifier to recognize assamese fricative sounds designed using frame based temporal feature sets

Chayashree Patgiri, Mousmita Sarma, Kandarpa Kumar Sarma

2014 International Joint Conference on Neural Networks (IJCNN) > 3496 - 3502

2014 International Joint Conference on Neural Networks (IJCNN)

In this work, a Recurrent Neural Network (RNN) is trained using cepstral features and a set of difference cepstral feature (DCF) vectors on a frame to frame basis. The DCF vector is formulated to capture the temporal patterns of fricative sounds or phonemes of Assamese language. A hybrid algorithm is developed to recognize these fricative phonemes from certain words containing them. To preserve the...

chapter

Empirical mode decomposition based reconstruction of speech signal in noisy environment

Nisha Goswami, Mousmita Sarma, Kandarpa Kumar Sarma

2014 International Conference on Signal Processing and Integrated Networks (SPIN) > 760 - 765

2014 International Conference on Signal Processing and Integrated Networks (SPIN)

A novel technique for speech signal reconstruction using Empirical Mode Decomposition (EMD) of speech signal in noisy condition is described in this paper. EMD is applied for finding the glottal source signal of speech signals. After getting the source information, vocal tract filter response is determined and the original speech signal is reconstructed with the help of EMD with and without prior...

chapter

Development of Assamese Phonetic Engine: Some issues

Biswajit Dev Sarma, Mousmita Sarma, Meghamallika Sarma, S. R. Mahadeva Prasanna

2013 Annual IEEE India Conference (INDICON) > 1 - 6

2013 Annual IEEE India Conference (INDICON)

The phonetic engine is a system that performs speech signal to symbol transformation. This work describes some issues in the development of an Assamese Phonetic Engine (PE). International phonetic alphabet (IPA) is used as the phonetic unit to transcribe the speech database collected in three different modes, namely, reading, lecture and conversation modes. Only reading mode data is used for training...

chapter

Recurrent Neural Network based approach to recognize assamese vowels using experimentally derived acoustic-phonetic features

Mridusmita Sharma, Mousmita Sarma, Kandarpa Kumar Sarma

2013 1st International Conference on Emerging Trends and Applications in Computer Science > 140 - 143

2013 1st International Conference on Emerging Trends and Applications in Computer Science (ICETACS)

Vowels are the phonemes with greatest intensity and low frequencies. Assamese, which is considered as the lingua-franca of the entire north-east India, has eight vowel phonemes namely /i/, /e/, /ε/, /a/, /ɒ/, /ɔ/, /o/ and /u/. A Recurrent Neural Network (RNN) based algorithm is described in this paper for the recognition of the vowel sounds from Assamese speech. The feature vector is generated by...

chapter

Recurrent neural network based approach to recognize assamese fricatives using experimentally derived acoustic-phonetic features

Chayashree Patgiri, Mousmita Sarma, Kandarpa Kumar Sarma

2013 1st International Conference on Emerging Trends and Applications in Computer Science > 33 - 37

2013 1st International Conference on Emerging Trends and Applications in Computer Science (ICETACS)

Fricatives are the major group of speech sounds bearing distinct acoustical and phonetical characteristics and provides a wide range of application possibilities in the field of speech and speaker recognition. Assamese, which is a widely spoken language in the north eastern part of India, has four distinct fricative sounds called /s/, /z/, /x/ and /h /. In this paper, a Recurrent Neural Network (RNN)...

chapter

Reconstruction of speech signal using Empirical Mode Decomposition based glottal source extraction

Nisha Goswami, Mousmita Sarma, Kandarpa Kumar Sarma

2013 1st International Conference on Emerging Trends and Applications in Computer Science > 27 - 32

2013 1st International Conference on Emerging Trends and Applications in Computer Science (ICETACS)

In this paper, a novel technique for speech signal reconstruction is described using Empirical Mode Decomposition (EMD) of speech signal. EMD is applied for finding the glottal source signal of speech signals. After getting the source information, vocal tract filter response is determined and the original speech signal is reconstructed. The experimental result derived establishes the effectiveness...

chapter

Speaker identification model for Assamese language using a neural framework

Mousmita Sarma, Kandarpa Kumar Sarma

The 2013 International Joint Conference on Neural Networks (IJCNN) > 1 - 7

2013 International Joint Conference on Neural Networks (IJCNN 2013 - Dallas)

This paper presents a neural model of speaker identification using the vowel sound segmented out from words spoken by a speaker. Vowel sounds occur in a speech more frequently and with higher energy. Therefore, situations where acoustic information is noise corrupted vowel sounds can be used to extract different amounts of speaker discriminative information. The model explained here uses a neural...

article

An ANN based approach to recognize initial phonemes of spoken words of Assamese language

Mousmita Sarma, Kandarpa Kumar Sarma

Applied Soft Computing > 2013 > 13 > 5 > 2281-2291

Initial phoneme is used in spoken word recognition models. These are used to activate words starting with that phoneme in spoken word recognition models. Such investigations are critical for classification of initial phoneme into a phonetic group. A work is described in this paper using an artificial neural network (ANN) based approach to recognize initial consonant phonemes of Assamese words. A self...

chapter

Formant frequency estimation of phonemes of Assamese speech

Mousmita Sarma, Kandarpa Kumar Sarma

2012 2nd National Conference on Computational Intelligence and Signal Processing (CISP) > 119 - 125

2012 2nd National Conference on Computational Intelligence and Signal Processing (CISP)

Phonemes are the smallest distinguishable unit of speech signal. Formant frequency of a phoneme, the most fundamental concept in speech processing, differentiate one phoneme from another. Range of formant frequency of a particular phoneme can be used as a priori knowledge in various speech processing application. This paper describes a work done for estimating the formant frequencies of all consonant...

chapter

Segmentation of Assamese phonemes using SOM

Mousmita Sarma, Kandarpa Kumar Sarma

2012 3rd National Conference on Emerging Trends and Applications in Computer Science > 121 - 125

2012 3rd National Conference on Emerging Trends and Applications in Computer Science (NCETACS)

Phonemes are the smallest distinguishable unit of speech signal. Segmentation of phoneme from its word counterpart is a fundamental and crucial part in speech processing since initial phoneme is used to activate words starting with that phoneme. This work describes an Artificial Neural Network (ANN) based algorithm developed for segmentation and classification of consonant phoneme of Assamese language...

chapter

Speech corpus of assamese numerals extracted using an adaptive pre-emphasis filter for speech recognition

Mousmita Sarma, Krishna Dutta, Kandarpa Kumar Sarma

2010 International Conference on Computer and Communication Technology (ICCCT) > 461 - 466

2010 International Conference on Computer and Communication Technology (ICCCT 2010)

The quality and details captured in speech corpus directly affects the precision of performance in an Automatic Speech Recognition (ASR) system. The current work proposes a platform for speech corpus generation using an adaptive LMS filter and LPC Cepstrum, as a part of an Artificial Neural Network (ANN) based Speech Recognition System which is exclusively designed to recognize isolated numerals of...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Mousmita Sarma

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options