Search results for: Anil Kumar

Items from 1 to 7 out of 7 results

chapter

Learned dictionaries for sparse representation based unit selection speech synthesis

Pulkit Sharma, Vinayak Abrol, Anil Kumar Sao

2016 Twenty Second National Conference on Communication (NCC) > 1 - 5

2016 Twenty Second National Conference on Communication (NCC)

In this paper, we have employed learned dictionaries to compute sparse representation of speech utterances, which will be used to reduce the footprint of unit selection based speech synthesis (USS) systems. Speech database labeled at phoneme level is used to obtain multiple examples of the same phoneme, and all the examples (of each phoneme) are then used to learn a single overcomplete dictionary...

chapter

Detection of emotionally significant regions of speech for emotion recognition

Hari Krishna Vydana, Peddakota Vikash, Tallam Vamsi, Kolla Pavan Kumar, more

2015 Annual IEEE India Conference (INDICON) > 1 - 6

2015 Annual IEEE India Conference (INDICON)

Emotions in human speech are short lived. In an emotive utterance, the emotive gestures produced due to the emotive state of the speaker persists only to a shorter duration. In this study, the regions of an utterance that are highly influenced by the emotive state of the speaker are detected. These regions are labeled as emotionally significant regions. Data from the detected emotionally significant...

chapter

Analysis of constraints on segmental DTW for the task of query-by-example spoken term detection

Sri Harsha Dumpala, K N R K Raju Alluri, Suryakanth V. Gangashetty, Anil Kumar Vuppala

2015 Annual IEEE India Conference (INDICON) > 1 - 6

2015 Annual IEEE India Conference (INDICON)

Query-by-example spoken term detection (QbE-STD) refers to the task of determining the subsequence of a reference which matches with a query, where both the query and the reference are in audio format. Dynamic time warping (DTW) based techniques are explored to match the two sequences with different lengths in an unsupervised manner. In this paper, a completely unsupervised approach based on Segmental...

chapter

Neutral to anger speech conversion using non-uniform duration modification

Anil Kumar Vuppala, Sudarsana Reddy Kadiri

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 4

2014 9th International Conference on Industrial and Information Systems (ICIIS)

In this paper, the non-uniform duration modification is exploited along with other prosody features for neutral speech to anger speech conversion. The non-uniform duration modification method modifies the durations of vowel and pause segments by different modification factors. Vowel segments are modified by factors based on their identities, and pause segments by uniform factors. Consonant and transition...

chapter

Improved Syllable Nuclei Detection Using Formant Energy in Glottal Closure Regions

Hari Krishna Vydana, Mounika K V, Anil Kumar Vuppala

2014 International Conference on Devices, Circuits and Communications (ICDCCom) > 1 - 6

2014 International Conference on Devices, Circuits and Communications (ICDCCom)

Robust syllabification of continuous speech is a vital aspect of language and speech processing systems. Syllabification of speech can be done by detecting the syllable nuclei. Syllable is the basic production unit of human speech and syllable nuclei can be attributed to high energy sonarants or resonant sounds which are relatively loud and carry a clear pitch. In this work, high spectral energy at...

chapter

IITKGP-MLILSC speech database for language identification

Sudhamay Maity, Anil Kumar Vuppala, K. Sreenivasa Rao, Dipanjan Nandi

2012 National Conference on Communications (NCC) > 1 - 5

2012 National Conference on Communications (NCC)

In this paper, we are introducing speech database consists of 27 Indian languages for analyzing language specific information present in speech. In the context of Indian languages, systematic analysis of various speech features and classification models in view of automatic language identification has not performed, because of the lack of proper speech corpus covering majority of the Indian languages...

chapter

Effect of Low Bit Rate Speech Coding on Epoch Extraction

Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti, K Sreenivasa Rao

2011 International Conference on Devices and Communications (ICDeCom) > 1 - 4

2011 International Conference on Devices and Communications (ICDeCom)

Speech coding is one of the major degradation involved in building the speech systems in mobile environment. In this paper, we are exploring the effect of low bit rate speech coding on the accuracy of detection of epochs. Epoch is referred as the instant of significant excitation of the vocal-tract system during production of speech. Many speech applications depend on the the accurate estimation of...

Filter options

Keywords:
SPEECH
DATABASES

Publication date

Set your own date range

Keywords

MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
SPEECH PROCESSING (3)
FEATURE EXTRACTION (2)
SPEECH CODING (2)
SPEECH RECOGNITION (2)
VOCODERS (2)
ANGER SPEECH (1)
ARRAYS (1)
BIT RATE (1)
CELP (1)
CMU-ARCTIC DATA (1)
COMPUTATIONAL MODELING (1)
DELAYS (1)
DICTIONARIES (1)
DICTIONARY LEARNING (1)
DURATION CONTOUR (1)
DYNAMIC PROGRAMMING (1)
DYNAMIC PROGRAMMING PROJECTED PHASE SLOPE (1)
DYNAMIC TIME WARPING (1)
ELECTROGLOTTOGRAPH (1)
ELECTRONIC MAIL (1)
EMOTION CONVERSION (1)
EMOTION RECOGNITION (1)
EMOTIONALLY SIGNIFICANT REGIONS (1)
EPOCH EXTRACTION METHODS (1)
ERBIUM (1)
ETSI 06.10 (1)
EXCITATION SOURCE (1)
FILTERING THEORY (1)
FS-1016 (1)
GAUSSIAN MIXTURE MODELLING (1)
GAUSSIAN MIXTURE MODELS (GMMS) (1)
GAUSSIAN POSTERIORGRAMS (1)
GSM (1)
GSM FULL RATE (1)
HIDDEN MARKOV MODELS (1)
INDIAN LANGUAGE DATABASE (1)
INFORMATION TECHNOLOGY (1)
INTENSITY CONTOUR (1)
INTERPOLATION (1)
ITAKURA PARALLELOGRAM (1)
LANGUAGE IDENTIFICATION (1)
LINEAR PREDICTION CEPSTRAL COEFFICIENTS (LPCCS) (1)
LOW BIT RATE SPEECH CODING (1)
MATLAB (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENTS (MFCCS) (1)
MOBILE ENVIRONMENT (1)
NEUTRAL SPEECH (1)
NOISE (1)
NON-UNIFORM DURATION MODIFICATION (1)
PHYSIOLOGICAL CONSTRAINTS (1)
PITCH SHIFT (1)
PREDICTIVE MODELS (1)
QUERY-BY-EXAMPLE SPOKEN TERM DETECTION (1)
RESONANT FREQUENCY (1)
SAKOE-CHIBA BAND (1)
SIGNAL PROCESSING ALGORITHMS (1)
SPARSE REPRESENTATION (1)
SPECTROGRAM (1)
SPEECH CODERS (1)
SPEECH PRODUCTION SYSTEM (1)
SPEECH SIGNAL (1)
SPEECH SYNTHESIS (1)
SPEECH SYSTEMS (1)
TIME-VARYING CHARACTERISTICS (1)
VOCAL-TRACT SYSTEM (1)
ZERO FREQUENCY FILTER (1)
more

INFONA - science communication portal

Search results for: Anil Kumar

Learned dictionaries for sparse representation based unit selection speech synthesis

Detection of emotionally significant regions of speech for emotion recognition

Analysis of constraints on segmental DTW for the task of query-by-example spoken term detection

Neutral to anger speech conversion using non-uniform duration modification

Improved Syllable Nuclei Detection Using Formant Energy in Glottal Closure Regions

IITKGP-MLILSC speech database for language identification

Effect of Low Bit Rate Speech Coding on Epoch Extraction

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options