Search results for: K. Sreenivasa Rao

Items from 1 to 14 out of 14 results

article

Robust Pitch Extraction Method for the HMM-Based Speech Synthesis System

M. Kiran Reddy, K. Sreenivasa Rao

IEEE Signal Processing Letters > 2017 > 24 > 8 > 1133 - 1137

This letter proposes an efficient method for extracting pitch from speech signals for the hidden Markov model (HMM)-based speech synthesis system (HTS). In the proposed method, voicing detection and pitch estimation are performed using the mean signal obtained from continuous wavelet transform coefficients. The proposed pitch extraction method is integrated in the HMM-based speech synthesis system...

chapter

Excitation modeling for HMM-based speech synthesis based on principal component analysis

N. P. Narendra, M. Kiran Reddy, K. Sreenivasa Rao

2016 Twenty Second National Conference on Communication (NCC) > 1 - 6

2016 Twenty Second National Conference on Communication (NCC)

This paper proposes a new excitation modeling method for improving the quality of HMM-based speech synthesis. The proposed excitation (source) modeling method models the pitch-synchronous residual frames extracted from the excitation signal. Initially, principal component analysis is performed on the pitch-synchronous residual frames. Based on the analysis, the pitch-synchronous residual frames are...

chapter

Emotion-specific features for classifying emotions in story text

Harikrishna D M, K. Sreenivasa Rao

2016 Twenty Second National Conference on Communication (NCC) > 1 - 4

2016 Twenty Second National Conference on Communication (NCC)

In this work, we attempt emotion classification with a view to synthesizing story speech. We propose emotion-specific text features (ESF) for classifying sentences from children's stories into five emotion categories: happy, sad, anger, fear and neutral. ESF is a five-dimensional feature vector, where each dimension corresponds to the weight of the sentence according to each emotion class...

chapter

Analysis of linear prediction residual signal, its magnitude and phase for language identification on NIST LRE (2003) database

Arup Kumar Dutta, K. Sreenivasa Rao

2015 International Conference on Computer, Communication and Control (IC4) > 1 - 4

2015 International Conference on Computer, Communication and Control (IC4)

The present work investigates the importance of excitation source features for language identification (LID). The linear prediction residual (LPR) represents the excitation source signal. By processing the LPR at sub-segmental, segmental and supra-segmental levels, we can get the language-specific information present within a glottal cycle, within a sequence of a few glottal cycles and at the prosody...

chapter

Generation of emotional speech by prosody imposition on sentence, word and syllable level fragments of neutral speech

Jainath Yadav, K. Sreenivasa Rao

2015 International Conference on Cognitive Computing and Information Processing (CCIP) > 1 - 5

2015 International Conference on Cognitive Computing and Information Processing (CCIP)

In emotional speech, it is observed that some words and phrases are spoken more prominently than in neutral speech. The prominence of these specific words and phrases is reflected in prosodic features such as the duration, intonation and intensity patterns of the words or phrases. Neutral speech and emotional speech differ primarily in the prosodic aspects of speech. Three acoustic...

chapter

Emotion recognition using LP residual at sub-segmental, segmental and supra-segmental levels

Jainath Yadav, Anshu Kumari, K. Sreenivasa Rao

2015 International Conference on Communication, Information & Computing Technology (ICCICT) > 1 - 6

2015 International Conference on Communication, Information & Computing Technology (ICCICT)

This paper is concerned with speech-signal-based emotion recognition. The Linear Prediction (LP) residual mainly contains source-specific emotional information. The LP residual is derived by inverse filtering of the speech signal. For characterizing the basic emotions, the LP residual has been explored at the sub-segmental, segmental and supra-segmental levels. Gaussian mixture models (GMMs)...

chapter

Optimal residual frame based source modeling for HMM-based speech synthesis

N. P. Narendra, K. Sreenivasa Rao

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 5

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)

This paper proposes a method for modeling the excitation signal to improve the quality of the HMM-based speech synthesis system (HTS). A single optimal residual frame, which closely relates to all frames of a phone, is chosen to represent the entire residual signal of the phone. Optimal residual frames of all phones present in the speech corpus are efficiently grouped based on positional and contextual features...

chapter

Contribution of Telugu vowels in identifying emotions

Shashidhar Koolagudi G, Shivakranthi B, K Sreenivasa Rao, Pravin B Ramteke

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 6

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR)

This work mainly aims to identify the contribution of different vowels to emotion in the Telugu language. Instead of processing the entire speech signal, we propose to focus only on the vowel parts of the utterance (/a/, /i/, /u/, /e/ and /o/). By analysing the vowels we can discriminate the emotions. In this work, spectral and prosodic features are used for studying the effect of emotions on different vowels...

chapter

Telugu emotional story speech synthesis using SABLE markup language

M Gurunath Reddy, D M Harikrishna, K. Sreenivasa Rao, K E Manjunath

2015 International Conference on Signal Processing and Communication Engineering Systems > 331 - 335

2015 International Conference on Signal Processing And Communication Engineering Systems (SPACES)

In this paper, a framework for synthesizing Telugu emotional speech for storytelling applications is presented. An XML-based markup language, SABLE, is used to synthesize the emotions from a given story text. SABLE markup defines a set of tags to improve the quality of the synthesized speech from the concatenative speech synthesizer. In this work, a subset of prosody tags is used to synthesize the...

chapter

IITKGP-MLILSC speech database for language identification

Sudhamay Maity, Anil Kumar Vuppala, K. Sreenivasa Rao, Dipanjan Nandi

2012 National Conference on Communications (NCC) > 1 - 5

2012 National Conference on Communications (NCC)

In this paper, we introduce a speech database consisting of 27 Indian languages for analyzing the language-specific information present in speech. In the context of Indian languages, a systematic analysis of various speech features and classification models for automatic language identification has not been performed, because of the lack of a proper speech corpus covering the majority of the Indian languages...

chapter

Subword based approach for grapheme-to-phoneme conversion in Bengali text-to-speech synthesis system

Krishnendu Ghosh, K. Sreenivasa Rao

2012 National Conference on Communications (NCC) > 1 - 5

2012 National Conference on Communications (NCC)

In this paper, we propose a subword based approach for grapheme-to-phoneme (G2P) conversion in a text-to-speech (TTS) synthesis system. The proposed method resolves the problems present in both the manual and rule-based approaches for G2P conversion. The subword method uses a segmentation procedure which chops a word into its main part (root word) and subword part (suffix). By proper segmentation...

chapter

Development of Bengali screen reader using Festival speech synthesizer

N. P. Narendra, K. Sreenivasa Rao, Krishnendu Ghosh, Vempada Ramu Reddy, more

2011 Annual IEEE India Conference > 1 - 4

2011 Annual IEEE India Conference (INDICON)

This paper discusses the development of a Bengali screen reader using the Festival speech synthesizer. The screen reader is developed with the objective that visually challenged people can use the computer without any difficulty. The usability of the system is checked throughout the development and appropriate modifications are made. Unrestricted Bengali text to speech synthesis (TTS) system which can produce...

chapter

Effect of Low Bit Rate Speech Coding on Epoch Extraction

Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti, K Sreenivasa Rao

2011 International Conference on Devices and Communications (ICDeCom) > 1 - 4

2011 International Conference on Devices and Communications (ICDeCom)

Speech coding is one of the major degradations involved in building speech systems in a mobile environment. In this paper, we explore the effect of low bit rate speech coding on the accuracy of epoch detection. An epoch is the instant of significant excitation of the vocal-tract system during the production of speech. Many speech applications depend on the accurate estimation of...

chapter

Emotion recognition using LP residual

Arun Chauhan, Shashidhar G Koolagudi, Sabin Kafley, K Sreenivasa Rao

2010 IEEE Students Technology Symposium (TechSym) > 255 - 261

2010 IEEE Students' Technology Symposium (TechSym 2010)

This paper explores the Linear Prediction (LP) residual of the speech signal for characterizing the basic emotions. The emotions used in this study are anger, compassion, disgust, fear, happy, neutral, sarcastic and surprise. The LP residual is derived by inverse filtering of the speech signal, and the process is known as LP analysis. The LP residual mainly contains higher order relations among the samples. For...

