Search results for: K Sreenivasa Rao

Items 1 to 13 of 13 results

chapter

Development of multilingual phone recognition system for Indian languages

K E Manjunath, K. Sreenivasa Rao, Dinesh Babu Jayagopi

2017 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) > 1 - 6

This paper describes the development of a Multilingual Phone Recognition System (MPRS) in the context of Indian languages. An MPRS is a language-independent Phone Recognition System (PRS) that can recognise the phonetic units present in a speech utterance of any language. We have developed two bilingual PRSs and one quadrilingual PRS using four Indian languages: Kannada, Telugu, Bengali, and Odia. International...

chapter

Modification and incorporation of excitation source features for emotion conversion

Arijul Haque, K. Sreenivasa Rao

2015 International Conference on Computer, Communication and Control (IC4) > 1 - 5

This paper describes techniques for modifying and incorporating excitation source features to convert neutral speech to emotional speech. Sadness and anger are the emotions considered in this work. The features used are epoch strength, epoch sharpness and pitch contour. A new method of modifying and incorporating epoch strength and sharpness is proposed. New pitch contours corresponding to the...

chapter

Analysis of linear prediction residual signal, its magnitude and phase for language identification on NIST LRE (2003) database

Arup Kumar Dutta, K. Sreenivasa Rao

2015 International Conference on Computer, Communication and Control (IC4) > 1 - 4

The present work investigates the importance of excitation source features for language identification (LID). The linear prediction residual (LPR) represents the excitation source signal. By processing the LPR at the sub-segmental, segmental and supra-segmental levels, we can capture the language-specific information present within a glottal cycle, within a sequence of a few glottal cycles and at the prosody...

chapter

Improved recognition rate of language identification system in noisy environment

Randheer Bagi, Jainath Yadav, K. Sreenivasa Rao

2015 Eighth International Conference on Contemporary Computing (IC3) > 214 - 219

Spoken language identification is a technique for modelling and classifying the language spoken by an unknown person. The task is more challenging under environmental conditions that add different types of noise to the speech. The presence of noise in the speech signal causes several problems. This paper covers several aspects of language identification in noisy environments. Experiments have been carried...

chapter

Data-driven pause prediction for speech synthesis in storytelling style speech

Parakrant Sarkar, K. Sreenivasa Rao

2015 Twenty First National Conference on Communications (NCC) > 1 - 5

In storyteller speech, pauses play a significant role in introducing suspense and climax. Pauses are used to emphasize keywords and emotion-salient words and to separate the phrases in an utterance. The objective of this work is to predict the position and duration of pauses in speech synthesized by a text-to-speech system. We analyzed the pause patterns in storyteller speech and classified...

chapter

Emotion recognition using LP residual at sub-segmental, segmental and supra-segmental levels

Jainath Yadav, Anshu Kumari, K. Sreenivasa Rao

2015 International Conference on Communication, Information & Computing Technology (ICCICT) > 1 - 6

This paper is concerned with emotion recognition from the speech signal. The Linear Prediction (LP) residual mainly contains source-specific emotional information. It is derived by inverse filtering of the speech signal. To characterize the basic emotions, the LP residual has been explored at the sub-segmental, segmental and supra-segmental levels. Gaussian mixture models (GMMs)...

chapter

Optimal residual frame based source modeling for HMM-based speech synthesis

N. P. Narendra, K. Sreenivasa Rao

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 5

This paper proposes a method for modeling the excitation signal to improve the quality of an HMM-based speech synthesis system (HTS). A single optimal residual frame, one that closely relates to all frames of a phone, is chosen to represent the entire residual signal of that phone. The optimal residual frames of all phones present in the speech corpus are efficiently grouped based on positional and contextual features...

chapter

Contribution of Telugu vowels in identifying emotions

Shashidhar Koolagudi G, Shivakranthi B, K Sreenivasa Rao, Pravin B Ramteke

2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) > 1 - 6

This work aims to identify the contribution of different vowels to emotion in the Telugu language. Instead of processing the entire speech signal, we propose to focus only on the vowel parts of the utterance (/a/, /i/, /u/, /e/ and /o/). By analysing the vowels, we can discriminate between emotions. In this work, spectral and prosodic features are used to study the effect of emotions on different vowels...

chapter

Significance of CV transition and steady vowel regions for language identification

Dipanjan Nandi, Arup Kumar Dutta, K. Sreenivasa Rao

2014 Seventh International Conference on Contemporary Computing (IC3) > 513 - 517

The present work explores the significance of the consonant-vowel (CV) transition and steady vowel (SV) regions for the language identification (LID) task. The language-specific vocal tract information, represented by Mel-frequency cepstral coefficients (MFCCs), is extracted from the CV transition and steady vowel regions. The durations of the CV transition and steady vowel regions are varied to analyze...

chapter

Infant cry recognition using excitation source features

Avinash Kumar Singh, Jayanta Mukhopadhyay, S B Sunil Kumar, K. Sreenivasa Rao

2013 Annual IEEE India Conference (INDICON) > 1 - 5

In this work, excitation source features are explored for classifying infant cries. The types of infant cry considered are hunger, pain and wet-diaper. The excitation source features explored are the epoch interval contour (EIC), the epoch strength contour (ESC), epoch sharpness, and the slope of the EIC and ESC. In this work Gaussian Mixture Models (GMM)...

chapter

Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia

K E Manjunath, S. B. Sunil Kumar, Debadatta Pati, Biswajit Satapathy, et al.

2013 Annual IEEE India Conference (INDICON) > 1 - 6

The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) that determines the sequence of Consonant-Vowel (CV) units present in a given speech utterance. We focus on developing CVRSs for two Indian languages, Bengali and Odia. The framework can be extended to any Indian language. We have developed two separate CVRSs for Bengali...

article

Detection of Vowel Offset Point From Speech Signal

Jainath Yadav, K. Sreenivasa Rao

IEEE Signal Processing Letters, 2013, vol. 20, no. 4, pp. 299 - 302

Vowel regions play an important role in various speech tasks, such as speech segmentation, speaker verification, prosody modification and emotion conversion. The instants at which the onset and offset of a vowel take place in the speech signal are known as the vowel onset point and vowel offset point, respectively. A vowel region starts at the vowel onset point and ends at the vowel offset point. In this...

chapter

Segmentation of TV broadcast news using speaker specific information

K. Sreenivasa Rao, Ketan Pachpande, Ramu Reddy Vempada, Sudhamay Maity

2012 National Conference on Communications (NCC) > 1 - 5

In this paper, we propose a two-stage segmentation approach for splitting TV broadcast news bulletins into a sequence of news stories. In the first stage, speaker (news reader) specific characteristics present in the initial headlines of the news bulletin are used for gross-level segmentation. In the second stage, errors in the gross-level segmentation (first stage) are corrected by exploiting the speaker...

Filtering options

Keywords:
FEATURE EXTRACTION

Publication type

book (12)
article (1)

Keywords

SPEECH RECOGNITION (6)
ACCURACY (5)
DATABASES (4)
HIDDEN MARKOV MODELS (4)
VECTORS (4)
GAUSSIAN MIXTURE MODEL (3)
MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
ACOUSTICS (2)
EMOTION RECOGNITION (2)
EXCITATION SOURCE (2)
GMM (2)
SPEECH SYNTHESIS (2)
TRAINING (2)
VOWEL ONSET POINT (2)
ACOUSTIC FEATURES (1)
BENGALI (1)
BILINGUAL (1)
BREAKS (1)
BUILDINGS (1)
COMPLEXITY THEORY (1)
COMPUTERS (1)
CONFERENCES (1)
CONSONANT-VOWEL RECOGNITION (1)
CORRELATION (1)
CORRELATION COEFFICIENT (1)
CV TRANSITION REGION (1)
DATA MINING (1)
DECISION TREES (1)
DISTORTION (1)
EPOCH (1)
EPOCH INTERVAL CONTOUR (EIC) (1)
EPOCH SHARPNESS (1)
EPOCH STRENGTH (1)
EPOCH STRENGTH CONTOUR (ESC) (1)
EXCITATION MODELING (1)
FEATURE VECTOR (1)
FILTERING (1)
GAUSSIAN MIXTURE MODELS (GMMS) (1)
GAUSSIAN NORMALIZATION (1)
GLOTTAL CLOSURE REGION (1)
HIGH-TEMPERATURE SUPERCONDUCTORS (1)
HMM-BASED SPEECH SYNTHESIS (1)
HYBRID SYNTHESIS (1)
IITKGP-MLILSC (1)
INDEXING (1)
INDIAN LANGUAGES (1)
INDIAN SPEECH CORPUS (IITKGP-MLILSC) (1)
INFANT CRY RECOGNITION SYSTEM (ICRS) (1)
INTERNATIONAL PHONETIC ALPHABET (1)
JITTER (1)
KANNADA (1)
LINEAR PREDICTION ANALYSIS (1)
LINEAR PREDICTION COEFFICIENT (1)
MATERIALS (1)
MFCC (1)
MFCCS (1)
MINIMUM MEAN SQUARE ERROR (MMSE) (1)
MODULATION (1)
MODULATION SPECTRUM (1)
MONOLINGUAL (1)
MULTILINGUAL (1)
NIST (1)
NIST LRE (2003) (1)
NOISE MEASUREMENT (1)
NON-BREAK (1)
ODIA (1)
OPTIMAL RESIDUAL FRAME (1)
PAIN (1)
PAUSE DURATION (1)
PAUSE PREDICTION (1)
PEDIATRICS (1)
PHONE RECOGNITION (1)
PHRASING (1)
PREDICTIVE MODELS (1)
PROBABILITY DENSITY (1)
QUADRILINGUAL (1)
RESIDUAL SIGNAL (1)
RESONANT FREQUENCY (1)
SEGMENTAL (1)
SEGMENTAL LEVEL (1)
SHIMMER (1)
SIGNAL TO NOISE RATIO (1)
SILENCES (1)
SIMULATED DATABASE (1)
SPEAKER RECOGNITION (1)
SPEAKER SPECIFIC INFORMATION (1)
SPECTRAL PEAKS (1)
SPECTRAL SUBTRACTION (SS) (1)
SPEECH ENHANCEMENT (1)
SPEECH PROCESSING (1)
SPEECH SAMPLE (1)
STANDARDS (1)
STEADY VOWEL (SV) REGIONS (1)
STORYTELLING STYLE (1)
SUB-SEGMENTAL (1)
SUPPORT VECTOR MACHINE (1)
SUPPORT VECTOR MACHINES (1)
SUPRASEGMENTAL LEVEL (1)
INFONA - science communication portal
