Search results for: Anil Kumar

Items from 1 to 7 out of 7 results

article

Deep-Sparse-Representation-Based Features for Speech Recognition

Pulkit Sharma, Vinayak Abrol, Anil Kumar Sao

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 11 > 2162 - 2175

Features derived using sparse representation (SR)-based approaches have been shown to yield promising results for speech recognition tasks. In most of the approaches, the SR corresponding to speech signal is estimated using a dictionary, which could be either exemplar based or learned. However, a single-level decomposition may not be suitable for the speech signal, as it contains complex hierarchical...

chapter

Compressed sensing for unit selection based speech synthesis

Pulkit Sharma, Vinayak Abrol, Anil Kumar Sao

2015 23rd European Signal Processing Conference (EUSIPCO) > 1731 - 1735

2015 23rd European Signal Processing Conference (EUSIPCO)

This paper proposes an approach based on compressed sensing to reduce the footprint of speech corpus in unit selection based speech synthesis (USS) systems. It exploits the observation that speech signal can have a sparse representation (in suitable choice of basis functions) and can be estimated effectively using the sparse coding framework. Thus, only few significant coefficients of the sparse vector...

chapter

Various front end tools for digital speech processing

Shiva Prasad, Anil Kumar, Manjunatha, Kodanda Ramaiah

2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom) > 905 - 911

2015 2nd International Conference on "Computing for Sustainable Global Development" (INDIACom)

Speech is an informative signal, which conveys many information's like status of the speaker, environmental conditions of the speaker: the other necessary parameters which are classified as prosodic features and general features of speech. As speech is a signal which can be analysed by subjecting and can be inspected to various criteria with the implication of several available techniques. In this...

chapter

Neutral to anger speech conversion using non-uniform duration modification

Anil Kumar Vuppala, Sudarsana Reddy Kadiri

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 4

2014 9th International Conference on Industrial and Information Systems (ICIIS)

In this paper, the non-uniform duration modification is exploited along with other prosody features for neutral speech to anger speech conversion. The non-uniform duration modification method modifies the durations of vowel and pause segments by different modification factors. Vowel segments are modified by factors based on their identities, and pause segments by uniform factors. Consonant and transition...

chapter

A three stage hybrid model to perform feature level speech signal recognition

Sandhya Saroha, Anil Kumar

2014 5th International Conference - Confluence The Next Generation Information Technology Summit (Confluence) > 691 - 696

2014 5th International Conference- Confluence The Next Generation Information Technology Summit

In this paper, a three stage improved speech signal recognition model is presented. The presented approach improved the recognition process by reducing the process time and to provide robust speech recognition. In first layer of presented model, the feature extraction from speech is done using Statistical Analysis based DWT approach. The extracted feature based recognition reduced the signal size...

chapter

Improved Syllable Nuclei Detection Using Formant Energy in Glottal Closure Regions

Hari Krishna Vydana, Mounika K V, Anil Kumar Vuppala

2014 International Conference on Devices, Circuits and Communications (ICDCCom) > 1 - 6

2014 International Conference on Devices, Circuits and Communications (ICDCCom)

Robust syllabification of continuous speech is a vital aspect of language and speech processing systems. Syllabification of speech can be done by detecting the syllable nuclei. Syllable is the basic production unit of human speech and syllable nuclei can be attributed to high energy sonarants or resonant sounds which are relatively loud and carry a clear pitch. In this work, high spectral energy at...

chapter

Effect of Low Bit Rate Speech Coding on Epoch Extraction

Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti, K Sreenivasa Rao

2011 International Conference on Devices and Communications (ICDeCom) > 1 - 4

2011 International Conference on Devices and Communications (ICDeCom)

Speech coding is one of the major degradation involved in building the speech systems in mobile environment. In this paper, we are exploring the effect of low bit rate speech coding on the accuracy of detection of epochs. Epoch is referred as the instant of significant excitation of the vocal-tract system during production of speech. Many speech applications depend on the the accurate estimation of...

Filter options

Keywords:
SPEECH
SPEECH PROCESSING

Publication date

Set your own date range

Publication type

book (6)
article (1)

Keywords

DATABASES (3)
DICTIONARIES (2)
HIDDEN MARKOV MODELS (2)
SPEECH CODING (2)
SPEECH RECOGNITION (2)
VOCODERS (2)
ANGER SPEECH (1)
ARRAYS (1)
BIT RATE (1)
CELP (1)
CEPSTRUM (1)
CMU-ARCTIC DATA (1)
COMPRESSED SENSING (1)
CONVOLUTION (1)
DATA MODELS (1)
DEEP SPARSE REPRESENTATION (DSR) (1)
DELAYS (1)
DICTIONARY LEARNING (1)
DISCRETE WAVELET TRANSFORMS (1)
DURATION CONTOUR (1)
DWT (1)
DYNAMIC PROGRAMMING (1)
DYNAMIC PROGRAMMING PROJECTED PHASE SLOPE (1)
ELECTROGLOTTOGRAPH (1)
EMOTION CONVERSION (1)
EPOCH EXTRACTION METHODS (1)
ETSI 06.10 (1)
EXCITATION SOURCE (1)
FEATURE EXTRACTION (1)
FILTER BANKS (1)
FILTERING THEORY (1)
FILTRATION (1)
FOURIER TRANSFORM (1)
FS-1016 (1)
GSM (1)
GSM FULL RATE (1)
HMM (1)
INDEXES (1)
INTENSITY CONTOUR (1)
INTERPOLATION (1)
LOW BIT RATE SPEECH CODING (1)
MACHINE LEARNING (1)
MATLAB (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MOBILE ENVIRONMENT (1)
NEUTRAL SPEECH (1)
NOISE (1)
NON-UNIFORM DURATION MODIFICATION (1)
PITCH SHIFT (1)
PRINCIPAL COMPONENT ANALYSIS (1)
RESONANT FREQUENCY (1)
SPARSE MATRICES (1)
SPARSE REPRESENTATION (1)
SPECTRAL SUBTRACTION (1)
SPECTROGRAM (1)
SPEECH CODERS (1)
SPEECH SIGNAL (1)
SPEECH SYNTHESIS (1)
SPEECH SYSTEMS (1)
STFT (1)
TIME-VARYING CHARACTERISTICS (1)
VOCAL-TRACT SYSTEM (1)
ZERO FREQUENCY FILTER (1)
more

INFONA - science communication portal

Search results for: Anil Kumar

Deep-Sparse-Representation-Based Features for Speech Recognition

Compressed sensing for unit selection based speech synthesis

Various front end tools for digital speech processing

Neutral to anger speech conversion using non-uniform duration modification

A three stage hybrid model to perform feature level speech signal recognition

Improved Syllable Nuclei Detection Using Formant Energy in Glottal Closure Regions

Effect of Low Bit Rate Speech Coding on Epoch Extraction

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options