Good speaker recognition systems should identify the speaker irrespective of what is spoken, including the non-speech sounds that are often produced during natural conversations. In this work, the inclusion of breath sounds in the training phase of speaker recognition is analyzed using the popular Gaussian mixture model-universal background model (GMM-UBM) and deep neural network (DNN) based systems...
Dysarthria is a motor speech impairment, often characterized by speech that is largely unintelligible to human listeners. Assessing the severity level of dysarthria provides insight into the progression of the underlying condition and is essential for planning therapy, as well as for improving automatic dysarthric speech recognition. In this paper, we propose a non-linguistic manner...
Automatic emotion recognition in spontaneous speech is an important part of a human-computer interaction system. However, emotion identification in spontaneous speech is difficult because the emotions expressed by the speaker are often not as prominent as in acted speech. In this paper, we propose a spontaneous speech emotion recognition framework that makes use of the associated...
Use of error-correcting codes (ECC) in a multiclass audio emotion recognition problem is proposed to improve emotion recognition accuracy. We visualize the emotion recognition system as a noisy communication channel, thus motivating the use of ECC. We assume the emotion recognition process consists of an audio feature extractor followed by an artificial neural network (ANN) for emotion classification...
We propose the use of error-correcting codes (ECC) in a multi-class audio emotion recognition scenario to improve emotion recognition accuracy in speech. In this paper, we visualize the emotion recognition system as a noisy communication channel, thus motivating the use of ECC in the emotion recognition process. We assume the emotion recognition process consists of an audio...
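Neither of the two ECC abstracts above includes implementation detail, but the core idea they describe — treating classifier errors as channel noise that a redundant class code can correct — can be sketched. The 7-bit codebook and emotion labels below are illustrative assumptions, not values from the papers:

```python
import numpy as np

# Hypothetical 7-bit codewords for 4 emotion classes (illustrative only):
# rows are chosen so every pair differs in 4 bit positions.
CODEBOOK = np.array([
    [0, 0, 0, 0, 0, 0, 0],   # neutral
    [1, 1, 1, 1, 0, 0, 0],   # happy
    [1, 1, 0, 0, 1, 1, 0],   # sad
    [1, 0, 1, 0, 1, 0, 1],   # angry
])

def decode(bits):
    """Map a (possibly corrupted) bit vector to the nearest codeword's class."""
    dists = np.sum(CODEBOOK != np.asarray(bits), axis=1)  # Hamming distances
    return int(np.argmin(dists))

# One bit flipped by the "noisy channel" (one misfiring binary classifier)
# is still decoded to the intended class:
print(decode([1, 1, 1, 0, 0, 0, 0]))  # -> 1 ("happy") despite one bit error
```

In the papers' setting, each bit of the codeword would be predicted by the ANN; because the minimum pairwise Hamming distance here is 4, any single bit error is corrected by nearest-codeword decoding.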
In this paper we show that knowledge of the statistics of the noise contaminating a signal leads to a better choice of filter for removing that noise. Specifically, we show theoretically that additive white Gaussian noise (AWGN) contaminating a signal is best filtered using a Gaussian filter mask whose parameters are related to the noise statistics of the AWGN. The main contribution of the paper...
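The abstract stops before the derivation, but the effect it relies on — a Gaussian mask averaging away AWGN while passing a slowly varying signal — is easy to demonstrate. The signal, noise level, and mask parameters below are arbitrary illustrative choices, not the paper's:

```python
import numpy as np

def gaussian_kernel(sigma, radius):
    """1-D Gaussian mask, normalized to unit sum."""
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2.0 * sigma**2))
    return k / k.sum()

rng = np.random.default_rng(0)
clean = np.sin(2 * np.pi * np.arange(500) / 100.0)   # slowly varying signal
noisy = clean + rng.normal(0.0, 0.3, clean.size)     # AWGN, sigma_n = 0.3

filtered = np.convolve(noisy, gaussian_kernel(sigma=2.0, radius=6), mode="same")

# Convolving with the mask shrinks the noise variance (by the sum of squared
# mask weights) while barely attenuating the slowly varying signal:
print(np.mean((noisy - clean)**2) > np.mean((filtered - clean)**2))  # True
```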
Speech is one of the most popular parameters used to identify a speaker from her spoken phrase. Feature extraction from speech is a necessary first step in a speaker identification process. Traditionally, computation of Mel Frequency Cepstral Coefficient (MFCC) features uses a Hamming window as a preprocessing step to reduce spectral leakage. However, the Hamming window results in reasonable side lobes...
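As context for the windowing step this abstract revisits, the traditional Hamming-window preprocessing of MFCC extraction can be sketched as follows. The frame and hop sizes are the common 25 ms / 10 ms choice at 16 kHz, not values taken from the paper:

```python
import numpy as np

def frame_and_window(signal, frame_len=400, hop=160):
    """Split a signal into overlapping frames and apply a Hamming window,
    the usual preprocessing before the FFT stage of MFCC extraction."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hamming(frame_len)  # tapers frame edges to reduce spectral leakage
    frames = np.stack([signal[i*hop : i*hop + frame_len] for i in range(n_frames)])
    return frames * window

# 1 s of a toy 440 Hz tone at 16 kHz; 25 ms frames with a 10 ms hop
sr = 16000
t = np.arange(sr) / sr
frames = frame_and_window(np.sin(2 * np.pi * 440 * t))
print(frames.shape)  # (98, 400)
```

The paper's point of departure is precisely this `np.hamming` choice; alternative windows trade off main-lobe width against side-lobe level.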
In this work, excitation source features are explored for classifying infant cries. The types of infant cries considered are hunger, pain, and wet-diaper. The source features explored are the epoch interval contour (EIC), the epoch strength contour (ESC), epoch sharpness, and the slopes of the EIC and ESC. Gaussian Mixture Models (GMM)...
In this paper we explore the performance of multilingual speaker recognition systems developed on the IITKGP-MLILSC speech corpus. Closed-set speaker identification and speaker verification experiments are individually conducted on 13 widely spoken Indian languages. In particular, we focus on the effect of language mismatch in the speaker recognition performance of individual languages and all languages...
The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) for determining the sequence of Consonant-Vowel (CV) units present in a given speech utterance. In this work, we focus on developing CVRSs for two Indian languages, namely Bengali and Odia. This framework for developing CVRSs can be extended to any Indian language. We have developed two separate CVRSs for Bengali...
Voice-based call centers enable customers to query for information by speaking to agents in the call center. Most often these call conversations are recorded for analysis, with the intent of identifying things that can help the call center serve the customer better. Today, the recorded conversations are analyzed by humans who listen to the calls, which...
It is a well-known fact that the majority of rural India earns its livelihood from agriculture and farming. Although India is a net exporter of various agricultural products, the farmer, who happens to be the primary producer, has remained information-poor, which puts him at a disadvantage. With little or no knowledge of prices at the markets, farmers have no leverage to negotiate better prices for their...
Continuous density hidden Markov models (CD-HMMs) are doubly stochastic processes which are extensively used in speech and image signal processing. Especially in the case of isolated spoken word recognition systems, the spoken words are usually modeled using HMMs. While CD-HMMs are in extensive use, to most of the speech community the HMMs remain abstract, in the sense that there has been no good way of visualizing...
The rate at which we speak has a bearing on comprehensibility and has become important with mushrooming call center operations. An optimal speaking rate is one that is neither too fast nor too slow: speech that is too fast becomes unintelligible, while speech that is too slow makes the conversation tedious. Speaking rate definitely varies depending on the emotional state...
Speaker change detection is a necessary first step in several applications. In this paper, we propose an unsupervised two-pass algorithm for speaker change detection in conversational speech. A Generalized Likelihood Ratio (GLR) metric is used in the first pass to coarsely identify speaker change points; during the second pass, these candidate change points are finely analyzed, assuming that the initial...
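The abstract does not define its GLR metric in detail; the following is a minimal single-Gaussian version of the GLR distance commonly used for speaker change detection, with synthetic feature vectors standing in for real speech segments:

```python
import numpy as np

def glr_distance(x, y):
    """GLR distance between two feature sequences, each modelled by a single
    full-covariance Gaussian with maximum-likelihood parameters. Large values
    suggest the two segments were produced by different speakers."""
    z = np.vstack([x, y])          # pooled segment under the "same speaker" hypothesis
    def half_n_logdet(seg):        # data-dependent part of the Gaussian log-likelihood
        cov = np.cov(seg, rowvar=False)
        return 0.5 * len(seg) * np.linalg.slogdet(cov)[1]
    return half_n_logdet(z) - half_n_logdet(x) - half_n_logdet(y)

rng = np.random.default_rng(1)
same = glr_distance(rng.normal(0, 1, (200, 5)), rng.normal(0, 1, (200, 5)))
diff = glr_distance(rng.normal(0, 1, (200, 5)), rng.normal(3, 1, (200, 5)))
print(diff > same)  # the mean-shifted pair scores much higher
```

In a two-pass scheme like the one described, a sliding pair of windows would be scored this way in the first pass, and local maxima of the distance kept as candidate change points.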
The potential use of non-linear speech features has not been investigated for music analysis, although other commonly used speech features like Mel Frequency Cepstral Coefficients (MFCC) and pitch have been used extensively. In this paper, we assume an audio signal to be a sum of modulated sinusoids and then use the energy separation algorithm to decompose the audio into amplitude and frequency modulation...
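The snippet does not specify which energy separation variant is used; a standard discrete one (DESA-1, built on the Teager-Kaiser energy operator) can be sketched as follows. The test signal is a toy pure tone, not audio from the paper:

```python
import numpy as np

def tkeo(x):
    """Teager-Kaiser energy operator: psi(n) = x(n)^2 - x(n-1)*x(n+1)."""
    return x[1:-1]**2 - x[:-2] * x[2:]

def desa1(x):
    """DESA-1 energy separation: recover the instantaneous amplitude and
    frequency (radians/sample) of an AM-FM signal from the Teager energies
    of the signal and of its first difference."""
    y = np.diff(x)                     # y(n) = x(n) - x(n-1)
    px, py = tkeo(x), tkeo(y)
    s = py[:-1] + py[1:]               # psi[y(n)] + psi[y(n+1)]
    pxm = px[1:-1]                     # psi[x(n)] aligned with s
    cosw = 1.0 - s / (4.0 * pxm)
    omega = np.arccos(np.clip(cosw, -1.0, 1.0))   # instantaneous frequency
    amp = np.sqrt(pxm / (1.0 - cosw**2))          # instantaneous amplitude
    return amp, omega

# Sanity check on a pure tone: amplitude 1.5, frequency 0.2 rad/sample
x = 1.5 * np.cos(0.2 * np.arange(1000) + 0.3)
amp, omega = desa1(x)
print(np.median(amp), np.median(omega))  # approximately 1.5 and 0.2
```

On a constant-amplitude tone the recovery is exact up to rounding; for music, the signal would first be decomposed into the modulated sinusoids the abstract assumes, with DESA-1 applied per component.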