Search results for: Rao

Items from 1 to 6 out of 6 results

chapter

I-vector based deep neural network acoustic model adaptation using multilingual language resource

Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, more

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

I-vector adaptation of DNN-HMM acoustic models has shown clear performance improvement for speech recognition. In this paper, we study this technique on Babel task. we use Swahili as target language (training data of 50 hours) and another 6 languages as multilingual resources to train i-vector extractors respectively. Our study shows that i-vector extractors trained with more multilingual data only...

article

Detection of Vowel Offset Point From Speech Signal

Jainath Yadav, K. Sreenivasa Rao

IEEE Signal Processing Letters > 2013 > 20 > 4 > 299 - 302

Vowel regions play important role in various speech tasks, such as speech segmentation, speaker-verification, prosody modification and emotion conversion. The instants at which the onset and offset of vowel take place in the speech signal are known as vowel onset point and vowel offset point, respectively. Vowel regions start with the vowel onset point and end with the vowel offset point. In this...

chapter

Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms

S.T. Shivappa, M.M. Trivedi, B.D. Rao

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops > 107 - 114

2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene understanding in the context of a smart meeting room involves the extraction of various kinds of cues at different levels of semantic abstraction. Specifically, human activity in a scene is usually monitored using arrays of audio and visual sensors. Tasks such as person localization and tracking, speaker ID, focus of attention detection, speech recognition and affective state recognition are...

chapter

Role of head pose estimation in speech acquisition from distant microphones

S.T. Shivappa, B.D. Rao, M.M. Trivedi

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3557 - 3560

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Reverberant environments pose a challenge to speech acquisition from distant microphones. Approaches using microphone arrays have met with limited success. Recent research using audio-visual sensors for tasks such as speaker localization has shown improvement over traditional audio-only approaches. Using computer vision techniques we can estimate the orientation of the speaker's head in addition to...

chapter

Significance of Word and Syllable Level Information for Expressive Speech Processing

K.S. Rao, S.R.M. Prasanna, T.V. Sagar

2009 Seventh International Conference on Advances in Pattern Recognition > 159 - 162

2009 Seventh International Conference on Advances in Pattern Recognition (ICAPR 2009)

In general, human beings make use of expressions (emotions) through speech, facial movements and gestures for conveying the crucial information. Mostly, expressions in speech can be attributed to longer segments, i.e., suprasegmental features also known to be prosodic features. In this paper we analyze the expressions in speech using prosodic features from utterance level, word level and syllable...

chapter

Keyword Spotting using Vowel Onset Point, Vector Quantization and Hidden Markov Modeling Based techniques

B.V.S. Reddy, K.V. Rao, S.R.M. Prasanna

TENCON 2008 - 2008 IEEE Region 10 Conference > 1 - 4

TENCON 2008 - 2008 IEEE Region 10 Conference

This work demonstrates the development of Keyword Spotting (KWS) system using Vowel Onset Point (VOP), Vector Quantization (VQ) and Hidden Markov Model(HMM) based techniques. The goal of KWS system is to spot the keywords present in the test speech signal, while neglecting rest of the words. In this work, first independent KWS systems will be developed using VOP, VQ and HMM techniques. Each of these...

Filter options

Keywords:
DATA MINING
SPEECH RECOGNITION

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

SPEECH (5)
FEATURE EXTRACTION (3)
ARRAY SIGNAL PROCESSING (2)
DATABASES (2)
HIDDEN MARKOV MODELS (2)
KEYWORD SPOTTING (2)
MICROPHONES (2)
PROBABILITY DENSITY FUNCTION (2)
SPEECH PROCESSING (2)
VOWEL ONSET POINT (2)
ACOUSTICS (1)
ACTIVITY ANALYSIS (1)
ADAPTATION (1)
ADAPTATION MODELS (1)
AFFECTIVE STATE RECOGNITION (1)
ATTENTION DETECTION (1)
AUDIO SENSOR ARRAYS (1)
AUDIO-VISUAL FUSION (1)
AUDIO-VISUAL SENSOR (1)
AUDIO-VISUAL SYSTEMS (1)
BEAMFORMING (1)
BUSINESS DATA PROCESSING (1)
CAMERAS (1)
COMPUTER VISION (1)
DATA MODELS (1)
DEEP NEURAL NETWORK (1)
DISTANCE MEASUREMENT (1)
DURATION (1)
EMOTION RECOGNITION (1)
EMOTIONS (1)
ENERGY (1)
EXCITATION SOURCE (1)
EXPRESSION (1)
FACIAL MOVEMENT (1)
GESTURE ANALYSIS (1)
GLOTTAL CLOSURE REGION (1)
HEAD POSE ESTIMATION (1)
HEAD-POSE ESTIMATION (1)
HIDDEN MARKOV MODELING (1)
HIERARCHICAL AUDIO-VISUAL CUE INTEGRATION (1)
HUMAN-COMPUTER INTERFACE (1)
I-VECTOR (1)
IMAGE SENSORS (1)
INTELLIGENT MEETING ROOMS (1)
INTELLIGENT SPACES (1)
MAGNETIC HEADS (1)
MATERIALS (1)
MICROPHONE (1)
MODULATION (1)
MODULATION SPECTRUM (1)
MULTILINGUAL (1)
PITCH (1)
POSE ESTIMATION (1)
PROSODIC FEATURE (1)
PROSODIC FEATURES (1)
REVERBERANT ENVIRONMENT (1)
SEMANTIC ABSTRACTION (1)
SENSORS (1)
SMART MEETING ROOM (1)
SPEAKER ID (1)
SPEAKER LOCALIZATION (1)
SPEAKER RECOGNITION (1)
SPECTRAL PEAKS (1)
SPEECH ACQUISITION (1)
SPEECH ENHANCEMENT (1)
SPEECH EXPRESSION (1)
SPEECH SIGNAL (1)
SPEECH SYNTHESIS (1)
SUPRASEGMENTAL FEATURE (1)
SYLLABLE LEVEL (1)
SYLLABLE LEVEL INFORMATION (1)
TESTING (1)
TEXT RECOGNITION (1)
TRAINING (1)
UTTERANCE LEVEL (1)
VECTOR QUANTISATION (1)
VECTOR QUANTIZATION (1)
VISUAL SENSOR ARRAYS (1)
VOWEL OFFSET POINT (1)
WORD LEVEL (1)
WORD LEVEL INFORMATION (1)
more

INFONA - science communication portal

Search results for: Rao

I-vector based deep neural network acoustic model adaptation using multilingual language resource

Detection of Vowel Offset Point From Speech Signal

Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms

Role of head pose estimation in speech acquisition from distant microphones

Significance of Word and Syllable Level Information for Expressive Speech Processing

Keyword Spotting using Vowel Onset Point, Vector Quantization and Hidden Markov Modeling Based techniques

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options