Search results for: Rao

Items 1 to 6 of 6 results

chapter

I-vector based deep neural network acoustic model adaptation using multilingual language resource

Haihua Xu, Wei Rao, Xiong Xiao, Hao Huang, and others

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 5

I-vector adaptation of DNN-HMM acoustic models has shown clear performance improvement for speech recognition. In this paper, we study this technique on the Babel task. We use Swahili as the target language (50 hours of training data) and another 6 languages as multilingual resources to train i-vector extractors, respectively. Our study shows that i-vector extractors trained with more multilingual data only...

article

Detection of Vowel Offset Point From Speech Signal

Jainath Yadav, K. Sreenivasa Rao

IEEE Signal Processing Letters > 2013 > 20 > 4 > 299 - 302

Vowel regions play an important role in various speech tasks, such as speech segmentation, speaker verification, prosody modification and emotion conversion. The instants at which the onset and offset of a vowel take place in the speech signal are known as the vowel onset point and the vowel offset point, respectively. Vowel regions start with the vowel onset point and end with the vowel offset point. In this...

chapter

Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms

S.T. Shivappa, M.M. Trivedi, B.D. Rao

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops > 107 - 114

2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Scene understanding in the context of a smart meeting room involves the extraction of various kinds of cues at different levels of semantic abstraction. Specifically, human activity in a scene is usually monitored using arrays of audio and visual sensors. Tasks such as person localization and tracking, speaker ID, focus of attention detection, speech recognition and affective state recognition are...

chapter

Role of head pose estimation in speech acquisition from distant microphones

S.T. Shivappa, B.D. Rao, M.M. Trivedi

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3557 - 3560

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Reverberant environments pose a challenge to speech acquisition from distant microphones. Approaches using microphone arrays have met with limited success. Recent research using audio-visual sensors for tasks such as speaker localization has shown improvement over traditional audio-only approaches. Using computer vision techniques we can estimate the orientation of the speaker's head in addition to...

chapter

Significance of Word and Syllable Level Information for Expressive Speech Processing

K.S. Rao, S.R.M. Prasanna, T.V. Sagar

2009 Seventh International Conference on Advances in Pattern Recognition > 159 - 162

2009 Seventh International Conference on Advances in Pattern Recognition (ICAPR 2009)

In general, human beings make use of expressions (emotions) through speech, facial movements and gestures to convey crucial information. Mostly, expressions in speech can be attributed to longer segments, i.e., suprasegmental features, also known as prosodic features. In this paper we analyze the expressions in speech using prosodic features from the utterance level, word level and syllable...

chapter

Keyword Spotting using Vowel Onset Point, Vector Quantization and Hidden Markov Modeling Based techniques

B.V.S. Reddy, K.V. Rao, S.R.M. Prasanna

TENCON 2008 - 2008 IEEE Region 10 Conference > 1 - 4

This work demonstrates the development of a Keyword Spotting (KWS) system using Vowel Onset Point (VOP), Vector Quantization (VQ) and Hidden Markov Model (HMM) based techniques. The goal of a KWS system is to spot the keywords present in the test speech signal, while neglecting the rest of the words. In this work, independent KWS systems are first developed using VOP, VQ and HMM techniques. Each of these...

Filter options

Keywords:
DATA MINING
SPEECH RECOGNITION

Publication type

book (5)
article (1)

Keywords

SPEECH (5)
FEATURE EXTRACTION (3)
ARRAY SIGNAL PROCESSING (2)
DATABASES (2)
HIDDEN MARKOV MODELS (2)
KEYWORD SPOTTING (2)
MICROPHONES (2)
PROBABILITY DENSITY FUNCTION (2)
SPEECH PROCESSING (2)
VOWEL ONSET POINT (2)
ACOUSTICS (1)
ACTIVITY ANALYSIS (1)
ADAPTATION (1)
ADAPTATION MODELS (1)
AFFECTIVE STATE RECOGNITION (1)
ATTENTION DETECTION (1)
AUDIO SENSOR ARRAYS (1)
AUDIO-VISUAL FUSION (1)
AUDIO-VISUAL SENSOR (1)
AUDIO-VISUAL SYSTEMS (1)
BEAMFORMING (1)
BUSINESS DATA PROCESSING (1)
CAMERAS (1)
COMPUTER VISION (1)
DATA MODELS (1)
DEEP NEURAL NETWORK (1)
DISTANCE MEASUREMENT (1)
DURATION (1)
EMOTION RECOGNITION (1)
EMOTIONS (1)
ENERGY (1)
EXCITATION SOURCE (1)
EXPRESSION (1)
FACIAL MOVEMENT (1)
GESTURE ANALYSIS (1)
GLOTTAL CLOSURE REGION (1)
HEAD POSE ESTIMATION (1)
HEAD-POSE ESTIMATION (1)
HIDDEN MARKOV MODELING (1)
HIERARCHICAL AUDIO-VISUAL CUE INTEGRATION (1)
HUMAN-COMPUTER INTERFACE (1)
I-VECTOR (1)
IMAGE SENSORS (1)
INTELLIGENT MEETING ROOMS (1)
INTELLIGENT SPACES (1)
MAGNETIC HEADS (1)
MATERIALS (1)
MICROPHONE (1)
MODULATION (1)
MODULATION SPECTRUM (1)
MULTILINGUAL (1)
PITCH (1)
POSE ESTIMATION (1)
PROSODIC FEATURE (1)
PROSODIC FEATURES (1)
REVERBERANT ENVIRONMENT (1)
SEMANTIC ABSTRACTION (1)
SENSORS (1)
SMART MEETING ROOM (1)
SPEAKER ID (1)
SPEAKER LOCALIZATION (1)
SPEAKER RECOGNITION (1)
SPECTRAL PEAKS (1)
SPEECH ACQUISITION (1)
SPEECH ENHANCEMENT (1)
SPEECH EXPRESSION (1)
SPEECH SIGNAL (1)
SPEECH SYNTHESIS (1)
SUPRASEGMENTAL FEATURE (1)
SYLLABLE LEVEL (1)
SYLLABLE LEVEL INFORMATION (1)
TESTING (1)
TEXT RECOGNITION (1)
TRAINING (1)
UTTERANCE LEVEL (1)
VECTOR QUANTISATION (1)
VECTOR QUANTIZATION (1)
VISUAL SENSOR ARRAYS (1)
VOWEL OFFSET POINT (1)
WORD LEVEL (1)
WORD LEVEL INFORMATION (1)

INFONA - science communication portal
