We present a generative model and inference algorithm for 3D nonrigid object tracking. The model, which we call G-flow, enables the joint inference of 3D position, orientation, and nonrigid deformations, as well as object texture and background texture. Optimal inference under G-flow reduces to a conditionally Gaussian stochastic filtering problem. The optimal solution to this problem reveals a new...
We present a new method for single-channel multi-talker speech recognition that combines loopy belief propagation and variational inference methods to control the complexity of inference. The method models each source using an HMM with a hierarchical set of acoustic states, and uses the max model to approximate how the sources interact to generate mixed data. Inference involves inferring a...
In model-based pattern recognition it is often useful to change the structure, or refactor, a model. For example, we may wish to find a Gaussian mixture model (GMM) with fewer components that best approximates a reference model. One application for this arises in speech recognition, where a variety of model size requirements exists for different platforms. Since the target size may not be known a...
We address the problem of single-channel speech separation and recognition using loopy belief propagation in a way that enables efficient inference for an arbitrary number of speech sources. The graphical model consists of a set of N Markov chains, each of which represents a language model or grammar for a given speaker. A Gaussian mixture model with shared states is used to model the hidden acoustic...
It has been a common practice in speech recognition and elsewhere to approximate the log likelihood of a Gaussian mixture model (GMM) with the maximum component log likelihood. While often a computational necessity, the max approximation comes at a price of inferior modeling when the Gaussian components significantly overlap. This paper shows how the approximation error can be reduced by changing...
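The max approximation described above can be illustrated with a minimal sketch (a hypothetical 1-D GMM with made-up parameters): the exact log likelihood is the log-sum-exp over weighted component log likelihoods, while the max approximation keeps only the dominant component, which underestimates the likelihood when components overlap.

```python
import math

def log_normal(x, mu, sigma):
    """Log density of a 1-D Gaussian."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2 * sigma ** 2)

def gmm_loglik(x, weights, mus, sigmas):
    """Exact GMM log likelihood: log-sum-exp of log(w_k) + log N(x; mu_k, sigma_k)."""
    terms = [math.log(w) + log_normal(x, m, s)
             for w, m, s in zip(weights, mus, sigmas)]
    t_max = max(terms)
    return t_max + math.log(sum(math.exp(t - t_max) for t in terms))

def gmm_loglik_max(x, weights, mus, sigmas):
    """Max approximation: keep only the dominant weighted component."""
    return max(math.log(w) + log_normal(x, m, s)
               for w, m, s in zip(weights, mus, sigmas))

# Two strongly overlapping components (illustrative parameters):
weights, mus, sigmas = [0.5, 0.5], [0.0, 0.5], [1.0, 1.0]
exact = gmm_loglik(0.25, weights, mus, sigmas)
approx = gmm_loglik_max(0.25, weights, mus, sigmas)
# The max approximation is a lower bound; the gap grows with component overlap.
```

Since `max_k a_k <= log sum_k exp(a_k)`, the approximation error `exact - approx` is always non-negative, and it is largest exactly where components significantly overlap, matching the observation in the abstract.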
We present a new probabilistic architecture for analyzing composite non-negative data, called Non-negative Subspace Analysis (NSA). The NSA model provides a framework for understanding the relationships between sparse subspace and mixture model based approaches, and encompasses a range of models, including Sparse Non-negative Matrix Factorization (SNMF) [1] and mixture-model based analysis as special...
Kullback Leibler (KL) divergence is widely used as a measure of dissimilarity between two probability distributions; however, the required integral is not tractable for Gaussian mixture models (GMMs), and naive Monte-Carlo sampling methods can be expensive. Our work aims to improve the estimation of KL divergence for GMMs by sampling methods. We show how to accelerate Monte-Carlo sampling using variational...
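The naive Monte-Carlo baseline mentioned above can be sketched as follows (assumed 1-D GMMs with hypothetical parameters): D(f||g) = E_f[log f(x) - log g(x)] is estimated by drawing samples from f and averaging the log-density ratio.

```python
import math
import random

def gmm_pdf(x, weights, mus, sigmas):
    """Density of a 1-D Gaussian mixture at x."""
    return sum(w * math.exp(-0.5 * ((x - m) / s) ** 2) / (s * math.sqrt(2 * math.pi))
               for w, m, s in zip(weights, mus, sigmas))

def gmm_sample(weights, mus, sigmas):
    """Draw one sample: pick a component by weight, then sample its Gaussian."""
    r, acc = random.random(), 0.0
    for w, m, s in zip(weights, mus, sigmas):
        acc += w
        if r <= acc:
            return random.gauss(m, s)
    return random.gauss(mus[-1], sigmas[-1])

def mc_kl(f, g, n=50_000):
    """Naive Monte-Carlo estimate of D(f||g); f, g are (weights, mus, sigmas)."""
    total = 0.0
    for _ in range(n):
        x = gmm_sample(*f)
        total += math.log(gmm_pdf(x, *f)) - math.log(gmm_pdf(x, *g))
    return total / n

f = ([0.7, 0.3], [0.0, 3.0], [1.0, 1.0])
g = ([0.5, 0.5], [1.0, 4.0], [1.0, 1.0])
kl_fg = mc_kl(f, g)  # non-negative, noisy for small n
```

The estimator is unbiased but its variance decays only as 1/n, which is the expense the abstract refers to; variational approximations are used there to accelerate this baseline.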
Many applications require the use of divergence measures between probability distributions. Several of these, such as the Kullback-Leibler (KL) divergence and the Bhattacharyya divergence, are tractable for simple distributions such as Gaussians, but are intractable for more complex distributions such as hidden Markov models (HMMs) used in speech recognizers. For tasks related to classification error,...
The Kullback Leibler (KL) divergence is a widely used tool in statistics and pattern recognition. The KL divergence between two Gaussian mixture models (GMMs) is frequently needed in the fields of speech and image recognition. Unfortunately the KL divergence between two GMMs is not analytically tractable, nor does any efficient computational algorithm exist. Some techniques cope with this problem...
In speech recognition it is often useful to determine how confusable two words are. For speech models this comes down to computing the Bayes error between two HMMs. This problem is analytically and numerically intractable. A common alternative, that is numerically approachable, uses the KL divergence in place of the Bayes error. We present new approaches to approximating the KL divergence, that combine...