Search results for: Hugo Van hamme

Items from 1 to 8 out of 8 results

chapter

Incrementally learn the relevance of words in a dictionary for spoken language acquisition

Vincent Renkens, Vikrant Tomar, Hugo Van hamme

2016 IEEE Spoken Language Technology Workshop (SLT) > 144 - 150

2016 IEEE Spoken Language Technology Workshop (SLT)

This paper discusses a spoken language acquisition system for a command-and-control interface. The proposed system learns a set of words through coupled commands and demonstrations. The user can teach the system a new command by demonstrating the uttered command through an alternative interface. With these coupled commands and demonstrations, the system can learn the acoustic representations of the...

chapter

Latent variable speaker adaptation of Gaussian mixture weights and means

Xueru Zhang, Kris Demuynck, Hugo Van hamme

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4349 - 4352

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

We describe a novel fast speaker adaptation algorithm for large vocabulary speech recognition systems, which adapts both the Gaussian means and the mixture weights. Gaussian means are expressed as a linear combination of eigenvoices estimated with principal component analysis. The non-negative Gaussian mixture weights are expressed as a linear combination of a set of latent vectors estimated with...

chapter

Weakly supervised keyword learning using sparse representations of speech

Joris Driesen, Jort Gemmeke, Hugo Van hamme

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5145 - 5148

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

When applied to speech, Non-negative Matrix Factorization is capable of learning a small vocabulary of words, foregoing any prior linguistic knowledge. This makes it adequate for small-scale speech applications where flexibility is of the utmost importance, e.g. assistive technology for the speech impaired. However, its performance depends on the way its inputs are represented. We propose the use...

chapter

Tri-factorization learning of sub-word units with application to vocabulary acquisition

Meng Sun, Hugo Van hamme

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5177 - 5180

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In prior work, we proposed a method for vocabulary acquisition based on a co-occurrence model and non-negative matrix factorization. The vocabulary is described in terms of co-occurrence statistics of frame-level acoustic descriptions and suffers from poor scalability to larger vocabularies. Much like whole-word HMM models, there is no reuse of a sub-word units such as phone models. In this paper,...

chapter

Fast word acquisition in an NMF-based learning framework

Joris Driesen, Hugo Van hamme

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5137 - 5140

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

A speech recognition system that automatically learns word models for a small vocabulary from examples of its usage, without using prior linguistic information, can be of great use in cognitive robotics, human-machine interfaces, and assistive devices. In the latter case, the user's speech capabilities may also be affected. In this paper, we consider a NMF-based learning framework capable of doing...

chapter

Speaker age estimation and gender detection based on supervised Non-Negative Matrix Factorization

Mohamad Hasan Bahari, Hugo Van Hamme

2011 IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BIOMS) > 1 - 6

2011 IEEE Workshop on Biometric Measurements and Systems for Security and Medical Applications (BIOMS)

In many criminal cases, evidence might be in the form of telephone conversations or tape recordings. Therefore, law enforcement agencies have been concerned about accurate methods to profile different characteristics of a speaker from recorded voice patterns, which facilitate the identification of a criminal. This paper proposes a new approach for speaker gender detection and age estimation, based...

chapter

Rapid speaker adaptation with speaker adaptive training and non-negative matrix factorization

Xueru Zhang, Kris Demuynck, Hugo Van hamme

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4456 - 4459

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we describe a novel speaker adaptation algorithm based on Gaussian mixture weight adaptation. A small number of latent speaker vectors are estimated with non-negative matrix factorization (NMF). These base vectors encode the correlations between Gaussian activations as learned from the train data. Expressing the speaker dependent Gaussian mixture weights as a linear combination of a...

chapter

Learning from images and speech with Non-negative Matrix Factorization enhanced by input space scaling

Joris Driesen, Hugo Van hamme, W Bastiaan Kleijn

2010 IEEE Spoken Language Technology Workshop > 1 - 6

2010 IEEE Spoken Language Technology Workshop (SLT 2010)

Computional learning from multimodal data is often done with matrix factorization techniques such as NMF (Non-negative Matrix Factorization), pLSA (Probabilistic Latent Semantic Analysis) or LDA (Latent Dirichlet Allocation). The different modalities of the input are to this end converted into features that are easily placed in a vectorized format. An inherent weakness of such a data representation...

Filter options

Keywords:
ACOUSTICS

Publication date

Set your own date range

Keywords

TRAINING (6)
SPEECH (5)
HIDDEN MARKOV MODELS (4)
SPEECH RECOGNITION (4)
VECTORS (4)
VOCABULARY (4)
VOCABULARY ACQUISITION (4)
ADAPTATION MODELS (3)
MACHINE LEARNING (3)
DATA MODELS (2)
HISTOGRAMS (2)
MATRIX DECOMPOSITION (2)
NON-NEGATIVE MATRIX FACTORIZATION (2)
NONNEGATIVE MATRIX FACTORIZATION (2)
SILICON (2)
SPEAKER ADAPTIVE TRAINING (2)
TRAINING DATA (2)
ACCURACY (1)
ACOUSTIC SUB-WORD GENERATION (1)
AGE ESTIMATION (1)
BAYESIAN METHODS (1)
COMPUTIONAL LEARNING (1)
DATA REPRESENTATION (1)
DICTIONARIES (1)
EIGENVOICE AND WEIGHT ADAPTATION (1)
ESTIMATION (1)
EXEMPLARS (1)
FAST SPEAKER ADAPTATION (1)
FEATURE SELECTION (1)
GENDER DETECTION (1)
GENERAL REGRESSION NEURAL NETWORK (1)
IMAGE RECOGNITION (1)
INPUT SPACE SCALING (1)
LASSO (1)
LATENT DIRICHLET ALLOCATION (1)
LATENT VARIABLE METHOD (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
MATHEMATICAL MODEL (1)
MAXIMUM LIKELIHOOD LINEAR REGRESSION (1)
MULTI-MODAL LEARNING (1)
MULTIMODAL DATA (1)
NMF-BASED RECOGNITION FRAMEWORK (1)
NON-NEGATIVE MATRIX FACTORISATION (1)
OPTIMIZATION (1)
PATTERN DISCOVERY (1)
PROBABILISTIC LATENT SEMANTIC ANALYSIS (1)
PROBABILISTIC LOGIC (1)
SEMANTICS (1)
SEMI-SUPERVISED LEARNING (1)
SPARSENESS (1)
SPEAKER ADAPTATION (1)
SPECTRAL EMBEDDING (1)
SPOKEN LANGUAGE ACQUISITION (1)
UNSUPERVISED LEARNING (1)
VOCABULARY LEARNING (1)
WEIGHT ADAPTATION (1)
WEIGHTED SUPERVISED NON-NEGATIVE MATRIX FACTORIZATION (1)
WIRELESS SENSOR NETWORKS (1)
more

INFONA - science communication portal

Search results for: Hugo Van hamme

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options