Search results for: C. Rose

Items from 1 to 6 out of 6 results

chapter

Dealing with acoustic mismatch for training multilingual subspace Gaussian mixture models for speech recognition

Aanchan Mohan, Sina Hamidi Ghalehjegh, Richard C Rose

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4893 - 4896

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling technique suitable for configuring multilingual speech recognition systems. It is attractive for this purpose since its parametrization allows its “shared” model parameters to be trained with data from multiple languages [1]. In this work, we report on the results of an experimental study carried out with...

chapter

Applying deep-layered clustering to mammography image analytics

Derek C Rose, Itamar Arel, Thomas P Karnowski, Vincent C Paquit

2010 Biomedical Sciences and Engineering Conference > 1 - 4

2010 Biomedical Sciences and Engineering Conference (BSEC 2010)

This paper details a methodology and preliminary results for applying a hierarchy of clustering units to mammographic image data. The identification of patients with breast cancer through the detection of microcalcifications and masses is a demanding classification problem; minimal false negatives are desired while simultaneously avoiding false positives that cause unnecessary cost to patients and...

chapter

Approaches to automatic lexicon learning with limited training examples

N Goel, S Thomas, M Agarwal, P Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5094 - 5097

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Preparation of a lexicon for speech recognition systems can be a significant effort in languages where the written form is not exactly phonetic. On the other hand, in languages where the written form is quite phonetic, some common words are often mispronounced. In this paper, we use a combination of lexicon learning techniques to explore whether a lexicon can be learned when only a small lexicon is...

chapter

Subspace Gaussian Mixture Models for speech recognition

Daniel Povey, Lukáš Burget, Mohit Agarwal, Pinar Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4330 - 4333

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian Mixture Model (SGMM). Globally shared parameters define the subspace. This style of acoustic model allows for a much more compact representation and gives better results...

chapter

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

L Burget, P Schwarz, M Agarwal, P Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4334 - 4337

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Although research has previously been done on multilingual speech recognition, it has been found to be very difficult to improve over separately trained systems. The usual approach has been to use some kind of “universal phone set” that covers multiple languages. We report experiments on a different approach to multilingual speech recognition, in which the phone sets are entirely distinct but the...

article

Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier]

I Arel, D C Rose, T P Karnowski

IEEE Computational Intelligence Magazine > 2010 > 5 > 4 > 13 - 18

This article provides an overview of the mainstream deep learning approaches and research directions proposed over the past decade. It is important to emphasize that each approach has strengths and "weaknesses, depending on the application and context in "which it is being used. Thus, this article presents a summary on the current state of the deep machine learning field and some perspective...

Filter options

Keywords:
TRAINING

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

SPEECH (4)
SPEECH RECOGNITION (4)
ACOUSTICS (3)
HIDDEN MARKOV MODELS (3)
ACOUSTIC SIGNAL PROCESSING (2)
COMPUTATIONAL MODELING (2)
COMPUTER ARCHITECTURE (2)
DATA MODELS (2)
GAUSSIAN PROCESSES (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
NATURAL LANGUAGE PROCESSING (2)
SUBSPACE GAUSSIAN MIXTURE MODEL (2)
TRAINING DATA (2)
ACOUSTIC MODELING (1)
ACOUSTIC MODELLING (1)
ADAPTATION MODEL (1)
ADAPTATION MODELS (1)
ARTIFICIAL INTELLIGENCE RESEARCH (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTOMATIC LEXICON LEARNING TECHNIQUE (1)
BELIEF NETWORKS (1)
BIOLOGICAL ORGANS (1)
BOOTSTRAPPING (1)
BRAIN MODELING (1)
BREAST CANCER (1)
CANCER (1)
CLUSTERING ALGORITHMS (1)
CNN (1)
COMPUTER AIDED DETECTION (1)
CONVOLUTIONAL NEURAL NETWORKS (1)
DBN (1)
DEEP BELIEF NETWORKS (1)
DEEP MACHINE LEARNING APPROACH (1)
DEEP-LAYERED CLUSTERING (1)
DICTIONARIES (1)
FEATURE EXTRACTION (1)
FEED-FORWARD NEURAL NETWORK (1)
FEEDFORWARD NEURAL NETS (1)
GAUSSIAN MIXTURE MODELS (1)
IMAGE CLASSIFICATION (1)
IMAGE FEATURES (1)
IMAGE SEGMENTATION (1)
LARGE VOCABULARY SPEECH RECOGNITION (1)
LEXICON LEARNING (1)
LVCSR (1)
MACHINE LEARNING (1)
MAMMOGRAPHY (1)
MASSES (1)
MATHEMATICAL MODEL (1)
MEASUREMENT (1)
MEDICAL IMAGE PROCESSING (1)
MICROCALCIFICATIONS (1)
MIXTURE WEIGHTS (1)
MULTILINGUAL ACOUSTIC MODELING (1)
MULTILINGUAL SPEECH RECOGNITION (1)
NEURAL NETS (1)
PATTERN CLUSTERING (1)
PER-IMAGE PATCH SENSITIVITY (1)
PHONETIC LANGUAGE (1)
PHONETIC STATES (1)
ROBUSTNESS (1)
SPECIFICITY (1)
SPEECH RECOGNITION SYSTEMS (1)
SUBSPACE METHODS (1)
TOTAL PARAMETER SPACE (1)
UNSUPERVISED CLUSTERING (1)
UNSUPERVISED LEARNING (1)
VECTORS (1)
VOCABULARY (1)
more

INFONA - science communication portal

Search results for: C. Rose

Dealing with acoustic mismatch for training multilingual subspace Gaussian mixture models for speech recognition

Applying deep-layered clustering to mammography image analytics

Approaches to automatic lexicon learning with limited training examples

Subspace Gaussian Mixture Models for speech recognition

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier]

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options