Search results for: Hung-Shin Lee

Items from 1 to 10 out of 10 results

chapter

Discriminative autoencoders for speaker verification

Hung-Shin Lee, Yu-Ding Lu, Chin-Cheng Hsu, Yu Tsao, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5375 - 5379

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper presents a learning and scoring framework based on neural networks for speaker verification. The framework employs an autoencoder as its primary structure while three factors are jointly considered in the objective function for speaker discrimination. The first one, relating to the sample reconstruction error, makes the structure essentially a generative model, which benefits to learn most...

chapter

Speaker verification using kernel-based binary classifiers with binary operation derived features

Hung-Shin Lee, Yu Tso, Yun-Fan Chang, Hsin-Min Wang, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1660 - 1664

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the “within-speaker” group or the “between-speaker” group...

chapter

I-vector based language modeling for spoken document retrieval

Kuan-Yu Chen, Hung-Shin Lee, Hsin-Min Wang, Berlin Chen, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7083 - 7088

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. The i-vector based framework has been proposed and introduced to language identification (LID) and speaker recognition (SR) tasks recently. The major contribution of the i-vector framework is to...

chapter

Subspace-based phonotactic language recognition using multivariate dynamic linear models

Hung-Shin Lee, Yu-Chin Shih, Hsin-Min Wang, Shyh-Kang Jeng

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 6870 - 6874

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Phonotactics, dealing with permissible phone patterns and their frequencies of occurrence in a specific language, is acknowledged to be related to spoken language recognition (SLR). With the assistance of phone recognizers, each speech utterance can be decoded into an ordered sequence of phone vectors filled with likelihood scores contributed by all possible phone models. In this paper, we propose...

chapter

Exploiting semantic associative information in topic modeling

Meng-Sung Wu, Hung-Shin Lee, Hsin-Min Wang

2010 IEEE Spoken Language Technology Workshop > 384 - 388

2010 IEEE Spoken Language Technology Workshop (SLT 2010)

Topic modeling has been widely applied in a variety of text modeling tasks as well as in speech recognition systems for effectively capturing the semantic and statistic information in documents or speech utterances. Most topic models rely on the bag-of-words assumption that results in learned latent topics composed of lists of individual words. Unfortunately, these words may convey topical information...

chapter

A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification

Hung-Shin Lee, Hsin-Min Wang, Berlin Chen

2010 20th International Conference on Pattern Recognition > 690 - 693

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper presents a novel discriminative feature transformation, named full-rank generalized likelihood ratio discriminant analysis (fGLRDA), on the grounds of the likelihood ratio test (LRT). fGLRDA attempts to seek a feature space, which is linearly isomorphic to the original n-dimensional feature space and is characterized by a full-rank (n×n) transformation matrix, under the assumption that...

chapter

Generalized likelihood ratio discriminant analysis

Hung-Shin Lee, B. Chen

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 158 - 163

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

In the past several decades, classifier-independent front-end feature extraction, where the derivation of acoustic features is lightly associated with the back-end model training or classification, has been prominently used in various pattern recognition tasks, including automatic speech recognition (ASR). In this paper, we present a novel discriminative feature transformation, named generalized likelihood...

chapter

Empirical error rate minimization based linear discriminant analysis

Hung-Shin Lee, B. Chen

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 1801 - 1804

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Linear discriminant analysis (LDA) is designed to seek a linear transformation that projects a data set into a lower-dimensional feature space while retaining geometrical class separability. However, LDA cannot always guarantee better classification accuracy. One of the possible reasons lies in that its formulation is not directly associated with the classification error rate, so that it is not necessarily...

chapter

Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates

Hung-Shin Lee, B. Chen

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Linear discriminant analysis (LDA) is designed to seek a linear transformation that projects a data set into a lower-dimensional feature space for maximum class geometrical separability. LDA cannot always guarantee better classification accuracy, since its formulation is not in light of the properties of the classifiers, such as the automatic speech recognizer (ASR). In this paper, the relationship...

chapter

Training data selection for improving discriminative training of acoustic models

Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin, Hung-Shin Lee, more

2007 IEEE Workshop on Automatic Speech Recognition&Understanding (ASRU) > 284 - 289

2007 IEEE Workshop on Automatic Speech Recognition and Understanding

This paper considers training data selection for discriminative training of acoustic models for broadcast news speech recognition. Three novel data selection approaches were proposed. First, the average phone accuracy over all hypothesized word sequences in the word lattice of a training utterance was utilized for utterance-level data selection. Second, phone-level data selection based on the difference...

Filter options

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (5)
ERROR ANALYSIS (3)
FEATURE EXTRACTION (3)
HIDDEN MARKOV MODELS (3)
LINEAR DISCRIMINANT ANALYSIS (3)
DISCRIMINATIVE TRAINING (2)
DISTANCE MEASUREMENT (2)
I-VECTOR (2)
LIKELIHOOD RATIO TEST (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
PATTERN CLASSIFICATION (2)
POLYNOMIALS (2)
PROBABILISTIC LOGIC (2)
SPEAKER VERIFICATION (2)
SPEECH (2)
TRAINING (2)
ACCURACY (1)
ACOUSTIC MODEL (1)
ACOUSTIC MODELS (1)
ACOUSTICS (1)
ANALYTICAL MODELS (1)
AUTOENCODERS (1)
AUTOMATIC SPEECH RECOGNITION (1)
AUTOMATIC SPEECH RECOGNIZER (1)
BACK END MODEL TRAINING (1)
BAG-OF-WORDS ASSUMPTION (1)
BIOLOGICAL SYSTEM MODELING (1)
BROADCAST NEWS SPEECH RECOGNITION (1)
CLASSIFICATION ACCURACY (1)
CLASSIFICATION ERROR RATE (1)
CLASSIFIER INDEPENDENT FRONT END FEATURE EXTRACTION (1)
COVARIANCE MATRIX (1)
DATA MODELS (1)
DATA SELECTION (1)
DISCRIMINATIVE FEATURE TRANSFORMATION (1)
DISCRIMINATIVE LINEAR FEATURE TRANSFORMATION (1)
DISPERSION (1)
DNN (1)
EMPIRICAL ERROR RATE MINIMIZATION (1)
EMPIRICAL PAIRWISE CLASSIFICATION ERROR RATES (1)
ENTROPY (1)
ERBIUM (1)
EXPLOITING SEMANTIC ASSOCIATIVE INFORMATION (1)
FRAME-LEVEL DATA SELECTION (1)
FULL-RANK GENERALIZED LIKELIHOOD RATIO DISCRIMINANT ANALYSIS (1)
GAUSSIAN POSTERIOR PROBABILITY (1)
GAUSSIAN PROCESSES (1)
GENERALIZED LIKELIHOOD RATIO DISCRIMINANT ANALYSIS (1)
GEOMETRICAL CLASS SEPARABILITY (1)
GEOMETRY (1)
HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS (1)
HETEROSCEDASTIC LINEAR FEATURE TRANSFORMATION (1)
HYPOTHESIZED WORD SEQUENCE (1)
INDUCTIVE (1)
INFORMATION RETRIEVAL (1)
KL-DIVERGENCE METRIC (1)
LANGUAGE MODEL (1)
LANGUAGE MODELING (1)
LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (1)
LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION TASK (1)
LIGHTWEIGHT SOLVABILITY (1)
LINEAR PROGRAMMING (1)
LINEAR TRANSFORMATION (1)
MAHALANOBIS DISTANCE (1)
MAHALANOBIS DISTANCES (1)
MATHEMATICAL MODEL (1)
MATRIX ALGEBRA (1)
MATRIX DECOMPOSITION (1)
MAXIMUM CLASS GEOMETRICAL SEPARABILITY (1)
MULTICLASS CLASSIFICATION (1)
NATURAL LANGUAGE PROCESSING (1)
NEODYMIUM (1)
NEURAL NETWORKS (1)
NIST (1)
NORMALIZED FRAME-LEVEL ENTROPY (1)
PAIRWISE EMPIRICAL CLASSIFICATION ACCURACY (1)
PATTERN RECOGNITION (1)
PATTERN RECOGNITION TASKS (1)
PHONE-LEVEL DATA SELECTION (1)
PHONOTACTIC LANGUAGE RECOGNITION (1)
PLDA (1)
PROBABILITY (1)
SEMANTIC ASSOCIATION (1)
SEMANTIC INFORMATION (1)
SEMANTIC KNOWLEDGE (1)
SEMANTICS (1)
SPEAKER IDENTIFICATION (1)
SPEECH FEATURES (1)
SPEECH RECOGNITION SYSTEMS (1)
SPEECH UTTERANCES (1)
SPOKEN DOCUMENT RETRIEVAL (1)
SQUARED EUCLIDEAN DISTANCE (1)
STATISTIC INFORMATION (1)
STATISTICAL ANALYSIS (1)
STATISTICAL HYPOTHESIS TESTING (1)
STATISTICAL TESTING (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
SVM (1)
TEXT MODELING (1)
TOPIC MODEL (1)
more

INFONA - science communication portal

Search results for: Hung-Shin Lee

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options