We present a topic mixture language modeling approach making use of the soft classification notion of topic models. Given a text document set, we first perform document soft classification by applying a topic modeling process such as probabilistic latent semantic analysis (PLSA) or latent Dirichlet allocation (LDA) to the dataset. Then we can derive topic-specific n-gram counts from the classified...
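The core step described above — turning soft topic assignments into topic-specific n-gram counts — can be sketched as follows. This is an illustrative reading, not the paper's implementation; the function name and the assumption that per-document topic posteriors are already available (e.g. from an LDA/PLSA E-step) are mine.

```python
from collections import defaultdict

def topic_ngram_counts(docs, topic_posteriors, n=2):
    """Accumulate topic-specific n-gram counts by weighting each
    document's n-grams with its soft topic assignment P(topic | doc).

    docs              -- list of token lists
    topic_posteriors  -- topic_posteriors[d][k] = P(topic k | doc d),
                         each row summing to 1 (assumed given)
    """
    num_topics = len(topic_posteriors[0])
    counts = [defaultdict(float) for _ in range(num_topics)]
    for tokens, posterior in zip(docs, topic_posteriors):
        for i in range(len(tokens) - n + 1):
            gram = tuple(tokens[i:i + n])
            for k in range(num_topics):
                counts[k][gram] += posterior[k]
    return counts

# Toy example: two documents, two topics.
docs = [["the", "cat", "sat"], ["stock", "prices", "rose"]]
posteriors = [[0.9, 0.1], [0.2, 0.8]]  # e.g. from an LDA/PLSA E-step
counts = topic_ngram_counts(docs, posteriors)
```

Each topic then gets its own fractional count table, from which per-topic n-gram models can be estimated with ordinary smoothing.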
We present a semi-supervised learning (SSL) method for building domain-specific language models (LMs) from general-domain data using probabilistic latent semantic analysis (PLSA). The proposed technique first performs topic decomposition (TD) on the combined dataset of domain-specific and general-domain data. It then derives the latent topic distribution of the domain of interest, and derives domain-specific...
A natural language interface is an important research topic in the area of natural language processing (NLP). Natural language interaction could be the most natural and efficient way to communicate with a robot. In order to build a speech-enabled natural language interface for robots, our research goal is to study the problems in this area and develop technologies that can potentially improve human-robot interaction. In...
We present a semi-supervised learning method for building domain-specific language models (LMs) from general-domain data. This method aims to use a small amount of domain-specific data as seeds to tap the domain-specific resources residing in a larger amount of general-domain data, with the help of topic modeling technologies. The proposed algorithm first performs topic decomposition (TD) on the combined...
In this paper, we propose a method to extend the use of latent topics to higher-order n-gram models. In training, the parameters of higher-order n-gram models are estimated using discounted average counts derived from the application of probabilistic latent semantic analysis (PLSA) models to n-gram counts in the training corpus. In decoding, a simple yet efficient topic prediction method is introduced...
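At decode time, a topic-mixture n-gram probability is typically formed by interpolating the topic-specific models with the predicted topic weights. A minimal sketch of that interpolation step, assuming the per-topic probabilities P_k(w|h) and the predicted weights gamma_k are already available (both names are mine, not from the abstract):

```python
def mixture_prob(topic_probs, topic_weights):
    """Topic-mixture interpolation: P(w|h) = sum_k gamma_k * P_k(w|h).

    topic_probs   -- P_k(w|h) for each topic k (assumed precomputed)
    topic_weights -- predicted topic weights gamma_k, summing to 1
    """
    return sum(g * p for g, p in zip(topic_weights, topic_probs))

# Two topics: the history strongly favors topic 0.
p = mixture_prob(topic_probs=[0.2, 0.5], topic_weights=[0.7, 0.3])
```

The weights gamma_k would come from whatever topic prediction method the decoder uses on the recent history.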
N-gram language model adaptation is typically formulated using deleted interpolation under the maximum likelihood estimation framework. This paper proposes a Bayesian learning framework for n-gram statistical language model training and adaptation. By introducing a Dirichlet conjugate prior on the n-gram parameters, we formulate the deleted interpolation under the maximum a posteriori criterion with...