Search results for: Jun Guo

Items from 1 to 13 out of 13 results

chapter

Maximizing the reliability of two-state automaton for burst feature detection in news streams

Gang Du, Jun Guo, Weiran Xu, Zhen Yang

2010 IEEE International Conference on Progress in Informatics and Computing > 1 > 229 - 233

2010 International Conference on Progress in Informatics and Computing (PIC 2010)

The capture of temporal dynamics of news streams has drawn increasing attentions in recent sequential data mining works. Most of them are based on the intuition that a “burst” of a topic is signaled by a growth of relevant words in a high intensity during a period of time. Such “burst features” can be efficiently identified by Kleinberg's two-state automaton model. The resolution is an important parameter...

chapter

An Efficient Algorithm of Hot Events Detection in Text Streams

Junliang Bai, Jun Guo, Guang Chen, Weiran Xu, more

2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery > 321 - 326

2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC)

Hot events detection in text streams has drawn increasing attention in recent sequential data mining works. Different from traditional TDT task which find all the real events' cluster, hot events detection only identify hot events concerned by public. This paper proposes a novel approach to identify those events based on burst terms, terms co-occurrence and generative probabilistic model. Experiments...

chapter

Discriminative LDA

Weiran Xu, Mingzhi Dong, YunHang Lin, Jun Guo, more

2010 2nd IEEE InternationalConference on Network Infrastructure and Digital Content > 287 - 292

2010 2nd IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2010)

This paper is aim to improve the discrimination capability of LDA model through unsupervised feature selection. Experimental results show that if the interference of general word and general topic can be removed, the discrimination capability of LDA model will be increased. The key problem is how to find supervised information to evaluate features. The LDA topics are assumed reasonable. Therefore,...

chapter

A Multiclass SVM Method via Probabilistic Error-Correcting Output Codes

Zhanyi Wang, Weiran Xu, Jiani Hu, Jun Guo

2010 International Conference on Internet Technology and Applications > 1 - 4

2010 International Conference on Internet Technology and Applications (iTAP 2010)

Error-correcting output code (ECOC) is an effective approach to solve the problem of multiclass SVM. In this paper, a probabilistic approach that is based on ECOC is proposed. In the training stage, a coding scheme is predefined, and a special model is trained by samples. In the classification stage, besides the labels from SVM as usual, posterior probabilities of labels are also calculated. They...

chapter

Exploiting Combined Multi-level Model for Document Sentiment Analysis

Si Li, Hao Zhang, Weiran Xu, Guang Chen, more

2010 20th International Conference on Pattern Recognition > 4141 - 4144

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper focuses on the task of text sentiment analysis in hybrid online articles and web pages. Traditional approaches of text sentiment analysis typically work at a particular level, such as phrase, sentence or document level, which might not be suitable for the documents with too few or too many words. Considering every level analysis has its own advantages, we expect that a combination model...

chapter

Cross-Document Coreference Resolution Based on Automatic Text Summary

Sanyuan Gao, Si Li, Weiran Xu, Jun Guo

2010 Third International Conference on Knowledge Discovery and Data Mining > 306 - 309

2010 3rd International Conference on Knowledge Discovery and Data Mining (WKDD 2010)

Cross-document coreference resolution plays an import part in the filed of natural language processing (NLP). It captures the ability of gathering documents for information about a certain entity. Most previous algorithms identify the underlying entity of a given document depending on the original text, which is unreliable if the original text contains multiple parts of different themes. In this paper,...

chapter

Semi-supervised Chinese compound word extraction based on HMM

Hui He, Bo Chen, Jun Guo

2008 7th World Congress on Intelligent Control and Automation > 2077 - 2081

2008 7th World Congress on Intelligent Control and Automation

In natural languages, compound words play an important role and their automatically extraction is very helpful in information retrieval, information extraction and text classification. We introduce a semi-supervised Chinese compound extraction approach based on HMM using bootstrapping in this paper. First, we define a set of tags BEMI {beginning, end, middle, independence}, which means the position...

chapter

Short Text Feature Extraction and Clustering for Web Topic Mining

Hui He, Bo Chen, Weiran Xu, Jun Guo

Third International Conference on Semantics, Knowledge and Grid (SKG 2007) > 382 - 385

2007 3rd International Conference on Semantics, Knowledge and Grid

This paper is to introduce an algorithm to cluster Chinese short texts for mining web topics based on Chinese chunks. Aiming at the characteristics of Chinese short texts, the algorithm employs N-gram feature extraction to capture Chinese chunks from texts, which reflect the text semantic structure and character dependency. Then RPCL algorithm is applied to realizing text clustering with high precision,...

chapter

Learning Locality Discriminating Indexing for Text Categorization

Jiani Hu, Weihong Deng, Jun Guo, Weiran Xu

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) > 3 > 239 - 242

2007 International Conference on Fuzzy Systems and Knowledge Discovery

This paper introduces a locality discriminating indexing (LDI) algorithm for text categorization. The LDI algorithm offers a manifold way of discriminant analysis. Based on the hypothesis that samples from different classes reside in class-specific manifold structures, the algorithm depicts the manifold structures by a nearest-native graph and a invader graphs. And a new locality discriminant criterion...

chapter

An Expert Experience Probabilistic Model for Enterprise Expert Finding

Zhao Ru, Weiran Xu, Jun Guo

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) > 1 > 477 - 481

2007 International Conference on Fuzzy Systems and Knowledge Discovery

Finding experts accurately and automatically is becoming difficult especially in a large organization. This paper presents a probabilistic model which applies language modeling techniques to find experts in enterprise corpora. The expertise of each candidate expert is modeled through the associated experience. We employ a qualification of experience, and validate this qualification as a measure of...

chapter

POC-NLW Template Based Tagging Method for Chinese Word Segmentation

Bo Chen, Weiran Xu, Hui He, Jun Guo

2006 International Conference on Computational Intelligence and Security > 2 > 1423 - 1428

2006 International Conference on Computational Intelligence and Security

In Chinese word segmentation, disambiguation and unknown words identification are becoming the two key issues. In this paper, a two-stage strategy based system is constructed to deal with these problems. First, an n-gram based model is applied to do the basic segmentation as well as disambiguation in some extent. Then, in the second stage, a language tagging template, named POC-NLW, is adopted to...

chapter

Application of the Character-Level Statistical Method in Text Categorization

Zhen Yang, Xiangfei Nie, Weiran Xu, Jun Guo

2006 International Conference on Computational Intelligence and Security > 2 > 1412 - 1417

2006 International Conference on Computational Intelligence and Security

It is generally thought that semantic and grammatical information was very significant to better understanding and processing of text. But in simple text categorization task, absence of this information does not always lead to the degradation of classifier performance. In this paper, we discuss the application of the character-level statistical method in text categorization, which extract character-level...

chapter

Automatic text categorization based on angle distribution

Tao Liu, Jun Guo

2005 International Conference on Machine Learning and Cybernetics > 6 > 3797 - 3801 Vol. 6

Proceedings of 2005 International Conference on Machine Learning and Cybernetics

In order to improve the performance of Chinese text categorization, a new Chinese text categorization method based on angle distribution is presented. The new method describes the text with a more precise model and proposed a new categorization algorithm by employing angle distribution. Simulation results on open Chinese text collection show that the precision and recall of most classes have been...

Filter options

Keywords:
TEXT ANALYSIS

Publication date

Set your own date range

Keywords

DATA MINING (5)
INFORMATION RETRIEVAL (4)
NATURAL LANGUAGE PROCESSING (4)
TEXT CATEGORIZATION (4)
ANALYTICAL MODELS (3)
FEATURE EXTRACTION (3)
HIDDEN MARKOV MODELS (3)
HEURISTIC ALGORITHMS (2)
INDEXING (2)
PATTERN CLUSTERING (2)
PROBABILITY (2)
SEMANTICS (2)
SYNTACTICS (2)
WEB SITES (2)
ALGORITHM (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANGLE DISTRIBUTION (1)
APPROXIMATION ALGORITHMS (1)
AUTOMATA THEORY (1)
AUTOMATIC CHINESE TEXT CATEGORIZATION METHOD (1)
AUTOMATIC TEXT SUMMARY (1)
AUTOMATON (1)
BAYES METHODS (1)
BAYESIAN THEORY (1)
BEMI TAGGING ALGORITHM (1)
BOOTSTRAPPING (1)
BRIDGES (1)
BURST FEATURE DETECTION (1)
BURST TERMS (1)
BUSINESS DATA PROCESSING (1)
CHARACTER DEPENDENCY (1)
CHARACTER SEQUENCE TAGGING (1)
CHARACTER-LEVEL FREQUENT PATTERN EXTRACTION (1)
CHARACTER-LEVEL STATISTICAL METHOD (1)
CHINESE INFORMATION PROCESSING (1)
CHINESE SHORT TEXT FEATURE CLUSTERING (1)
CHINESE WORD SEGMENTATION (1)
CLASSIFICATION ALGORITHMS (1)
CLUSTERING ALGORITHMS (1)
COMBINED MULTI-LEVEL MODEL (1)
COMPOUND WORD EXTRACTION (1)
COMPOUNDS (1)
COMPUTATIONAL LINGUISTICS (1)
COMPUTATIONAL MODELING (1)
CONDITIONAL RANDOM FIELDS (1)
CROSS-DOCUMENT COREFERENCE RESOLUTION (1)
CYCLONES (1)
DATA MODELS (1)
DISCRIMINANT ANALYSIS (1)
DISCRIMINATION CAPABILITY (1)
DOCUMENT MODELING (1)
DOCUMENT SENTIMENT ANALYSIS (1)
DOCUMENT-LEVEL (1)
EARTHQUAKES (1)
EFFICIENT ALGORITHM (1)
EIGENVALUES AND EIGENFUNCTIONS (1)
EM ALGORITHM (1)
ENCODING (1)
ENTERPRISE CORPORA (1)
ENTERPRISE EXPERT FINDING (1)
ENTROPY (1)
EQUATIONS (1)
ERROR CORRECTION CODES (1)
EVENT DETECTION (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
EXPERT EXPERIENCE PROBABILISTIC MODEL (1)
GENERALIZED EIGENVALUE PROBLEM (1)
GENERATIVE PROBABILISTIC MODEL (1)
GRAMMATICAL INFORMATION (1)
GRAPHS (1)
HELIUM (1)
HEURISTIC ALGORITHM (1)
HIDDEN MARKOV MODEL (1)
HMM (1)
HOBBS ALGORITHM (1)
HOT EVENTS DETECTION (1)
HYBRID ONLINE ARTICLES (1)
INDEXES (1)
INFORMATION EXTRACTION (1)
INFORMATION GAIN (1)
INFORMATION RESOURCES (1)
INFORMATIVE-INDICATIVE SUMMARY (1)
INTERFERENCE (1)
INTERNET (1)
INVADER GRAPH (1)
JOINING PROCESSES (1)
KEYWORD RECOMMENDATION SYSTEM (1)
KNOWLEDGE BASED SYSTEMS (1)
LANGUAGE MODELING TECHNIQUE (1)
LARGE ORGANIZATION (1)
LDA MODEL (1)
LEARNING (1)
LEFT-MIDDLE-RIGHT TEMPLATE (1)
LIKELIHOOD ESTIMATION (1)
LOCALITY DISCRIMINANT CRITERION (1)
LOCALITY DISCRIMINATING INDEXING ALGORITHM (1)
MACHINE LEARNING (1)
MATHEMATICAL MODEL (1)
MAXIMUM ENTROPY MODEL (1)
more

INFONA - science communication portal

Search results for: Jun Guo

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options