Search results for: Zheng Chen

Items from 1 to 6 out of 6 results

chapter

Learning the Latent Semantic Space for Ranking in Text Retrieval

Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen

2008 Eighth IEEE International Conference on Data Mining > 1115 - 1120

ICDM 2008. Eighth IEEE International Conference on Data Mining

Subspace learning techniques for text analysis, such as latent semantic indexing (LSI), have been widely studied in the past decade. However, to the best of our knowledge, no previous study has leveraged the rank information for subspace learning in ranking tasks. In this paper, we propose a novel algorithm, called learning latent semantics for ranking (LLSR), to seek the optimal latent semantic space tailored...

chapter

TOFA: Trace Oriented Feature Analysis in Text Categorization

J. Yan, Ning Liu, Qiang Yang, Weiguo Fan, more

2008 Eighth IEEE International Conference on Data Mining > 668 - 677

ICDM 2008. Eighth IEEE International Conference on Data Mining

Dimension reduction for large-scale text data is attracting much attention lately due to the rapid growth of the World Wide Web. We can consider dimension reduction algorithms in two categories: feature extraction and feature selection. An important problem remains: it has been difficult to integrate these two algorithm categories into a single framework, making it difficult to reap the benefit of both...

chapter

Improving Text Classification by Using Encyclopedia Knowledge

Pu Wang, Jian Hu, Hua-Jun Zeng, Lijun Chen, more

Seventh IEEE International Conference on Data Mining (ICDM 2007) > 332 - 341

2007 7th IEEE International Conference on Data Mining (ICDM '07)

The exponential growth of text documents available on the Internet has created an urgent need for accurate, fast, and general purpose text classification algorithms. However, the "bag of words" representation used for these classification methods is often unsatisfactory as it ignores relationships between important terms that do not co-occur literally. In order to deal with this problem,...

chapter

Document Transformation for Multi-label Feature Selection in Text Categorization

Weizhu Chen, Jun Yan, Benyu Zhang, Zheng Chen, more

Seventh IEEE International Conference on Data Mining (ICDM 2007) > 451 - 456

2007 7th IEEE International Conference on Data Mining (ICDM '07)

Feature selection on multi-label documents for automatic text categorization is an under-explored research area. This paper presents a systematic document transformation framework, whereby the multi-label documents are transformed into single-label documents before applying standard feature selection algorithms, to solve the multi-label feature selection problem. Under this framework, we undertake...

chapter

Local Word Bag Model for Text Categorization

Wen Pu, Ning Liu, Shuicheng Yan, Jun Han, more

Seventh IEEE International Conference on Data Mining (ICDM 2007) > 625 - 630

2007 7th IEEE International Conference on Data Mining (ICDM '07)

Many text processing applications adopt the bag of words (BOW) model representation of documents, in which each document is represented as a vector of weighted terms or n-grams, and the cosine distance between two vectors is used as the similarity measure. Despite its great success in information retrieval and text categorization, the conventional BOW model ignores the detailed local text...

chapter

Diverse Topic Phrase Extraction through Latent Semantic Analysis

Jilin Chen, Jun Yan, B. Zhang, Qiang Yang, more

Sixth International Conference on Data Mining (ICDM'06) > 834 - 838

Sixth International Conference on Data Mining (ICDM'06)

We propose a novel algorithm for extracting diverse topic phrases in order to provide a summary for large corpora. Previous works often ignore the importance of diversity and thus extract phrases clustered around a few hot topics while failing to cover other less obvious but important topics. We solve this problem through document re-weighting and phrase diversification by using latent semantic analysis (LSA)...
