Search results for: Yanling Li

Items from 1 to 6 out of 6 results

chapter

Data Imbalance Problem in Text Classification

Yanling Li, Guoshe Sun, Yehang Zhu

2010 Third International Symposium on Information Processing > 301 - 305

2010 Third International Symposium on Information Processing (ISIP 2010)

Aimming at the ever-present problem of imbalanced data in text classification, the authors study on several forms of imbalanced data, such as text number, class size, subclass and class fold. Some useful conclusions are gotten from a series of correlative experiments: first, when the text of two class is almost the same number, the difference of word number become major factor to affect the accuracy...

chapter

Method for feature word weight calculating

Yanling Li, Jing Yuan, Xia Ye

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 1 > 309 - 312

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

Automatic text categorization has been one of the hotspots in the information processing field. To aim at the important impact of feature weight calculating on text classification accuracy, first, the relationship between text representation model and feature weight calculating is studied, and the existed methods of feature weight calculating are analyzed, then the common idea of feature weighting...

chapter

Threshold Determining Method for Feature Selection

Yanling Li, Li Song

2009 Second International Symposium on Electronic Commerce and Security > 2 > 273 - 277

Second International Symposium on Electronic Commerce and Security, ISECS 2009

Feature selection is a key step in text categorization, its results has direct influence on the classification accuracy. Evaluation function is usually adopted in feature selection method to calculate the value of feature words,and the feature words which assessed value is higher than setted threshold are maintained as the final feature subset.So the threshold is the important factors of feature selection...

chapter

Text Classificationg for Imbalanced Data Sets

Yanling Li, Yehang Zhu, Ping Yang

2008 International Symposium on Information Science and Engineering > 2 > 778 - 781

2008 International Symposium on Information Science and Engineering (ISISE)

Imbalanced data set has caused a significant drawback of the classification performance attainable by most normal machine learning algorithm. However, the samples are often imbalanced. Therefore, how to reduce the effects of uneven distribution of training sets on text classification performance is a great challenge for machine learning on imbalanced data sets. Currently, the study on imbalaced data...

chapter

Feature Selection Method of Text Tendency Classification

Yanling Li, Guanzhong Dai, Gang Li

2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery > 2 > 34 - 37

2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Recently, automatic text categorization has made rapid progress and been one of the hotspots in the information processing field. Text tendency classification is one type of text categorization, which has very important applications in information retrievals bad information identification and filtering , content security management and analysis of public opinion tendency. To aim at the important influence...

article

A high-performance extraction method for public opinion on internet

Yanling Li, Guanzhong Dai, Yehang Zhu, Sen Qin

Wuhan University Journal of Natural Sciences > 2007 > 12 > 5 > 902-906

Aiming at the importance of the analysis for public opinion on Internet, the authors propose a high-performance extraction method for public opinion. In this method, the space model for classification is adopted to describe the relationship between words and categories. The combined feature selection method is used to remove noisy words from the original feature space effectively. Then the category...

Filter options

Keywords:
TEXT CATEGORIZATION

Publication date

Set your own date range

Content availability

Available (5)
None (1)

Publication type

book (5)
article (1)

Keywords

Data set

ieee (5)
Springer (1)

INFONA - science communication portal

Search results for: Yanling Li

Data Imbalance Problem in Text Classification

Method for feature word weight calculating

Threshold Determining Method for Feature Selection

Text Classificationg for Imbalanced Data Sets

Feature Selection Method of Text Tendency Classification

A high-performance extraction method for public opinion on internet

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options