Search results

Items from 1 to 6 out of 6 results

chapter

Text Classification Using Semi-supervised Clustering

Wen Zhang, T. Yoshida, Xijin Tang

2009 International Conference on Business Intelligence and Financial Engineering > 197 - 200

2009 International Conference on Business Intelligence and Financial Engineering (BIFE)

In this paper, mixture models are used to classify documents. The basic assumption for the documents in a collection is that each class is composed of a number of mixture components. By identifying the components in the document collection, the classes of documents can thereby be identified from each other. A semi-supervised clustering method is proposed to identify the components (clusters), and...

chapter

Research and Design of Internet Public Opinion Analysis System

Quanlong Guan, Saizhi Ye, Guoxiang Yao, Huanming Zhang, more

2009 IITA International Conference on Services Science, Management and Engineering > 173 - 177

2009 IITA International Conference on Services Science, Management and Engineering (SSME)

Internet is becoming a spreading platform for the public opinion. It is important to grasp the Internet public opinion in time and understand the trends of their opinion correctly. Text classification plays a fundamental role in a number of information management and retrieval tasks. But Web-page classification is much more difficult than pure-text classification due to a large variety of noisy information...

chapter

Research on the Application of Improved Text Classific Algorithm in Intelligent Learning Platform

Rurui Zhou, You-fu Du, Ming Zhao

2008 Second International Conference on Genetic and Evolutionary Computing > 442 - 446

2008 Second International Conference on Genetic and Evolutionary Computing (WGEC)

With the rapid developing of the network information, it seems to be quite important to provide a more reasonable text classification algorithm for learners. In this paper,we adopt a sensitivity method to modify the characteristic weight in the distance formula and put up with a cutting method of training sample database based on CURE algorithm and Tabu algorithm; then adopt CURE cluster algorithm...

chapter

An Incremental Chinese Text Classification Algorithm Based on Quick Clustering

Houfeng Ma, Xinghua Fan, Ji Chen

2008 International Symposiums on Information Processing > 308 - 312

2008 International Symposiums on Information Processing - ISIP 2008; 2008 International Pacific Workshop on Web Mining and Web-Based Application - WMWA 2008

Most conventional incremental learning algorithms perform incremental learning by selecting only one optimized text sample each time, which neither considers the relationship between texts in the unlabeled text set, nor improves incremental learning efficiency. In addition, because of the shortage of the classifierpsilas information storage, the selected optimized text is easily classified incorrectly...

chapter

Authorship attribution

I.N. Bozkurt, O. Baghoglu, E. Uyar

2007 22nd international symposium on computer and information sciences > 1 - 5

2007 22nd International Symposium on Computer and Information Sciences - ISCIS '07

Authorship attribution is the process of determining the writer of a document. In literature, there are lots of classification techniques conducted in this process. In this paper we explore information retrieval methods such as tf-Idf structure with support vector machines, parametric and nonparametric methods with supervised and unsupervised (clustering) classification techniques in authorship attribution...

chapter

An Improved Genetic Algorithm for Text Feature Selection

Wei Zhao, Yafei Wang

2010 International Conference on Intelligent Computing and Cognitive Informatics > 7 - 10

2010 International Conference on Intelligent Computing and Cognitive Informatics (ICICCI 2010)

High-dimensional feature space affects the quality and efficiency of text categorization. This paper investigates an improved genetic algorithm that how to help select relevant features in text classification. We follow the so-called "region growing" method to initialize the population, and uses k-means algorithm to selection operation to control the scope of the search, ensure the validity...

Filter options

Keywords:
TEXT CATEGORIZATION
PATTERN CLUSTERING

Publication date

Set your own date range

Content availability

Available (5)
None (1)

Keywords

CLASSIFICATION ALGORITHMS (5)
CLUSTERING ALGORITHMS (5)
ALGORITHM DESIGN AND ANALYSIS (3)
PATTERN CLASSIFICATION (3)
ACCURACY (2)
CLASSIFICATION (2)
DATABASES (2)
FEATURE EXTRACTION (2)
HEURISTIC ALGORITHMS (2)
INFORMATION RETRIEVAL (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
SUPPORT VECTOR MACHINE (2)
SUPPORT VECTOR MACHINES (2)
TEXT CLASSIFICATION ALGORITHM (2)
TRAINING (2)
AFFINITY PROPAGATION (1)
AUTHORSHIP ATTRIBUTION (1)
BAYES (1)
BAYESIAN CLASSIFIER (1)
CHINESE TEXT CLASSIFICATION ALGORITHM (1)
CLASSIFIER FEATURE REATIONSHIP (1)
CLUSTERING TECHNIQUE (1)
CONVENTIONAL INCREMENTAL LEARNING ALGORITHM (1)
CURE CLUSTER ALGORITHM (1)
DATABASE (1)
DATABASE MANAGEMENT SYSTEMS (1)
DISTANCE MEASUREMENT (1)
DOCUMENT WRITER DETERMINATION (1)
E-LEARNING (1)
ENCODING (1)
EXPECTATION MAXIMIZATION (1)
FEATURE REDUCTION (1)
FEATURE SELECTION (1)
GENETIC ALGORITHM (1)
GENETIC ALGORITHMS (1)
GENETICS (1)
IMPROVED GENETIC ALGORITHM (1)
INCREMENTAL LEARNING (1)
INFORMATION MANAGEMENT (1)
INFORMATION RETRIEVAL METHOD (1)
INFORMATION STORAGE CLASSIFIER (1)
INTELLIGENT LEARNING PLATFORM (1)
INTERNET (1)
INTERNET PUBLIC OPINION (1)
INTERNET PUBLIC OPINION ANALYSIS SYSTEM (1)
IPO (1)
K-MEANS ALGORITHM (1)
KERNEL (1)
LINEAR KERNEL (1)
MACHINE LEARNING (1)
MIXTURE MODEL (1)
NONPARAMETRIC METHOD (1)
PARAMETRIC NONPARAMETRIC CLASSIFIERS (1)
PROBABILITY (1)
REASONABLE LEARNING SEQUENCE (1)
REGION GROWING METHOD (1)
SEARCH PROBLEMS (1)
SEMISUPERVISED CLUSTERING METHOD (1)
SENSITIVITY METHOD (1)
TABU ALGORITHM (1)
TEXT CLUSTERING (1)
TEXT CLUSTERING ALGORITHM (1)
TEXT FEATURE SELECTION (1)
UNSUPERVISED CLASSIFICATION TECHNIQUE (1)
VECTOR SPACE MODEL (1)
WEB PAGES (1)
WEB-PAGE CLASSIFICATION (1)
WEB-PAGE SUMMARIZATION (1)
more

INFONA - science communication portal

Search results

Text Classification Using Semi-supervised Clustering

Research and Design of Internet Public Opinion Analysis System

Research on the Application of Improved Text Classific Algorithm in Intelligent Learning Platform

An Incremental Chinese Text Classification Algorithm Based on Quick Clustering

Authorship attribution

An Improved Genetic Algorithm for Text Feature Selection

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options