Search results

Items from 1 to 6 out of 6 results

chapter

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

Hrishikesh Bhaumik, Anirban Mukherjee, Siddhartha Bhattacharyya, Manojit Chattopadhyay

2014 International Conference on Computational Intelligence and Communication Networks > 530 - 535

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

This paper proposes a new approach for clustering English text documents, based on finding the pair wise correlation of documents in a given set of text documents. The correlation coefficient for each pair of documents is calculated on the basis of ranks given to the words in the documents. The ranking of the words occurring in a document is computed on the basis of weights of the words calculated...

article

Multilabel Text Categorization Based on Fuzzy Relevance Clustering

Shie-Jue Lee, Jung-Yi Jiang

IEEE Transactions on Fuzzy Systems > 2014 > 22 > 6 > 1457 - 1471

We propose a fuzzy based method for multilabel text classification in which a document can belong to one or more than one category. In text categorization, the number of the involved features is usually huge, causing the curse of the dimensionality problem. Besides, a category can be a nonconvex region, which is a union of several overlapping or disjoint subregions. An automatic classification system,...

chapter

Building clusters with distributed features for text classification using KNN

Mohammed Abdul Wajeed, T. Adilakshmi

2012 International Conference on Computer Communication and Informatics > 1 - 6

2012 International Conference on Computer Communication and Informatics (ICCCI)

Bulk data is generated in the era of Information Technology. If it is not stored in a properly systematic manner then the generated data cannot be reused. This is because navigation becomes if not impossible, certainly very difficult. So we classify the data before it is stored. Present paper explores the techniques to store the data in a supervised classification paradigm using distributed features...

chapter

Text categorization study case: Patents' application documents

Neide de Oliveira Gomes, Emmanuel Piceses Lopes Passos

2011 6th IEEE Conference on Industrial Electronics and Applications > 446 - 450

2011 6th IEEE Conference on Industrial Electronics and Applications (ICIEA)

This paper presents computational methods aiming to patent's text categorization in Portuguese language, involving techniques from machine learning and computational linguistics. The algorithm used was the k-Nearest Neighbor method (k-NN) modified which showed good results, although it requires much computational time in the training stage. For the pre-processing step, it was implemented, with modifications,...

chapter

Study on Key Technology for Topic Tracking

Shengdong Li, Xueqiang Lv, Hongwei Wang, Shuicai Shi

2010 Sixth International Conference on Semantics, Knowledge and Grids > 275 - 280

2010 Sixth International Conference on Semantics Knowledge and Grid (SKG 2010)

Text classification is the key technology for topic tracking, and vector space model (VSM) is one of the most simple and effective model for topics representation. On the basis of K-nearest neighbor (KNN) algorithm for text classification and support vector machines (SVM) algorithm for text classification, we have studied how they affect topic tracking. Then we get the variation law that they affect...

chapter

An effective term weighting method using random walk model for text classification

M.R. Islam, M.R. Islam

2008 11th International Conference on Computer and Information Technology > 411 - 414

2008 11th International Conference on Computer and Information Technology (ICCIT)

Text classification may be viewed as assigning texts in a predefined set of categories. However there are many digital documents that are not organized according to their contents. So it is difficult task to find relevant documents for a user. Automatic text classification problem can solve this problem. In this paper we introduce a new random walk term weighting method for improved text classification...

Filter options

Keywords:
TEXT CATEGORIZATION
EQUATIONS

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

CLASSIFICATION ALGORITHMS (4)
TRAINING (4)
VECTORS (3)
CLUSTERING (2)
INFORMATICS (2)
TEXT ANALYSIS (2)
ACCURACY (1)
ALGORITHM DESIGN AND ANALYSIS (1)
AUTOMATIC TEXT CLASSIFICATION (1)
CATEGORIZATION OF PATENTS' APPLICATIONS (1)
CLASSIFICATION (1)
CLASSIFICATION OF PATENT'S APPLICATIONS (1)
CLUSTERING ALGORITHMS (1)
COMPUTERS (1)
CORRELATION (1)
CORRELATION COEFFICIENT (1)
DAMPING (1)
DATABASES (1)
DIGITAL DOCUMENT (1)
DIMENSIONALITY REDUCTION (1)
DISTRIBUTED FEATURES (1)
FUZZY RELEVANCE (1)
GRAPH THEORY (1)
INFORMATION GAIN (1)
K-NEAREST NEIGHBOR ALGORITHM (1)
KNN (1)
KNN-CLASSIFIER (1)
KNOWLEDGE DISCOVERY IN TEXTS (1)
MATHEMATICAL MODEL (1)
MULTILABEL LEARNING (1)
PATENTS (1)
PATTERN CLASSIFICATION (1)
PRINCIPAL COMPONENT ANALYSIS (1)
PROTOTYPES (1)
RANDOM WALK MODEL (1)
ROCCHIO TEXT CLASSIFICATION ALGORITHM (1)
SEMANTIC RELATION (1)
SOFT-HARD CLUSTERS (1)
SUPPORT VECTOR MACHINE ALGORITHM (1)
SUPPORT VECTOR MACHINES (1)
SVM (1)
TDT EVALUATION (1)
TDT EVALUATION METHOD (1)
TERM WEIGHTING METHOD (1)
TESTING (1)
TOPIC REPRESENTATION (1)
TOPIC TRACKING (1)
TOPIC TRACKING KEY TECHNOLOGY (1)
TRAINING DATA (1)
TRANSFORMS (1)
VECTOR SPACE MODEL (1)
WEIGHT MEASUREMENT (1)
more

INFONA - science communication portal

Search results

Towards Reliable Clustering of English Text Documents Using Correlation Coefficient

Multilabel Text Categorization Based on Fuzzy Relevance Clustering

Building clusters with distributed features for text classification using KNN

Text categorization study case: Patents' application documents

Study on Key Technology for Topic Tracking

An effective term weighting method using random walk model for text classification

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options