Search results

Items from 1 to 7 out of 7 results

chapter

M2VSM: Extension of vector space model by introducing Meta keyword

Y. Takama, T. Ishibashi

2008 World Automation Congress > 1 - 6

2008 World Automation Congress

This paper proposes an extended vector space model (VSM), which is called M2VSM (meta keyword-based modified VSM). When conventional VSM is applied to document clustering, it is difficult to adjust the granularity of cluster in terms of topic. In order to solve the problem, M2VSM considers meta keywords such as

article

Applying text and data mining techniques to forecasting the trend of petitions filed to e-People

Jong Hwan Suh, Chung Hoon Park, Si Hyun Jeon

Expert Systems With Applications > 2010 > 37 > 10 > 7255-7268

propose the framework of applying text and data mining techniques not only to analyze a large number of petitions filed to e-People but also to predict the trend of petitions. In detail, we apply text mining techniques to unstructured data of petitions to elicit keywords from petitions and identify groups of petitions with

chapter

A Cluster-based Approach to Filtering Spam under Skewed Class Distributions

Wen-feng Hsiao, Te-ming Chang, Guo-hsin Hu

2007 40th Annual Hawaii International Conference on System Sciences (HICSS'7) > 53

Proceedings of the 40th Annual Hawaii International Conference on System Sciences

The purpose of this research is to propose an appropriate classification approach to improving the effectiveness of spam filtering on the issue of skewed class distributions. A clustering-based classifier is proposed to first cluster documents into several groups, and then an equal number of keywords are extracted

chapter

Text Categorization with Considering Temporal Patterns of Term Usages

H Abe, S Tsumoto

2010 IEEE International Conference on Data Mining Workshops > 800 - 807

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In document categorization method by using similarity measures based on word vectors, it is important to determine key words to characterize each document. However, conventional methods select the key words based on their frequency or/and particular importance index such as tf-idf. In this paper, we propose a method to characterize each document by using temporal clusters of technical term usages...

chapter

FMODC: Fuzzy guided multi-objective document clustering by GA

Annaluri Sreenivasa Rao, S. Ramakrishna

2016 2nd International Conference on Contemporary Computing and Informatics (IC3I) > 650 - 655

2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)

of the documents considered for context assessment contains the authors list, keywords list and list of document versioning time schedules. The experiments were conducted to assess the significance of the proposed model.

article

Exploration of a collection of documents in neuroscience and extraction of topics by clustering

Antoine Naud, Shiro Usui

Neural Networks > 2008 > 21 > 8 > 1205-1211

using multidimensional scaling. Based on the Vector Space Model, several Term Spaces were built on the basis of a set of terms extracted from the posters’ abstracts and titles, and a set of free keywords assigned to the posters by their authors. The ensuing Term Spaces were compared from the point of view of retrieving the

article

The Hidden Structure of Neuropsychology: Text Mining of the Journal Cortex: 1991-2001

Ronald N. Kostoff, Henry A. Buchtel, John Andrews, Kirstin M. Pfeil

Cortex > 2005 > 41 > 2 > 103-115

compared for the analysis: Full Text, Abstract, Title, Keywords. Results and Conclusions: Highly cited documents were compared among Cortex, Neuropsychologia, andBrain, and a number of interesting parametric trends were observed. The characteristics of the papers that cite Cortex papers were examined, and some interesting

Filter options

Keywords:
TEXT MINING
DOCUMENT CLUSTERING

Publication date

Set your own date range

INFONA - science communication portal

Search results

M2VSM: Extension of vector space model by introducing Meta keyword

Applying text and data mining techniques to forecasting the trend of petitions filed to e-People

A Cluster-based Approach to Filtering Spam under Skewed Class Distributions

Text Categorization with Considering Temporal Patterns of Term Usages

FMODC: Fuzzy guided multi-objective document clustering by GA

Exploration of a collection of documents in neuroscience and extraction of topics by clustering

The Hidden Structure of Neuropsychology: Text Mining of the Journal Cortex: 1991-2001

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options