Search results

chapter

Data mining based fragmentation and prediction of medical data

Hnin Wint Khaing

2011 3rd International Conference on Computer Research and Development > 2 > 480 - 485

2011 3rd International Conference on Computer Research and Development (ICCRD 2011)

Data mining concerns theories, methodologies, and in particular, computer systems for knowledge extraction or mining from large amounts of data. Association rule mining is a general purpose rule discovery scheme. It has been widely used for discovering rules in medical applications. The diagnosis of diseases is a significant and tedious task in medicine. The detection of heart disease from various...

chapter

A Modified k-means Algorithm for Clustering Problem with Balancing Constraints

Sun Yuepeng, Liu Min, Wu Cheng

2011 Third International Conference on Measuring Technology and Mechatronics Automation > 1 > 127 - 130

2011 International Conference on Measuring Technology and Mechatronics Automation (ICMTMA)

A clustering problem with balancing constraints is studied in this paper, which means that the sample number in each cluster has to be at least a pre-given value. A modified k-means clustering algorithm is proposed, which adopts the proposed heuristic cluster assignment algorithm to deal with the balancing constraints. Numerical computation shows that the proposed algorithm can deal with the balancing...

chapter

txtKnot — Text clustering based concept hierarchy to generalize from different text sources

D Jayasinghe, S Hettiarachchi, S Abeywickrama, C Ketteepearachchi, more

2010 Fifth International Conference on Information and Automation for Sustainability > 239 - 243

2010 5th International Conference on Information and Automation for Sustainability (ICIAfS)

Living in the modern technology dependent world, we heavily rely on electronically stored data and information, to come up with sound and timely decisions. Considering the entire information technology world, there exists an unimaginable volume of data which contains a lot of information which is relevant to various kinds of fields. But the problem emerges when we are interested to find out about...

chapter

An Improved Initialization Center Algorithm for K-Means Clustering

Baolin Yi, Haiquan Qiao, Fan Yang, Chenwei Xu

2010 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2010 International Conference on Computational Intelligence and Software Engineering (CiSE 2010)

The traditional k-means algorithm is sensitive to the initial centers. To solve this problem, this paper proposes a new method for finding the initial centers that reduces the k-means algorithm's sensitivity to them. The algorithm first computes the density of the area to which each data object belongs; then it finds k data objects, which belong to high-density areas, as the initial...

chapter

AK-Modes: A weighted clustering algorithm for finding similar case subsets

Lianhang Ma, Yefang Chen, Hao Huang

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 218 - 223

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

Finding similar crime case subsets is an important task for intelligence analysts in crime investigation. It can not only provide multiple clues to solve crimes but also improve the efficiency of catching criminals. However, the conventional approach of querying specific attributes in relational databases has two defects: first, it is relatively inefficient when a lot of incidents have to be handled;...

chapter

Application of Data Mining Technique in Customer Segmentation of Shipping Enterprises

Wei Wang, Shidong Fan

2010 2nd International Workshop on Database Technology and Applications > 1 - 4

2010 2nd International Workshop on Database Technology and Applications (DBTA 2010)

Previous studies have focused on several aspects of CRM (Customer Relationship Management). However, there is a lack of research that focuses on the customer segmentation of shipping enterprises using data mining. Data mining technology can be used in modern CRM to greatly enhance its function and efficiency. Based on the technologies of clustering and classification in data mining, this paper...

chapter

Algorithm of the Text Copy Detection Based on Topic Bag

Wang Sen, Wang Yu

2010 International Conference on Web Information Systems and Mining > 1 > 285 - 288

2010 International Conference on Web Information Systems and Mining (WISM 2010)

In order to address the current problem of serious academic plagiarism in the web environment, this article proposes a text copy detection algorithm based on the topic bag; the algorithm uses the ideas of semantic clustering and multi-instance learning. Firstly, a paper is divided into a three-layer tree: a leaf node denotes a sentence; a branch node represents a topic bag, and...

chapter

An Extended Fuzzy k-Means Algorithm for Clustering Categorical Valued Data

Wang Jiacai, Gu Ruijun

2010 International Conference on Artificial Intelligence and Computational Intelligence > 2 > 504 - 507

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

Although the fuzzy k-modes algorithm has removed the numeric-only limitation of the k-means algorithm, representing each attribute of the centroid with a single category value and using a simple distance measure compromise its precision and make it prone to falling into local optima. In this paper, an extended fuzzy k-means (xFKM) algorithm for clustering categorical valued data is presented, in which...

chapter

A New Scalability of Hybrid Fuzzy C-Means Algorithm

Hao Wang, Danyun Li, Yayun Chu

2010 International Conference on Artificial Intelligence and Computational Intelligence > 3 > 55 - 58

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

In this paper, a new scalability of hybrid fuzzy clustering algorithm that incorporates the Fuzzy C-means into the Quantum-behaved Particle Swarm Optimization (QPSO) algorithm is proposed. QPSO has fewer parameters and a stronger global convergence capability than the Particle Swarm Optimization algorithm. So the iteration algorithm is replaced by the new hybrid algorithm based on the gradient...

chapter

Facial expression recognition based orthogonal supervised spectral discriminant analysis

Zhan Wang, Qiuqi Ruan

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 1056 - 1059

2010 10th International Conference on Signal Processing (ICSP 2010)

In recent years, feature extraction methods have achieved success in pattern recognition and computer vision. They not only extract useful features for classification, but also reduce the dimension of pattern samples. In this paper, we propose orthogonal supervised spectral discriminant analysis (OSSDA), which is motivated by marginal Fisher analysis (MFA) and spectral clustering. It puts different weights...

chapter

A Clustering Method Based on the Most Similar Relation Diagram of Datasets

Yan Yu, Yu Shan Bai, Wei Hong Xu, Nan Li

2010 IEEE International Conference on Granular Computing > 598 - 603

2010 IEEE International Conference on Granular Computing (GrC-2010)

In this paper we present a novel clustering analysis method based on the Most Similar Relation Diagram (MSRD). MSRD is a diagram in which each datum of a dataset is linked to its most similar data. By cutting off some links in the diagram a certain number of clusters are formed. A comparison of the MSRD method with the hierarchical method is carried out. Clustering experiments using MSRD were done and the...

chapter

Heuristic based approach to clustering and its time critical applications

Alan Chia-Lung Chen, Shang Gao, Reda Alhajj, Panagiotis Karampelas

2010 IEEE International Conference on Information Reuse & Integration > 37 - 42

2010 IEEE International Conference on Information Reuse & Integration (IRI 2010)

Clustering may be named as the first clustering technique addressed by the research community since the 1960s. However, as databases continue to grow in size, numerous research studies have been undertaken to develop more efficient clustering algorithms and to improve the performance of existing ones. This paper demonstrates a general optimization technique applicable to clustering algorithms with a need...

chapter

Clustering GML documents using maximal frequent induced subtrees

Ying-wen Zhu, Gen-lin Ji, Qin-hong Sun

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2265 - 2269

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

An algorithm, TBCClustering, is presented in the paper for clustering GML documents using maximal frequent induced subtree patterns. TBCClustering mines the maximal frequent induced subtrees by using the structural information of GML documents. It can get the best minimum support automatically, and then chooses a set of subtree patterns to form the optimistic clustering features. Finally it uses CLOPE...

chapter

On algorithm for outliers detection in the process of mining cognitive maps based on data resources

Zhuang Chen, Guo Zhang, Huageng Tian

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 6 > 2849 - 2852

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Cognitive maps, one of the hot topics in computational intelligence research, have been widely used in knowledge representation and decision-making. When cognitive maps are mined from data resources, outlier data seriously affect their accuracy. Therefore, this paper, based on an analysis of traditional algorithms, proposes a new outlier data detection algorithm. The algorithm...

chapter

Chinese text categorization study based on CBM learning

Yan Zhan, Hao Chen

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1511 - 1514

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Text Categorization (TC) is an important component in many information organization and information management tasks. In many TC applications, the case-base grows at a fast rate and this causes inefficiency in the case retrieval process. Using Case-Base Maintenance learning via the GC (Generalization Capability) algorithm, which can reduce the number of cases passed to the KNN algorithm, can improve efficiency...

chapter

Clustering algorithm in literature-based discovery

Chunlei Ye, Fuhai Leng, Xin Guo

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 4 > 1625 - 1629

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Literature-based discovery is the linking of two or more literature concepts that have heretofore not been linked (i.e., disjoint), in order to produce novel, interesting, plausible, and intelligible knowledge. Cluster analysis is the core of literature-based discovery. This paper proposes an improved fuzzy c-means (FCM) algorithm based on the analysis of existing clustering analysis of literature-based...

chapter

A fuzzy ART2 model for finding association rules in medical data

Yo-Ping Huang, Vu Thi Thanh Hoa, Jung-Shian Jau, Frode Eika Sandnes

International Conference on Fuzzy Systems > 1 - 6

2010 IEEE International Conference on Fuzzy Systems

This paper describes a model that discovers association rules from a medical database to help doctors treat and diagnose a group of patients who show similar prehistoric medical symptoms. The proposed data mining procedure consists of two modules. The first is a clustering module that is based on a neural network, Adaptive Resonance Theory 2 (ART2), which performs affinity grouping tasks on a large...

chapter

Law Text Clustering Based on Referential Relations

Biao Fan, Tao Liu, He Hu, Xiaoyong Du

2010 Fifth Annual ChinaGrid Conference > 60 - 66

Fifth ChinaGrid Annual Conference (ChinaGrid 2010)

This paper proposes a new method to cluster law texts based on referential relation of laws. We extract law entities (an entity represents a law) and their referential relation from law texts. Then SimRank algorithm is applied to calculate law entity's similarity through referential relation and law clustering is carried out based on the SimRank similarity. This is the first time to apply SimRank...

chapter

Maintaining privacy and data quality in privacy preserving association rule mining

Chirag N Modi, Udai Pratap Rao, Dhiren R Patel

2010 Second International conference on Computing, Communication and Networking Technologies > 1 - 6

2010 International Conference on Computing, Communication and Networking Technologies (ICCCNT'10)

Privacy preserving data mining (PPDM) is a novel research direction that aims to protect sensitive knowledge from disclosure. Many researchers in this area have recently made efforts to preserve privacy for sensitive association rules in statistical databases. In this paper, we propose a heuristic algorithm named DSRRC (Decrease Support of R.H.S. item of Rule Clusters), which provides privacy...

chapter

A novel approach for hierarchical clustering in non-binary search space

G Praveen Kumar, A Sarkar, Ilhyun Lee, Haesun Lee, more

2010 8th IEEE International Conference on Industrial Informatics > 693 - 697

2010 8th IEEE International Conference on Industrial Informatics (INDIN 2010)

Data clustering is one of the powerful techniques for knowledge discovery from data. In this paper, a novel approach for hierarchical clustering has been proposed over non-binary search space. Besides the agglomerative methods, the proposed algorithm has considered the Strength of Presence associated with each transaction, to yield quality clusters which are closer to the real-life situation...

INFONA - science communication portal
