Search results

Items 1 to 20 of 24 results

chapter

An Improved Initialization Center Algorithm for K-Means Clustering

Baolin Yi, Haiquan Qiao, Fan Yang, Chenwei Xu

2010 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2010 International Conference on Computational Intelligence and Software Engineering (CiSE 2010)

The traditional k-means algorithm is sensitive to the initial centers. To solve this problem, this paper proposes a new method for finding the initial centers that reduces this sensitivity. The algorithm first computes the density of the area to which each data object belongs; then it finds k data objects, which belong to high-density areas, as the initial...

chapter

An Extended Fuzzy k-Means Algorithm for Clustering Categorical Valued Data

Wang Jiacai, Gu Ruijun

2010 International Conference on Artificial Intelligence and Computational Intelligence > 2 > 504 - 507

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

Although the fuzzy k-modes algorithm has removed the numeric-only limitation of the k-means algorithm, representing each attribute of the centroid with a single category value and using a simple distance measure compromise its precision and make it prone to falling into local optima. In this paper, an extended fuzzy k-means (xFKM) algorithm for clustering categorical valued data is presented, in which...

chapter

On algorithm for outliers detection in the process of mining cognitive maps based on data resources

Zhuang Chen, Guo Zhang, Huageng Tian

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 6 > 2849 - 2852

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Cognitive maps, one of the hot topics in computational intelligence research, have been widely used in knowledge representation and decision-making. When cognitive maps are mined from data resources, outlier data seriously affect their accuracy. Therefore, this paper, based on an analysis of traditional algorithms, proposes a new outlier data detection algorithm. The algorithm...

chapter

A novel approach for hierarchical clustering in non-binary search space

G Praveen Kumar, A Sarkar, Ilhyun Lee, Haesun Lee, et al.

2010 8th IEEE International Conference on Industrial Informatics > 693 - 697

2010 8th IEEE International Conference on Industrial Informatics (INDIN 2010)

Data clustering is one of the powerful techniques for the knowledge discovery from data. In this paper, a novel approach for hierarchical clustering has been proposed over non-binary search space. Besides the agglomerative methods, the proposed algorithm has considered the Strength of Presence associated with each transaction, to yield quality clusters which are again more close to the real life situation...

chapter

A comparison of two suffix tree-based document clustering algorithms

M Rafi, M Maujood, Murtaza Munawar Fazal, Syed Muhammad Ali

2010 International Conference on Information and Emerging Technologies > 1 - 5

2010 International Conference on Information and Emerging Technologies (ICIET)

Document clustering is an unsupervised approach extensively used to navigate, filter, summarize and manage large collections of document repositories like the World Wide Web (WWW). Recently, the focus in this domain has shifted from traditional vector-based document similarity for clustering to suffix tree-based document similarity, as it offers a more semantic representation of the text present in the document...

chapter

Research on k-means Clustering Algorithm: An Improved k-means Clustering Algorithm

Shi Na, Liu Xumin, Guan Yong

2010 Third International Symposium on Intelligent Information Technology and Security Informatics > 63 - 67

Third International Symposium on Intelligent Information Technology and Security Informatics (IITSI 2010)

Clustering analysis method is one of the main analytical methods in data mining, the method of clustering algorithm will influence the clustering results directly. This paper discusses the standard k-means clustering algorithm and analyzes the shortcomings of standard k-means algorithm, such as the k-means clustering algorithm has to calculate the distance between each data object and all cluster...

chapter

Incremental clustering for categorical data using clustering ensemble

Li Taoying, Chne Yan, Qu Lili, Mu Xiangwei

Proceedings of the 29th Chinese Control Conference > 2519 - 2524

2010 29th Chinese Control Conference (CCC 2010)

More and more data in practice is changing every minute and is being collected in incremental mode, and incremental clustering has attracted much attention from researchers. However, little research now focuses on partitioning categorical data in incremental mode. How to design incremental clustering for categorical data is an urgent problem. We propose an incremental clustering for categorical data using...

chapter

Customer Behavior Pattern Discovering Based on Mixed Data Clustering

Cheng Mingzhi, Xin Yang, Tian Yangge, Wang Cong, et al.

2009 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2009 International Conference on Computational Intelligence and Software Engineering

To retain customers effectively and enhance marketing capabilities, it is necessary to improve the personalization of e-commerce systems. Clustering is a reliable and efficient technology for providing personal service in e-commerce systems. However, current research on clustering algorithms is usually based on numeric data or categorical data. To analyze customer behavior, mixed data set must be...

chapter

An Improved Entropy-Based Ant Clustering Algorithm

Zhao Weili

2009 WASE International Conference on Information Engineering > 2 > 41 - 44

2009 WASE International Conference on Information Engineering (ICIE)

Sorting and clustering methods inspired by the behavior of real ants are among the earliest methods in ant-based meta-heuristics. We revisit these methods in the context of a concrete application and introduce some modifications that yield significant improvements in terms of both quality and efficiency. In this paper, we propose an Improved entropy-based ant clustering (IEAC) algorithm. Firstly,...

chapter

Research and application of a multi-ant colony clustering combination algorithm

Wei Xianmin

2009 4th International Conference on Computer Science & Education > 27 - 30

2009 4th International Conference on Computer Science & Education (ICCSE 2009)

At first, some improvements were done in a single ant colony clustering algorithm, then, for different speed ant colony, clustering analysis was finished independently and in parallel by imitating the collaborative performance of multi-colony, and clustering results were combined into a hyper-graph and second division was made in the hyper-graph using ACA, at last, the test result for four databases...

chapter

An Approximation Algorithm for Max k-Uncut with Capacity Constraints

S. Choudhury, D.R. Gaur, R. Krishnamurti

2009 International Joint Conference on Computational Sciences and Optimization > 2 > 934 - 938

2009 International Joint Conference on Computational Sciences and Optimization, CSO

Clusters in protein interaction networks can potentially help identify functional relationships among proteins. The clustering problem can be modeled as a graph cut problem. Given an edge weighted graph the problem is to partition the vertices of the graph into k partitions of prescribed sizes such that the total weight of the edges within partitions are maximized. This problem is NP-complete for...

chapter

Detection of Local Outlier over Dynamic Data Streams Using Efficient Partitioning Method

M. Elahi, Kun Li, W. Nisar, Xinjie Lv, et al.

2009 WRI World Congress on Computer Science and Information Engineering > 4 > 76 - 81

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Some of the important applications in the field of data mining are fraud detection, customer behavior analysis, and intrusion detection. There are number of good research algorithms for detecting outliers if the entire data is available and algorithms can...

chapter

A Trajectory Clustering Algorithm Based on Symmetric Neighborhood

Yu Zhang, Dechang Pi

2009 WRI World Congress on Computer Science and Information Engineering > 3 > 640 - 645

2009 WRI World Congress on Computer Science and Information Engineering, CSIE

Trajectory clustering is attractive for the task of class identification in spatial databases. The existing trajectory clustering algorithm TRCLUS uses global parameters to discover common trajectories. However, it cannot discover small, dense clusters and is sensitive to two input parameters. Based on the partition-and-group framework, we propose a simple but effective trajectory clustering algorithm...

chapter

A Clustering Algorithm Based on Symmetric Neighborhood of Micro-clusters

Yu Zhang, Dechang Pi

2009 International Conference on Computer and Automation Engineering > 118 - 122

2009 International Conference on Computer and Automation Engineering. ICCAE 2009

Clustering is an important task in data mining with numerous applications, including minefield detection, seismology, astronomy, etc. At present, the academic communities have introduced various clustering algorithms, and these methods have been widely applied to different fields according to their respective characteristics. In this paper, we propose a novel clustering algorithm based on symmetric...

chapter

A Cluster Algorithm Identifying the Clustering Structure

Zhi-Wei Sun

2008 International Conference on Computer Science and Software Engineering > 4 > 288 - 291

2008 International Conference on Computer Science and Software Engineering (CSSE 2008)

Cluster analysis is a primary method for database mining. Most of clustering algorithms require input parameters which are hard to determine but have a significant influence on the clustering result. Furthermore, for many real-datasets there does not exist a global parameter setting for which the result of the clustering algorithm describes the intrinsic clustering structure accurately. We introduce...

chapter

Hierarchical Clustering of Large-Scale Short Conversations Based on Domain Ontology

Yongheng Wang, Bo Guo

2008 International Symposium on Computer Science and Computational Technology > 1 > 126 - 130

2008 International Symposium on Computer Science and Computational Technology (ISCSCT)

With the rapid development of the Internet and communication technology, huge data is accumulated. Short text such as conversation in chatting room and email is common in such data. It is useful to cluster such short documents to get the structure of the data or to help building other data mining applications. But most of the current clustering algorithms can not get acceptable clustering accuracy...

chapter

Constrained clustering by a novel graph-based distance transformation

K. Rothaus, Xiaoyi Jiang

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

In this work we present a novel method to model instance-level constraints within a clustering algorithm. Thereby, both similarity and dissimilarity constraints can be used coevally. The proposed extension is based on a distance transformation by shortest path computations in a constraint graph. With a new technique cannot-links are consistently supported and the dissimilarity is extended to their...

chapter

Distributed Clustering for Data Sources with Diverse Schema

N.K. Visalakshi, K. Thangavel, P. Alagambigai

2008 Third International Conference on Convergence and Hybrid Information Technology > 1 > 1058 - 1063

2008 Third International Conference on Convergence and Hybrid Information Technology (ICCIT)

Many enterprises incorporate information gathered from a variety of data sources into an integrated input for some learning task. For example, aiming towards the design of an automated diagnostic tool for some diseases, one may wish to integrate data gathered from many different hospitals. Analyzing and mining these distributed heterogeneous data sources require distributed machine learning and data...

chapter

GMDBSCAN: Multi-Density DBSCAN Cluster Based on Grid

Chen Xiaoyun, Min Yufang, Zhao Yan, Wang Ping

2008 IEEE International Conference on e-Business Engineering > 780 - 783

2008 IEEE International Conference on e-Business Engineering

DBSCAN is one of the most popular algorithms for cluster analysis. It can discover all clusters with arbitrary shape and separate noises. But this algorithm can't choose parameter according to distributing of dataset. It simply uses the global MinPts parameter, so that the clustering result of multi-density database is inaccurate. In addition, when it is used to cluster large databases, it will...

chapter

Improving Fuzzy C-Means Clustering Based on Adaptive Weighting

Wei Wang, Chunheng Wang, Xia Cui, Ai Wang

2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 62 - 66

2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

In traditional FCM clustering algorithm each feature is supposed to have equal importance. Considering different feature with different importance, this paper presented an improved FCM algorithm with adaptive weight for features of each cluster, named AWFCM. In the iterative AWFCM process, to identify the importance of features of each cluster, the weight for feature is computed dynamically based...

Filter options

Dataset:
ieee
Keywords:
ALGORITHM DESIGN AND ANALYSIS
PATTERN CLUSTERING
DATABASES
PARTITIONING ALGORITHMS

Keywords

DATA MINING (17)
CLUSTERING (6)
CLUSTERING ALGORITHM (6)
DISTANCE MEASUREMENT (6)
ACCURACY (5)
CLASSIFICATION ALGORITHMS (5)
COMPLEXITY THEORY (3)
FUZZY SET THEORY (3)
GRAPH THEORY (3)
HEURISTIC ALGORITHMS (3)
K-MEANS (3)
K-MEANS ALGORITHM (3)
K-MEANS CLUSTERING ALGORITHM (3)
MACHINE LEARNING ALGORITHMS (3)
NEAREST NEIGHBOR SEARCHES (3)
CATEGORICAL DATA (2)
CLUSTER ANALYSIS (2)
CLUSTERING ANALYSIS (2)
COMPUTATIONAL COMPLEXITY (2)
COMPUTATIONAL EFFICIENCY (2)
DATA HANDLING (2)
DISTRIBUTED DATABASES (2)
ENTROPY (2)
GRID (2)
GRID COMPUTING (2)
HIERARCHICAL CLUSTERING (2)
INDEXES (2)
INTERNET (2)
NOISE (2)
OPTIMISATION (2)
OUTLIER (2)
PARALLEL ALGORITHMS (2)
SECURITY OF DATA (2)
SYMMETRIC NEIGHBORHOOD (2)
TREE DATA STRUCTURES (2)
VERY LARGE DATABASES (2)
ADAPTIVE WEIGHT (1)
AGENT BEHAVIOR MODEL (1)
AGGLOMERATIVE METHOD (1)
ANT COLONY ALGORITHM (1)
ANT-BASED ALGORITHM (1)
ANT-BASED META-HEURISTICS (1)
APPROXIMATION ALGORITHMS (1)
APPROXIMATION METHODS (1)
ARBITRARY SHAPE (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTOMATED DIAGNOSTIC TOOL (1)
BIOINFORMATICS (1)
BSNTC (1)
C-MODES (1)
CAPACITY CONSTRAINT (1)
CAPACITY CONSTRAINTS (1)
CATEGORICAL VALUED DATA (1)
CENTROID MAPPING (1)
CHAPTERS (1)
CHATTING ROOM (1)
CLUSTER BASED PARTITIONING ALGORITHM (1)
CLUSTER CENTERS (1)
CLUSTER CENTROID VECTOR (1)
CLUSTER ENSEMBLE (1)
CLUSTER STRUCTURE (1)
CLUSTERING ANALYSIS METHOD (1)
CLUSTERING ENSEMBLE (1)
CLUSTERING INFORMATION (1)
CLUSTERING MEMBERSHIPS (1)
CLUSTERING METHODS (1)
CLUSTERING PROBLEM (1)
CLUSTERING STRUCTURE (1)
CO-OCCURRENCE FREQUENCY COMPONENT (1)
COGNITIVE MAP MINING (1)
COGNITIVE MAPS (1)
COLLABORATION (1)
COMPUTATIONAL INTELLIGENCE (1)
COMPUTATIONAL MODELING (1)
CONSTANT FACTOR APPROXIMATION ALGORITHM (1)
CONSTRAINT GRAPH (1)
CONSUMER BEHAVIOUR (1)
CONVERGENCE (1)
COPGB-K-MEANS ALGORITHM (1)
COST FUNCTION (1)
CRM (1)
CUSTOMER BEHAVIOR ANALYSIS (1)
CUSTOMER BEHAVIOR PATTERN DISCOVERING (1)
CUSTOMER RELATIONSHIP MANAGEMENT (1)
DATA ANALYSIS (1)
DATA CLUSTERING (1)
DATA DETECTION ALGORITHM (1)
DATA MINING TECHNIQUE (1)
DATA OBJECT (1)
DATA OBJECT DETECTION (1)
DATA RESOURCE (1)
DATA STRUCTURE (1)
DATABASE (1)
DATABASE MANAGEMENT SYSTEMS (1)
DATABASE MINING (1)

INFONA - science communication portal
