Items from 1 to 20 out of 32 results

chapter

Density-based clustering algorithm for GPGPU computing

Kai-Shiang Chang, Yi-Wen Peng, Wei-Mei Chen

2017 International Conference on Applied System Innovation (ICASI) > 774 - 777

2017 International Conference on Applied System Innovation (ICASI)

Clustering is a common data mining procedure that groups multi-dimensional points with similar components into different subsets. Among clustering algorithms, DBSCAN is one of the most popular owing to its ability to find clusters of arbitrary shape and to identify noise in datasets. However, with data volumes growing and the execution time of algorithms becoming longer, numerous methods have...

chapter

Clustering by Creating a Graph

Yiwen Wang, Haolan Zhang, Ke Huang, Changbin Yu, et al.

2016 12th International Conference on Computational Intelligence and Security (CIS) > 499 - 502

2016 12th International Conference on Computational Intelligence and Security (CIS)

In this paper, we present a novel graph-based clustering algorithm (GC). GC contains two main steps: the first step is to create a graph and find the key nodes to serve as centers; the second step is to assign every data point to a center. The centers are selected from a graph view. Experimental results on 8 datasets demonstrated that GC could do better than k-means, k-medoids, Hierarchical Clustering...

chapter

Improving classification in data mining using hybrid algorithm

Akanksha Ahlawat, Bharti Suri

2016 1st India International Conference on Information Processing (IICIP) > 1 - 4

2016 1st India International Conference on Information Processing (IICIP)

Data mining is a powerful concept with great potential to predict future trends and behavior. It refers to the extraction of hidden knowledge from large datasets using techniques like statistical analysis, machine learning, clustering, neural networks and genetic algorithms. Hybrid algorithms for data mining are a logical combination of multiple pre-existing techniques to enhance performance and provide...

chapter

Range clustering: An algorithm for empirical evaluation of classical clustering algorithms

Nishant Arora, Sandeep Jain, Santosh Kumar Verma

2016 Ninth International Conference on Contemporary Computing (IC3) > 1 - 4

2016 Ninth International Conference on Contemporary Computing (IC3)

Cluster analysis is a principal method in the analytics domain of data mining. The algorithm used for clustering directly influences the resulting clusters. Data clustering is done in order to identify patterns and trends that are not identifiable from just looking at the data. Clustering may be supervised (if the machine training data set is available) or unsupervised...

chapter

Theoretical analysis of the Minimum Sum of Squared Similarities sampling for Nyström-based spectral clustering

Djallel Bouneffouf, Inanc Birol

2016 International Joint Conference on Neural Networks (IJCNN) > 3856 - 3862

2016 International Joint Conference on Neural Networks (IJCNN)

Spectral clustering has shown superior performance in analyzing cluster structure. However, its exponential computational complexity limits its application to large-scale data. To tackle this problem, many low-rank matrix approximation algorithms have been proposed, of which the Nyström method is an approach with provably lower approximation errors. The algorithms commonly combine two powerful...

chapter

Big data and clustering algorithms

V W Ajin, Lekshmy D Kumar

2016 International Conference on Research Advances in Integrated Navigation Systems (RAINS) > 1 - 5

2016 International Conference on Research Advances in Integrated Navigation Systems (RAINS)

Data mining is a method for extracting useful information from data, but classical data mining approaches cannot be applied directly to big data because of their complexity. The data produced by numerous scientific applications and integrated environments has recently grown rapidly, not only in size but also in variety. The data collected is of very...

chapter

Comparing clustering algorithms on wisconsin data set

Mucahit Erken

2016 24th Signal Processing and Communication Application Conference (SIU) > 1541 - 1544

2016 24th Signal Processing and Communication Application Conference (SIU)

The amount and diversity of data produced and processed have increased dramatically in parallel with improvements in technology. Unfortunately, the data produced usually lack labels, which would make classification and information building easier. This has raised the importance of data clustering for building information. In this work K-Means, Spectral Clustering and Girvan-Newman...

chapter

Application of Hierarchical Clustering Algorithm to Evaluate Students Performance of an Institute

Shiwani Rana, Roopali Garg

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT) > 692 - 697

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT)

Machine Learning is the field of computer science that learns from data by studying algorithms and their constructions. In machine learning, predictions can be made by using certain algorithms for specific inputs. In this paper, important classification and clustering algorithms are discussed which can be further applied to BE (Information Technology) Third Semester to evaluate students' performance...

chapter

Empirical evaluation of K-Means, Bisecting K-Means, Fuzzy C-Means and Genetic K-Means clustering algorithms

Shreya Banerjee, Ankit Choudhary, Somnath Pal

2015 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE) > 168 - 172

2015 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON-ECE)

Clustering is one of the most widely studied problems in machine learning and data mining. The algorithms for clustering depend on the application scenario and data domain. The K-Means algorithm is one of the most popular clustering techniques that depend on a distance measure. In this work, an extensive empirical evaluation of three significant variations of the K-Means algorithm is carried out on the basis...

chapter

Elective Recommendation Support through K-Means Clustering Using R-Tool

Agnivesh, Rajiv Pandey

2015 International Conference on Computational Intelligence and Communication Networks (CICN) > 851 - 856

2015 International Conference on Computational Intelligence and Communication Networks (CICN)

The data generated by both people and machines is exponentially multiplying the size and the structural definition of the data. Such voluminous, dynamic and unstructured data, termed Big Data, is analyzed and maintained and can be used for various purposes and applications. Big Data is generated from sources like social media, cyber-physical systems and business entities. This enormous data generation...

chapter

Using Word2Vec to process big text data

Long Ma, Yanqing Zhang

2015 IEEE International Conference on Big Data (Big Data) > 2895 - 2897

2015 IEEE International Conference on Big Data (Big Data)

Big data is a broad data set that has been used in many fields. Processing a huge data set is time-consuming work, not only because of its volume, but also because data types and structures can be different and complex. Currently, many data mining and machine learning techniques are being applied to deal with big data problems; some of them can construct a good learning algorithm in terms...

chapter

Efficiency analysis of kernel functions in uncertainty based c-means algorithms

Dishant Mittal, B. K. Tripathy

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 807 - 813

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

The application of clustering algorithms to real-life data has interested many researchers, and vague approaches or their hybridization with other analogous approaches have gained special attention due to their great effectiveness. Recently, the rough intuitionistic fuzzy c-means algorithm was proposed by Tripathy et al. [3], and they established its supremacy over all other algorithms contained...

chapter

Machine learning based social media recommendation

Taiping Lai, Xianghan Zheng

2015 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM) > 28 - 32

2015 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM)

In view of the low accuracy and low efficiency of traditional recommendation algorithms, this paper presents a machine learning based social media recommendation algorithm. The algorithm is based on the traditional personalized collaborative filtering algorithm and combines it with the correlation characteristics among users in a social network. Besides, the algorithm also considers...

chapter

Big Data Stream Learning with SAMOA

Albert Bifet, Gianmarco De Francisci Morales

2014 IEEE International Conference on Data Mining Workshop > 1199 - 1202

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

Big data is flowing into every area of our life, professional and personal. Big data is defined as datasets whose size is beyond the ability of typical software tools to capture, store, manage and analyze, due to the time and memory complexity. Velocity is one of the main properties of big data. In this demo, we present SAMOA (Scalable Advanced Massive Online Analysis), an open-source platform for...

chapter

SOM Clustering Using Spark-MapReduce

Tugdual Sarazin, Hanane Azzag, Mustapha Lebbah

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 1727 - 1734

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

In this paper, we consider designing clustering algorithms that can be used in MapReduce on the Spark platform, one of the most popular programming environments for processing large datasets. We focus on the practical and popular serial Self-organizing Map clustering algorithm (SOM). SOM is one of the best-known unsupervised learning algorithms and is useful for cluster analysis of large quantities of...

chapter

Informative Projection Recovery for Classification, Clustering and Regression

Madalina Fiterau, Artur Dubrawski

2013 12th International Conference on Machine Learning and Applications > 1 > 15 - 20

2013 12th International Conference on Machine Learning and Applications (ICMLA)

Data driven decision support systems often benefit from human participation to validate outcomes produced by automated procedures. Perceived utility hinges on the system's ability to learn transparent, comprehensible models from data. We introduce and formalize Informative Projection Recovery: the problem of extracting a set of low-dimensional projections of data which jointly form an accurate solution...

chapter

An improvement of DBSCAN Algorithm to analyze cluster for large datasets

Chetan Dharni, Meenakshi Bansal

2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE) > 42 - 46

2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE)

Clustering is an important tool which has seen explosive growth in machine learning algorithms. The DBSCAN (Density-Based Spatial Clustering of Applications with Noise) clustering algorithm is one of the primary methods for clustering in data mining. DBSCAN can find clusters of variable sizes and shapes, and it also detects noise. The two important parameters Epsilon (Eps)...

chapter

An integrated clustering approach for high dimensional categorical data

K. Kalaivani, A. P. V. Raghavendra

2013 International Conference on Green High Performance Computing (ICGHPC) > 1 - 4

2013 International Conference on Green High Performance Computing (ICGHPC'13)

Clustering is an attractive and important task in data mining which is used in many applications. However, earlier work on clustering focused only on categorical data, grouping similar data items based on attribute values, which leads to a convergence problem in the clustering process. The proposed work enhances the existing k-means clustering process based on the categorical...

chapter

Enhancing the K-means Clustering Algorithm by Using a O(n logn) Heuristic Method for Finding Better Initial Centroids

K A A Nazeer, S D M Kumar, M P Sebastian

2011 Second International Conference on Emerging Applications of Information Technology > 261 - 264

Second International Conference on Emerging Applications of Information Technology (EAIT 2011)

With the advent of modern techniques for scientific data collection, large quantities of data are accumulating in various databases. Systematic data analysis methods are necessary to extract useful information from rapidly growing data banks. Cluster analysis is one of the major data mining methods, and the k-means clustering algorithm is widely used for many practical applications. But the...

chapter

Pseudo fuzzy clustering derived from Fisher criterions

Shibin Xuan, Yiguang Liu

2010 3rd International Congress on Image and Signal Processing > 4 > 1914 - 1918

3rd International Congress on Image and Signal Processing (CISP 2010)

This paper describes a new revised clustering algorithm in which each cluster center is derived from the revised mean of a subclass in the previous recursion. The modification factor is made up of the mean of the cluster center in the previous recursion multiplied by a coefficient polynomial. The formula for computing the center is derived from the Fisher criterion. Experimental results show that the proposed clustering...
