Search results

Items from 1 to 20 out of 40 results

chapter

Better Guarantees for k-Means and Euclidean k-Median by Primal-Dual Algorithms

Sara Ahmadian, Ashkan Norouzi-Fard, Ola Svensson, Justin Ward

2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) > 61 - 72

Clustering is a classic topic in optimization with k-means being one of the most fundamental such problems. In the absence of any restrictions on the input, the best known algorithm for k-means with a provable guarantee is a simple local search heuristic yielding an approximation guarantee of 9+ε, a ratio that is known to be tight with respect to such methods. We overcome this barrier...

chapter

Clustering of microRNAs Using Rough Hypercuboid Based Fuzzy C-Means

Partha Garai, Pradipta Maji

2016 International Conference on Information Technology (ICIT) > 304 - 308

MicroRNAs form a family of single-stranded RNA molecules, approximately 22 nucleotides in length, that are present in all animals and plants. Various studies have revealed that microRNAs tend to cluster on chromosomes. In this regard, a novel clustering algorithm is presented in this paper, integrating the rough hypercuboid approach with fuzzy c-means. Using the concept of rough hypercuboid equivalence...

chapter

A Theoretical Analysis of the Fuzzy K-Means Problem

Johannes Blomer, Sascha Brauer, Kathrin Bujna

2016 IEEE 16th International Conference on Data Mining (ICDM) > 805 - 810

One of the most popular fuzzy clustering techniques is the fuzzy K-means algorithm (also known as fuzzy-c-means or FCM algorithm). In contrast to the K-means and K-median problem, the underlying fuzzy K-means problem has not been studied from a theoretical point of view. In particular, there are no algorithms with approximation guarantees similar to the famous K-means++ algorithm known for the fuzzy...

chapter

Random Projection Clustering on Streaming Data

Lee A. Carraher, Philip A. Wilsey, Anindya Moitra, Sayantan Dey

2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW) > 708 - 715

Clustering streaming data has gained importance in recent years due to an expanding opportunity to discover knowledge in widely available data streams. As streams are potentially evolving and unbounded sequences of data objects, clustering algorithms capable of performing fast and incremental processing of data points are necessary. This paper presents a method of clustering high-dimensional data streams...

chapter

A Fast, Scalable SLINK Algorithm for Commodity Cluster Computing Exploiting Spatial Locality

Poonam Goyal, Sonal Kumari, Sumit Sharma, Dhruv Kumar, more

2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) > 268 - 275

Single linkage (SLINK) hierarchical clustering algorithm is a preferred clustering algorithm over traditional partitioning-based clustering as it does not require the number of clusters as input. But, due to its high time complexity and inherent data dependencies, it does not scale well for large datasets. To the best of our knowledge, all existing parallel SLINK algorithms are based on the traditional...

chapter

Detecting outliers on UCI repository datasets by Adaptive Rough Fuzzy clustering method

P. Ashok, G.M. Kadhar Nawaz

2016 Online International Conference on Green Engineering and Technologies (IC-GET) > 1 - 6

The clustering is the most effective method to identify the outliers in the UCI Repository dataset. This paper proposes detecting outliers on UCI datasets using Adaptive Rough Fuzzy C-Means clustering algorithm. In the first phase of the Adaptive Rough Fuzzy C-Means algorithm, the Rough k means algorithm is used for pre-processing of UCI repository dataset and it is normally identify the outliers...

chapter

Towards optimization of availability and cost in selection of geo-distributed clouds datacenter

Hasan Ziafat, Seyed Morteza Babamir

2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT) > 1 - 5

With increasing data clouds in different geographical areas, the availability of a datacenter and the cost of using it are two factors of concern to cloud users. The present research aims to present a method using K-means clustering and the NSGA-II multi-objective algorithm to maximize availability and minimize cost in selecting a datacenter. The proposed approach was applied to some real...

chapter

Improving the Selection of Bases of BRDFs for Appearance Preservation

Fernando Melo Nascimento, Andre Britto de Carvalho, Beatriz Trinchao Andrade

2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 440 - 447

An important step in the appearance preservation of real materials is the analysis of how they interact with light. Since this phenomenon happens at a microscopic level, heuristics of differing complexity have been developed to capture and reproduce it. In order to minimize sampling efforts, one of these approaches consists in representing the reflectance of a material as a linear combination of...

chapter

Challenges and possible solutions to density based clustering

Fatma Gunseli Yasar, Gozde Ulutagay

2016 IEEE 8th International Conference on Intelligent Systems (IS) > 492 - 498

Clustering is a subject of statistical data analysis studied across disciplines. In this study, among various types of clustering algorithms, the algorithms derived from Density Based Spatial Clustering of Applications with Noise (DBSCAN) are investigated. Although DBSCAN is a well-known density-based algorithm, it has some bottlenecks. So, enhanced versions of DBSCAN are developed to provide some...

chapter

Theoretical analysis of the Minimum Sum of Squared Similarities sampling for Nyström-based spectral clustering

Djallel Bouneffouf, Inanc Birol

2016 International Joint Conference on Neural Networks (IJCNN) > 3856 - 3862

Spectral clustering has shown superior performance in analyzing cluster structure. However, its exponential computational complexity limits its application in analyzing large-scale data. To tackle this problem, many low-rank matrix approximating algorithms are proposed, of which the Nyström method is an approach with proved lower approximate errors. The algorithms commonly combine two powerful...

chapter

Big data and clustering algorithms

V W Ajin, Lekshmy D Kumar

2016 International Conference on Research Advances in Integrated Navigation Systems (RAINS) > 1 - 5

Data mining is the method which is useful for extracting useful information and data is extorted, but the classical data mining approaches cannot be directly used for big data due to their absolute complexity. The data that has been formed by numerous scientific applications and incorporated environment has grown rapidly not only in size but also in variety in the recent era. The data collected is of very...

chapter

Can We Group Similar Amazon Reviews: A Case Study with Different Clustering Algorithms

Chantal Fry, Sukanya Manna

2016 IEEE Tenth International Conference on Semantic Computing (ICSC) > 374 - 377

The amount of unstructured text data available is growing exponentially due to the proliferation of digital information such as emails, text messages, blogs, social media posts, and product reviews. For users of e-commerce websites such as Amazon, navigating thousands of reviews before buying a product can be a daunting task. Unsupervised machine learning techniques can be used to automatically analyze...

chapter

Efficiency analysis of kernel functions in uncertainty based c-means algorithms

Dishant Mittal, B. K. Tripathy

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 807 - 813

Application of clustering algorithms for investigating real life data has concerned many researchers, and vague approaches or their hybridization with other analogous approaches have gained special attention due to their great effectiveness. Recently, the rough intuitionistic fuzzy c-means algorithm has been proposed by Tripathy et al. [3] and they established its supremacy over all other algorithms contained...

chapter

A MinHash Approach for Clustering Large Collections of Binary Programs

Ciprian Oprisa

2015 20th International Conference on Control Systems and Computer Science > 157 - 163

2015 20th International Conference on Control Systems and Computer Science (CSCS)

Clustering large collections of binary programs is a challenging task due to two factors. First of all, a way to determine if two samples are similar or not is required. Secondly, pairwise comparison is impractical on collections comprising millions of items. This paper will mainly focus on the second factor and will propose a clustering algorithm based on the properties of MinHash functions. The...

chapter

A density based model for facility location problem

Ashish Sharma, Krishna Kant, Anand Singh Jalal

2014 Annual IEEE India Conference (INDICON) > 1 - 5

Solutions for facility location problems are numerous. As the problem is NP hard, continuous efforts have been made to find more efficient techniques. The nature of the facility adds to its variety. A popular approach has been based on geometric solutions. Other methods have also been tried; one of them is based on density applied for large databases as in Spatial Data Mining and Geographic Information...

chapter

A refined rough fuzzy clustering algorithm

Sahil Sobti, Vivek Shah, B. K. Tripathy

2014 IEEE International Conference on Computational Intelligence and Computing Research > 1 - 4

2014 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC)

Clustering is a familiar concept in the realm of Data mining and has wide applications in areas like image processing, pattern recognition and rule generation. Uncertainty in present day databases is a common feature. In order to handle these datasets, several clustering algorithms have been formulated in the literature. The first one being the Fuzzy C-Means (FCM) algorithm and it was followed by...

chapter

Semi-supervised Segmentation Fusion of Multi-spectral and Aerial Images

Mete Ozay

2014 22nd International Conference on Pattern Recognition > 3839 - 3844

2014 22nd International Conference on Pattern Recognition (ICPR)

A Semi-supervised Segmentation Fusion algorithm is proposed using consensus and distributed learning. The aim of Unsupervised Segmentation Fusion (USF) is to achieve a consensus among different segmentation outputs obtained from different segmentation algorithms by computing an approximate solution to the NP problem with less computational complexity. Semi-supervision is incorporated in USF using...

chapter

Network connectivity maintenance of WSNs in node failure-prone environment: A detailed survey

Tanu Pathak, Bhawna, Virender Ranga

2014 International Conference on Computing for Sustainable Global Development (INDIACom) > 685 - 691

Network connectivity maintenance in failure-prone environments has received more attention in recent years. Unfortunately, due to the hostile environment, there is a need for some other active nodes, i.e. backbone nodes, which can compensate for the failure of nodes. One of the main design challenges for wireless sensor networks (WSNs) is to obtain a connected dominating set (CDS) in polynomial time with low...

article

Constrained Concept Factorization for Image Representation

Haifeng Liu, Genmao Yang, Zhaohui Wu, Deng Cai

IEEE Transactions on Cybernetics > 2014 > 44 > 7 > 1214 - 1224

Matrix factorization based techniques, such as nonnegative matrix factorization and concept factorization, have attracted great attention in dimensionality reduction and data clustering. Previous studies show that both of them yield impressive results on image processing and document clustering. However, both of them are essentially unsupervised methods and cannot incorporate label information. In...

chapter

Clustering of Symbols Using Minimal Description Length

Oben M. Tataw, Thanawin Rakthanmanon, Eamonn J. Keogh

2013 12th International Conference on Document Analysis and Recognition > 180 - 184

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

The clustering of glyphs (individual letters/characters/symbols) is typically the first step in document processing algorithms and a critical enabling technology for most historical document indexing techniques. In this work, we take a step back from current domain/language specialized research efforts to consider the problem from an agnostic perspective. In particular, we claim that, independent...

Content availability:
Available
Data set:
ieee
Keywords:
ALGORITHM DESIGN AND ANALYSIS
CLUSTERING
APPROXIMATION ALGORITHMS

