Search results

chapter

A novel clustering framework using farthest neighbour approach

Suvendu Kanungo, Aparna Shukla

2017 International Conference on Computing, Communication and Automation (ICCCA) > 164 - 169

2017 International Conference on Computing, Communication and Automation (ICCCA)

In this digital world, we face a flood of data yet remain starved of knowledge. Mining is needed to extract hidden patterns from the vast amount of data available. Clustering is one such mining tool; it addresses this situation through a set of crucial steps referred to as cluster analysis. It is the process of grouping patterns into clusters based...

chapter

Performance enhancement of K-means clustering algorithm for gene expression data using entropy-based centroid selection

Naveen Trivedi, Suvendu Kanungo

2017 International Conference on Computing, Communication and Automation (ICCCA) > 143 - 148

2017 International Conference on Computing, Communication and Automation (ICCCA)

Microarray data play a vital role in simultaneously monitoring the expression profiles of a large number of genes under various experimental conditions. In bioinformatics research, the recognition of co-expressed and coherent patterns is a major objective in microarray data analysis. The K-means clustering algorithm is gaining popularity in the knowledge discovery domain for effectively...

chapter

A modified DBSCAN clustering algorithm for proactive detection of DDoS attacks

Safaa O. Al-mamory, Zahraa M. Algelal

2017 Annual Conference on New Trends in Information & Communications Technology Applications (NTICT) > 304 - 309

2017 Annual Conference on New Trends in Information & Communications Technology Applications (NTICT)

In this paper, an accurate and proactive technique is developed to detect Distributed Denial of Service (DDoS) attacks. This is achieved by using an entropy concept to measure abnormal traffic changes according to the phases of the attack. This traffic is then clustered by using a modified DBSCAN algorithm, and the centroids for the resulting clusters are then used as patterns for efficient distance-based...

chapter

An Efficient Feature Subset Selection for Improved Stability Using T-Statistic

R. Karthika

2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM) > 326 - 329

2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM)

Large amounts of high-dimensional data are accumulated and stored in databases in day-to-day life. Data mining is used to extract useful information from this high-dimensional data. To classify or cluster high-dimensional data, its dimensionality needs to be reduced. Feature selection is used to select the features that are relevant to the analysis...

chapter

Biclustering on gene expression data

M P Shruthi

2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET) > 1 - 4

2017 International Conference on Algorithms, Methodology, Models and Applications in Emerging Technologies (ICAMMAET)

Microarray technology is an essential tool for observing and monitoring the genes in a living organism. Biclustering is a strategy for identifying genes that are co-regulated under a subset of conditions but are not necessarily co-regulated across other conditions. The dataset is in the form of a matrix whose rows represent a set of genes and whose columns represent a set of conditions...

chapter

Bearing fault diagnosis based on multi-scale possibilistic clustering algorithm

Ya-Ting Hu, Fu-Heng Qu, Chang-Ji Wen

2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) > 354 - 357

2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)

In on-line fault monitoring of rolling bearings, we have no information about the number of clusters in the obtained data signal, which poses a great challenge for on-line fault diagnosis when using clustering algorithms. In this paper, we extract three time-domain features of the vibration signals of rolling bearings as parameters, and then multi-scale possibilistic clustering (MPCM)...

chapter

An innovative approach to classify and retrieve text documents using feature extraction and Hierarchical clustering based on ontology

Aradhana R. Patil, Amrita A. Manjrekar

2016 International Conference on Computing, Analytics and Security Trends (CAST) > 371 - 376

2016 International Conference on Computing, Analytics and Security Trends (CAST)

Data retrieval is a key process for acquiring information as required. The need for proper information has increased. The most basic tool providing this service is the browser. It traverses the data according to the user's query and returns search results for all related information. Hence, finding the required information becomes a time-consuming process. In this paper, the focus is on content...

chapter

Image clustering based on deep sparse representations

Le Lv, Dongbin Zhao, Qingqiong Deng

2016 IEEE Symposium Series on Computational Intelligence (SSCI) > 1 - 6

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

Currently, deep neural networks (DNNs) trained with supervision have been successfully applied to several image classification tasks. However, how to extract powerful data representations and discover semantic concepts from unlabeled data is a more practical issue. Unsupervised feature learning methods aim at extracting abstract representations from unlabeled data. A large number of research works illustrate...

chapter

New Multi-class Clustering Approach for Protein Sequences

Faouzi Mhamdi, Marouen Hacheni, Imen Harzli

2016 27th International Workshop on Database and Expert Systems Applications (DEXA) > 57 - 60

2016 27th International Workshop on Database and Expert Systems Applications (DEXA)

Clustering, or unsupervised classification, is an important issue in Bioinformatics. It serves to automatically group protein sequences into families. Most researchers treat the biclass clustering problem. In this paper we present our approach for the multiclass clustering of protein sequences. It is a difficult problem, because we rely on primary structure. This approach consists of four steps. In...

chapter

Evidential Label Propagation Algorithm for Graphs

Kuang Zhou, Arnaud Martin, Quan Pan, Zhun-ga Liu

2016 19th International Conference on Information Fusion (FUSION) > 1316 - 1323

2016 19th International Conference on Information Fusion (FUSION)

Community detection has attracted considerable attention across many areas, as it can be used for discovering the structure and features of complex networks. With the increasing size of real-world social networks, community detection approaches should be fast and accurate. The Label Propagation Algorithm (LPA) is known to be one of the near-linear solutions and benefits from easy implementation,...

article

Data Randomization and Cluster-Based Partitioning for Botnet Intrusion Detection

Omar Y. Al-Jarrah, Omar Alhussein, Paul D. Yoo, Sami Muhaidat, et al.

IEEE Transactions on Cybernetics > 2016 > 46 > 8 > 1796 - 1806

Botnets, which consist of remotely controlled compromised machines called bots, provide a distributed platform for several threats against cyber world entities and enterprises. Intrusion detection system (IDS) provides an efficient countermeasure against botnets. It continually monitors and analyzes network traffic for potential vulnerabilities and possible existence of active attacks. A payload-inspection-based...

chapter

An unsupervised-based dynamic feature selection for classification tasks

Romulo de O. Nunes, Carine A. Dantas, Anne M. P. Canuto, Joao C. Xavier-Junior

2016 International Joint Conference on Neural Networks (IJCNN) > 4213 - 4220

2016 International Joint Conference on Neural Networks (IJCNN)

Recently, the number of features in different problem domains has grown enormously. In order to select the best representation (attributes) for these problems, a deep knowledge of the problem domain is required. As this type of knowledge is not always available, feature selection needs to be applied as an automatic process for selecting the most relevant attributes in a dataset. In this paper, we propose...

chapter

Criminal pattern identification based on modified K-means clustering

Turki Aljrees, Daming Shi, David Windridge, William Wong

2016 International Conference on Machine Learning and Cybernetics (ICMLC) > 2 > 799 - 806

2016 International Conference on Machine Learning and Cybernetics (ICMLC)

Data mining methods like clustering enable police to get a clearer picture of criminal identification and prediction. Clustering algorithms help to extract hidden patterns to identify groups and their similarities. In this paper, a modified k-means algorithm is proposed. Data points are allocated to their suitable class or cluster more effectively. The modified k-means algorithm reduces the...

chapter

Improving projected clustering algorithm for high dimensional dataset

Madhuri Dighe, Gajanan Gawde

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) > 1411 - 1415

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT)

The sparsity and the curse of dimensionality of high-dimensional data cause traditional clustering algorithms such as K-Means and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) to produce low-quality clusters and increase the time complexity exponentially. Many projected clustering algorithms have been proposed to deal with noisy high-dimensional data. However, most of...

chapter

Comparative study of k-means variants for mono-view clustering

Safa Bettoumi, Chiraz Jlassi, Najet Arous

2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 183 - 188

2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

Data clustering analysis is the process of finding similarity between data so that they are assigned to groups that are internally homogeneous and as heterogeneous as possible from one another. There are several analysis methods, among which the K-means clustering algorithm is the most widely used in different research areas. Therefore, this paper reviews the best-known variants of clustering methods, which are K-means, IRP-K-means and...

chapter

Classical and Re-learning Based Clustering Algorithms for Huge Data Warehouses

Syed Zubair Ahmad Shah, Mohammad Amjad

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT) > 209 - 213

2016 Second International Conference on Computational Intelligence & Communication Technology (CICT)

The aim of this paper is to provide a detailed, ordered description and analysis of the frequently used and important clustering algorithms, with a focus on recent advances, and to provide an extensive comparison of these algorithms in terms of their complexities and applications.

chapter

Agglomerative algorithm to discover semantics from unstructured big data

I-Jen Chiang

2015 IEEE International Conference on Big Data (Big Data) > 1556 - 1563

2015 IEEE International Conference on Big Data (Big Data)

The paper presents a graph model and an agglomerative algorithm for text document clustering. Given a set of documents, the associations among frequently co-occurring terms in any of the documents naturally form a graph, which can be decomposed into connected components at various levels. Each connected component represents a concept in the collection. These concepts can categorize documents into...

chapter

A Hybrid Method for Incomplete Data Imputation

Liang Zhao, Zhikui Chen, Zhennan Yang, Yueming Hu

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems > 1725 - 1730

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS) and 2015 IEEE 12th International Conf on Embedded Software and Systems (ICESS)

With the explosive increase of data volume, the research of data quality and data usability draws extensive attention. In this work, we focus on one aspect of data usability -- incomplete data imputation, and present a novel missing value imputation method using stacked auto-encoder and incremental clustering (SAICI). Specifically, SAICI's functionality rests on four pillars: (i) a distinctive value...

chapter

Modified K-means algorithm using timestamp initialization in sliding window to detect anomaly traffic

I Wayan Oka Krismawan Putra, Yudha Purwanto, Fiky Yosef Suratman

2015 International Conference on Control, Electronics, Renewable Energy and Communications (ICCEREC) > 19 - 23

2015 International Conference on Control, Electronics, Renewable Energy and Communications (ICCEREC)

Traffic anomalies on a network usually prevent authorized users from accessing it properly. They are caused by an increased number of simultaneous users or by a botnet attack on the network. This research proposes a method to detect whether traffic is anomalous. It uses the K-Means algorithm as the detection algorithm, modified in the determination of the centroid and the cluster...

chapter

A MinHash Approach for Clustering Large Collections of Binary Programs

Ciprian Oprisa

2015 20th International Conference on Control Systems and Computer Science > 157 - 163

2015 20th International Conference on Control Systems and Computer Science (CSCS)

Clustering large collections of binary programs is a challenging task due to two factors. First, a way to determine whether two samples are similar is required. Second, pairwise comparison is impractical on collections comprising millions of items. This paper will mainly focus on the second factor and will propose a clustering algorithm based on the properties of MinHash functions. The...

INFONA - science communication portal
