Search results

Items from 1 to 20 out of 21 results

chapter

Analysis of clustering algorithms in biological networks

Asuda Sharma, Hesham H. Ali

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 2303 - 2305

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Biological data is often represented as networks, as in the case of protein-protein interactions and metabolic pathways. Modeling, analyzing, and visualizing networks can help make sense of large volumes of data generated by high-throughput experiments. However, due to their size and complex structure, biological networks can be difficult to interpret without further processing. Cluster analysis is...

chapter

Tracking feature relevance with the plaid model in continuously changing datasets

Richard D. Appiah, Sumeet Dua

2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA) > 20 - 24

2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA)

Selecting relevant features in data modeling is critical to ensure effective and accurate prediction of future effects. The problem becomes compounded when the relevance of previously selected features cannot be guaranteed due to changes in the underlying dataset. We propose an algorithm based on the statistical plaid model for the discovery and tracking of feature relevance scores in datasets that...

chapter

Deriving Cognitive Map Concepts on the Basis of Social Media Data Clustering

Vasiliy S. Kireev

2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW) > 37 - 40

2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW)

With data storage and processing technology developing fast, there has been accumulated a great amount of open data that comes from everywhere including social media. One of the promising tools to analyze these data is fuzzy cognitive maps that help to describe connections and substances to reveal patterns, facts and knowledge. One of the problems when creating cognitive maps is the identification...

chapter

On fuzzy clustering for heavy-tailed data

S. Mahmoud Taheri, A. Mohammadpour, Israa Atiyah

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS) > 202 - 206

2017 5th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)

The fuzzy c-means method is investigated to cluster the heavy tailed data by using some measures of distance. A comparison study is provided based on time and precision. The results show that when using the Euclidean distance, the time required is less than if we used Manhattan distance, but the precision is higher when using the Manhattan distance.

article

An Optimization Model for Clustering Categorical Data Streams with Drifting Concepts

Liang Bai, Xueqi Cheng, Jiye Liang, Huawei Shen

IEEE Transactions on Knowledge and Data Engineering > 2016 > 28 > 11 > 2871 - 2883

There is always a lack of a cluster validity function and optimization strategy to find out clusters and catch the evolution trend of cluster structures on a categorical data stream. Therefore, this paper presents an optimization model for clustering categorical data streams. In the model, a cluster validity function is proposed as the objective function to evaluate the effectiveness of the clustering...

chapter

Implementing cluster analysis tool for the identification of students typologies

Lotfi Najdi, Brahim Er-Raha

2016 4th IEEE International Colloquium on Information Science and Technology (CiSt) > 575 - 580

2016 4th IEEE International Colloquium on Information Science and Technology (CIST)

The identification of students' typologies plays interesting role in adapting educational strategies and improving academic performances. In this work, we show how unsupervised learning techniques can be applied to educational data for the extraction of typologies and profiles of graduate students based on educational outcomes in combination with the time to degree. We also describe a web-based tool...

chapter

Prediction of the Syngas composition of a gasification process

S. M. Zanoli, G. Astolfi, L. Barboni

2012 5th International Symposium on Communications, Control and Signal Processing > 1 - 5

2012 5th International Symposium on Communications, Control and Signal Processing (ISCCSP)

The scope of this work is the development of a mathematical model of a gasification process to be used for the prediction of the Syngas composition. The predictions are intended to support the gascromathographic measurements of the Syngas composition which are often not available due to periodic calibrations. This work represents the first step of broader project which scope is the development of...

chapter

Research and Implementation of an Anomaly Detection Model Based on Clustering Analysis

Li Han

2010 International Symposium on Intelligence Information Processing and Trusted Computing > 458 - 462

2010 International Symposium on Intelligence Information Processing and Trusted Computing (IPTC 2010)

IDS (Intrusion Detection system) is an active and driving defense technology. This paper mainly focuses on intrusion detection based on data mining. The aim is to improve the detection rate and decrease the false alarm rate, and the main research method is clustering analysis. The algorithm and model of ID are proposed and corresponding simulation experiments are presented. Firstly, a method to reduce...

chapter

Marginal maximum likelihood estimation of single parameter logistic based on EM algorithm

Xueyan Sun, Fengxuan Jing, Xiaoyao Xie, Anyu Zhang

2010 International Conference on Anti-Counterfeiting, Security and Identification > 173 - 175

2010 International Conference on Anti-Counterfeiting, Security and Identification (2010 ASID)

Cluster analysis is one of the most important functions of data mining. Expectation Maximization (EM) method is an important technology based on model clustering method. The expectation maximization algorithm is analyzed in this research and applied to Adaptive Testing System, in which logistic function in item response theory serves as a model, and the combination of methods of marginal maximum likelihood...

chapter

Feature weighing for efficient clustering

W Ahmad, A Narayanan

2010 6th International Conference on Advanced Information Management and Service (IMS) > 236 - 242

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

In cluster analysis, current algorithms assume that all features in the data contribute uniformly in assigning samples to clusters. This assumption can lead to poor clustering results, due to the existence of noisy and less important features. Feature weighting overcomes this issue by assigning different weights to features based on some notion of importance. According to feature weighting, more important...

chapter

Reducing dendrogram instability of features using rough set indiscernibility level

R. B. Fajriya Hakim, , Subanar, Edi Winarko

2010 International Conference on Distributed Frameworks for Multimedia Applications > 1 - 10

2010 International Conference on Distributed Frameworks for Multimedia Applications (DFmA)

Cluster analysis is one of the most well known methods in data mining. One of the major problems in clustering is the dendrogram instability due to data input order. Rough set has already been used as an intelligent approach to data mining. The core concept of classical rough sets is to cluster similarities and differences of data objects based on the notions of indiscernibility and indiscernibility...

chapter

Using Incremental Fuzzy Clustering to Web Usage Mining

S.R. Aghabozorgi, T.Y. Wah

2009 International Conference of Soft Computing and Pattern Recognition > 653 - 658

2009 International Conference of Soft Computing and Pattern Recognition

The recent extensive growth of data on the Web, has generated an enormous amount of log records on Web server databases. Applying Web usage mining techniques on these vast amounts of historical data can discover potentially useful patterns and reveal user access behaviors on the Web site. Cluster analysis has widely been applied to generate user behavior models on server Web logs. Most of these off-line...

chapter

Research on Classification and Subdivision Model of Telecom Rural Channel Based on Clustering Analysis

Wang Yuan, Zhang Yihua

2009 International Conference on Information Management, Innovation Management and Industrial Engineering > 3 > 531 - 534

2009 International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII 2009)

Rapid advances in data collection and storage technology have enabled telecom company to accumulate vast amounts of data. However, extracting useful information has proven extremely challenging. Telecom enterprises are holding massive customers' data and should convert it to competitive advantage in order to maximize customers' profitability. Based on CRISP-DM (cross-industry standard process for...

chapter

Gaussian Mixture Models and Split-Merge Algorithm for parameter analysis of tracked video objects

GuoQing Yin, D. Bruckner

2009 35th Annual Conference of IEEE Industrial Electronics > 4155 - 4158

IECON 2009 - 35th Annual Conference of IEEE Industrial Electronics (IECON 2009)

Parameters of tracked video objects (for example: the angles of moving objects) are discrete random variables and the amount of data increases over time. In this paper we use a new method to analyze the parameter angle: the video frame is segmented into small sections and in each section the angle values during some time period are gathered. Through analysis the angle data in each section these angles...

chapter

Determining provenance in phishing websites using automated conceptual analysis

R. Layton, P. Watters

2009 eCrime Researchers Summit > 1 - 7

2009 eCrime Researchers Summit. eCRIME 2009

Phishing is a form of online fraud with drastic consequences for the victims and institutions being defrauded. A phishing attack tries to create a believable environment for the intended victim to enter their confidential data such that the attacker can use or sell this information later. In order to apprehend phishers, law enforcement agencies need automated systems capable of tracking the size and...

chapter

Global optimization, Meta Clustering and consensus clustering for class prediction

I. Bifulco, C. Fedullo, F. Napolitano, G. Raiconi, more

2009 International Joint Conference on Neural Networks > 332 - 339

2009 International Joint Conference on Neural Networks (IJCNN 2009 - Atlanta)

Clustering of real-world data is often ill-posed. Because of noise and intrinsic ambiguity in data, optimization models attempting to maximize a fitness function can be misled by the assumption of uniqueness of the solution. In this work we present a methodology including classic and novel techniques to approach clustering in a systematic way, with two application examples to biological data sets...

chapter

Real-Time Freeway Traffic State Estimation Based on Cluster Analysis and Multiclass Support Vector Machine

Chao Deng, Fan Wang, Huimin Shi, Guozhen Tan

2009 International Workshop on Intelligent Systems and Applications > 1 - 4

2009 International Workshop on Intelligent Systems and Applications

Urban traffic state analysis plays an important role in the solution of traffic congestion problem. To estimate traffic state effectively is a foundational work for improving traffic condition and preventing traffic congestion. In this paper, a novel pattern-based approach is proposed to model the clustering and classification of traffic state. First, fuzzy-set clustering method is utilized to divide...

chapter

Regression Diagnostics for Multiple Model Step Data

A.A.M. Nurunnabi, M. Nasser

2009 International Conference on Digital Image Processing > 85 - 89

2009 International Conference on Digital Image Processing, ICDIP

In many vision and image problems there are multiple structures in a single data set and we need to identify the multiple models. To preserve most structures in presence of noise makes the estimation difficult. In such case for each structure, data which belong to other structures are also outliers in addition to the outliers for all the structures. Robust regression techniques are commonly used to...

chapter

K-Means Divide and Conquer Clustering

M. Khalilian, F.Z. Boroujeni, N. Mustapha, M.N. Sulaiman

2009 International Conference on Computer and Automation Engineering > 306 - 309

2009 International Conference on Computer and Automation Engineering. ICCAE 2009

Cluster analysis, primitive exploration with little or no prior knowledge, consists of research developed across a wide variety of communities. Most clustering techniques ignore the fact about the different size or levels - where in most cases, clustering is more concern with grouping similar objects or samples together ignoring the fact that even though they are similar, they might be of different...

chapter

The research of railway transport market subdivision based on data mining technology

Ya Tian, Jiajuan Chen, Xuebing Wang

2008 IEEE International Conference on Service Operations and Logistics, and Informatics > 1 > 147 - 152

2008 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI)

Facing to the characters of railway transport industry, parameters of railway freight market customer subdivision were presented; clustering model K-Means was established for market subdivision. The algorithm of K-Means was modified by using square error and objective function. Actual experiment of railway transport market subdivision is done based on Clementine 8.0 platform, and the features of various...

Keywords:
CLUSTERING ALGORITHMS
DATA MODELS

Publication date

Set your own date range

INFONA - science communication portal

Search results

Analysis of clustering algorithms in biological networks

Tracking feature relevance with the plaid model in continuously changing datasets

Deriving Cognitive Map Concepts on the Basis of Social Media Data Clustering

On fuzzy clustering for heavy-tailed data

An Optimization Model for Clustering Categorical Data Streams with Drifting Concepts

Implementing cluster analysis tool for the identification of students typologies

Prediction of the Syngas composition of a gasification process

Research and Implementation of an Anomaly Detection Model Based on Clustering Analysis

Marginal maximum likelihood estimation of single parameter logistic based on EM algorithm

Feature weighing for efficient clustering

Reducing dendrogram instability of features using rough set indiscernibility level

Using Incremental Fuzzy Clustering to Web Usage Mining

Research on Classification and Subdivision Model of Telecom Rural Channel Based on Clustering Analysis

Gaussian Mixture Models and Split-Merge Algorithm for parameter analysis of tracked video objects

Determining provenance in phishing websites using automated conceptual analysis

Global optimization, Meta Clustering and consensus clustering for class prediction

Real-Time Freeway Traffic State Estimation Based on Cluster Analysis and Multiclass Support Vector Machine

Regression Diagnostics for Multiple Model Step Data

K-Means Divide and Conquer Clustering

The research of railway transport market subdivision based on data mining technology

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options