Search results

Items from 1 to 20 out of 62 results

chapter

An effective method determining the initial cluster centers for K-means for clustering gene expression data

Deniz Tanir, Fidan Nuriyeva

2017 International Conference on Computer Science and Engineering (UBMK) > 751 - 754

2017 International Conference on Computer Science and Engineering (UBMK)

Clustering is an important tool for analyzing gene expression data. Many clustering algorithms have been proposed for the analysis of gene expression data. In this article we have clustered real life gene expression data via K-Means which is one of clustering algorithms. Also, we have proposed a new method determining the initial cluster centers for K-means. We have compared results of our method...

chapter

A Data Science and Engineering Solution for Fast K-Means Clustering of Big Data

Karl E. Dierckens, Adrian B. Harrison, Carson K. Leung, Adrienne V. Pind

2017 IEEE Trustcom/BigDataSE/ICESS > 925 - 932

2017 IEEE Trustcom/BigDataSE/ICESS

With advances in technology, high volumes of a wide variety of valuable data of different veracity can be easily collected or generated at a high velocity in the current era of big data. Embedded in these big data are implicit, previously unknown and potentially useful information. Hence, fast and scalable big data science and engineering solutions that mine and discover knowledge from these big data...

chapter

An analysis of random forest algorithm based network intrusion detection system

Yi Yi Aung, Myat Myat Min

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD) > 127 - 132

2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD)

In the world today, the security of the computer system is of great importance, And in the last few years, there have seen an affected growth in the amount of intrusions that intrusion detection has become the dominant of current information security. Firewalls cannot provide complete protection. Applying on a firewall system alone is not enough to prevent a corporate network from all types of network...

chapter

Expert system for retrieval of documents using evolutionary approaches incorporating clustering

Sharvari Deshpande, Monika Doke, Aishwarya Deshpande, Anagha N. Chaudhari

2017 International conference of Electronics, Communication and Aerospace Technology (ICECA) > 2 > 414 - 418

2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)

Classification is a central problem in the fields of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classifier) that can be used to predict the class of new unlabelled instances. Data preparation is crucial to the data mining process, and its focus is to improve the fitness of the training data for the learning algorithms to produce more...

chapter

Comparison of applications for educational data mining in Engineering Education

Diego Buenano Fernandez, Sergio Lujan-Mora

2017 IEEE World Engineering Education Conference (EDUNINE) > 81 - 85

2017 IEEE World Engineering Education Conference (EDUNINE)

Currently there are many techniques based on information technology and communication aimed at assessing the performance of students. Data mining applied in the educational field (educational data mining) is one of the most popular techniques that are used to provide feedback with regard to the teaching-learning process. In recent years there have been a large number of open source applications in...

chapter

Density-based spatial clustering of application with noise algorithm for the classification of solar radiation time series

Benmouiza Khalil, Cheknane Ali

2016 8th International Conference on Modelling, Identification and Control (ICMIC) > 279 - 283

2016 8th International Conference on Modelling, Identification and Control (ICMIC)

The study of the dynamic behaviour of the solar radiation is a very important task for PV system efficiency. Hence, we propose in this paper, a time series data mining method to detect the underlying dynamic presents in hourly solar radiation time series. Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is used to cluster the solar radiation time series and detect noisy data. Moreover,...

chapter

Classification of association rules based on K-means algorithm

Azzeddine Dahbi, Mohamed Mouhir, Youssef Balouki, Taoufiq Gadi

2016 4th IEEE International Colloquium on Information Science and Technology (CiSt) > 300 - 305

2016 4th IEEE International Colloquium on Information Science and Technology (CIST)

Association rule mining is one of the most relevant techniques in data mining, aiming to extract correlation among sets of items or products in transactional databases. The huge number of association rules extracted represents the main obstacle that a decision maker faces. Hence, many interestingness measures have been proposed to evaluate the association rules. However, the abundance of these measures...

chapter

Crime prediction and forecasting in Tamilnadu using clustering approaches

S. Sivaranjani, S. Sivakumari, M. Aasha

2016 International Conference on Emerging Technological Trends (ICETT) > 1 - 6

2016 International Conference on Emerging Technological Trends (ICETT)

Crime is one of the most predominant and alarming aspects in our society and its prevention is a vital task. Crime analysis is a systematic way of detecting and investigating patterns and trends in crime. In this work, we use various clustering approaches of data mining to analyse the crime data of Tamilnadu. The crime data is extracted from National Crime Records Bureau (NCRB) of India. It consists...

chapter

Disease prediction using hybrid K-means and support vector machine

Sandeep Kaur, Sheetal Kalra

2016 1st India International Conference on Information Processing (IICIP) > 1 - 6

2016 1st India International Conference on Information Processing (IICIP)

Medical data mining is one of the significant research field as medical organizations produce large volume of data on daily basis. Handling this vast amount of data in medical field is challenging, so there is a need to mine this data in order to extract useful patterns for disease prediction. A hybrid K-means and Support Vector Machine algorithm for disease prediction is proposed in this paper. The...

chapter

A novel sample weighting K-means clustering algorithm based on angles information

Lei Gu

2016 International Joint Conference on Neural Networks (IJCNN) > 3697 - 3702

2016 International Joint Conference on Neural Networks (IJCNN)

One identical weighting scheme for each sample of one cluster is often employed in the traditional sample weighting k-means clustering. However, this paper proposes a novel sample weighting k-means clustering algorithm based on angles information(SWKMA). In this presented SWKMA, firstly, samples of one cluster is divided into two types according to angles information, and secondly, different weighting...

chapter

A bi-directional sampling based on K-means method for imbalance text classification

Jia Song, Xianglin Huang, Sijun Qin, Qing Song

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS) > 1 - 5

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS)

This paper studies the imbalanced data classifycation problem and proposes bi-directional sampling based on clustering (BDSK) for the imbalanced data classification. This algorithm combines SMOTE over-sampling algorithm and under-sampling algorithm based on K-Means to solve the within-class imbalance problem and the between-class imbalance problem. It not only avoid induce too much noise but also...

chapter

Recognition and anticipation of cancer and non-cancer prophecy using data mining approach

R. Kaviarasi, A. Valarmathi

2016 International Conference on Emerging Trends in Engineering, Technology and Science (ICETETS) > 1 - 4

2016 International Conference on Emerging Trends in Engineering, Technology and Science (ICETETS)

Lung cancer is the number one cause of cancer deaths in both men and women in the worldwide. The two types of lung cancer, which grow and spread differently, are the small cell lung cancers (SCLC) and non-small cell lung cancers (NSCLC). Treatment of lung cancer can involve a combination of surgery, chemotherapy, and radiation therapy as well as newer experimental methods. The general prognosis of...

chapter

A classification method to classify high dimensional data

Amit Gupta, Naganna Chetty, Shraddha Shukla

2015 International Conference on Computing, Communication and Security (ICCCS) > 1 - 6

2015 International Conference on Computing, Communication and Security (ICCCS)

The rapid computerization and advancement in the technology has led to huge amount of data in the databases. Research has shown that the amount of data in the world doubles in every 20 months. However, this available data consists of large number of noise values and thus, cannot be directly used. The extraction of information from the vast pool of data has emerged a major challenge.

chapter

Performance evaluation of enhanced hierarchical and partitioning based clustering algorithm (EPBCA) in data mining

Gurpreet Singh, Jaskaranjit Kaur, Yusuf Mulge

2015 International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT) > 805 - 810

2015 International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT)

Clustering is a way of combining data objects or data points into disjoint cluster. The basic concept behind clustering is that the data objects in the same clusters should be related to each other and the data objects belonging to different clusters should differ from each other. This research paper proposes a new algorithm which combines the features of K-means clustering algorithm and Hierarchical...

chapter

Improvements the HANN-L2F for classification by using k-means

Jirawat Teyakome, Narissara Eiamkanitchat

2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE) > 621 - 625

2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE)

This paper presents the improved algorithm for the Hybrid Approach of Neural network and Level-2 Fuzzy set (HANN-L2F). The main structure is including 2 parts. The first part is Neuro-Fuzzy system, including the MLP Neural network with the combination of the level-2 Fuzzy system. The second part is using k-nearest neighbor to classify the output from Neuro-fuzzy. The HANN-L2F is an algorithm with...

chapter

Application of clustering algorithm on TV programmes preference grouping of subscribers

Haiyue Zhang, Jianping Chai, Yan Wang, Min An, more

2015 IEEE International Conference on Computer and Communications (ICCC) > 40 - 44

2015 IEEE International Conference on Computer and Communications (ICCC)

With the development of digital cable interactive business and the diversification of the customers' demand, grouping TV programmes based on preferences of users effectively is vital for market segmentation and differentiation. The study summarizes the main principle and characteristic of clustering algorithm, and uses K-Means algorithm to show TV programmes preference grouping based on 52392 subscribers...

chapter

Fetal state classification from cardiotocography based on feature extraction using hybrid K-Means and support vector machine

Nurul Chamidah, Ito Wasito

2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS) > 37 - 41

2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)

Cardiotocography (CTG) records fetal heart rate (FHR) signal and intra uterine pressure (IUP) simultaneously. CTG are widely used for diagnosing and evaluates pregnancy and fetus condition until before delivery. The high dimension of CTG data are the problem for classification computation, by extracting feature we can get the useful information from CTG data, and in this research, K-Means Algorithm...

chapter

Research and improve on K-means algorithm based on hadoop

Kehe Wu, Wenjing Zeng, Tingting Wu, Yanwen An

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 334 - 337

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

With the advent of the big data era, traditional data mining algorithm becomes incompetent for the task of massive data analysis, management and mining. The development of cloud computing brings new life to algorithm parallelization. In this paper, we have studied the K-means algorithm, one of the clustering algorithm. Then we attempt to improves this algorithm via the method that sample the large-scale...

chapter

KBB: A hybrid method for intrusion detection

Shreya Dubey, Jigyasu Dubey

2015 International Conference on Computer, Communication and Control (IC4) > 1 - 6

2015 International Conference on Computer, Communication and Control (IC4)

In this paper, we propose a hybrid method for intrusion detection which is based on k-means, naive-bayes and back propagation neural network (KBB). Initially we apply k-means which is partition-based, unsupervised cluster analysis method. In the form of clusters, we attain the gathered data which can be easily processed and learned by any machine learning algorithm. These outcomes are provided to...

chapter

Enhancement of online web recommendation system using a hybrid clustering and pattern matching approach

Hiral Y. Modi, Meera Narvekar

2015 International Conference on Nascent Technologies in the Engineering Field (ICNTE) > 1 - 6

2015 International Conference on Nascent Technologies in the Engineering Field (ICNTE)

The rise in amount of information over internet in last few years has caused the growing risk of information flooding which in turn has created the problem of accessing relevant data to the users. Also with the hike in number of websites and web pages, webmasters find it challenging to formulate the content in accordance with the user's need. The information demand of the online users can be figured...

Keywords:
DATA MINING
CLASSIFICATION ALGORITHMS
K-MEANS

Publication date

Set your own date range

Content availability

Available (61)
None (1)

Keywords

CLUSTERING ALGORITHMS (56)
ALGORITHM DESIGN AND ANALYSIS (33)
CLUSTERING (21)
PARTITIONING ALGORITHMS (20)
PATTERN CLUSTERING (20)
CLASSIFICATION (9)
DATABASES (9)
ACCURACY (7)
INTRUSION DETECTION (7)
DATA MODELS (6)
PATTERN CLASSIFICATION (5)
TRAINING (5)
CLUSTERING METHODS (4)
COMPLEXITY THEORY (4)
COMPUTERS (4)
DECISION TREES (4)
FEATURE EXTRACTION (4)
FUZZY C-MEANS (4)
INDEXES (4)
INTERNET (4)
K-MEANS ALGORITHM (4)
PREDICTION ALGORITHMS (4)
SECURITY OF DATA (4)
SUPPORT VECTOR MACHINES (4)
ARTIFICIAL NEURAL NETWORKS (3)
BIOINFORMATICS (3)
CLUSTERING TECHNIQUES (3)
COMPUTER SCIENCE (3)
DATA CLUSTERING (3)
FEATURE SELECTION (3)
FUZZY SET THEORY (3)
GENETIC ALGORITHMS (3)
HEURISTIC ALGORITHMS (3)
IRIS (3)
K-MEANS CLUSTERING ALGORITHM (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
MACHINE LEARNING (3)
MOBILE COMMUNICATION (3)
MOBILE HANDSETS (3)
UNSUPERVISED LEARNING (3)
ANOMALY DETECTION (2)
BREAST CANCER (2)
CLUSTER ANALYSIS (2)
COMPUTATIONAL COMPLEXITY (2)
CONTEXT (2)
CUSTOMER RELATIONSHIP MANAGEMENT (2)
DATA HANDLING (2)
DATABASE MANAGEMENT SYSTEMS (2)
DBSCAN (2)
DIMENSION REDUCTION (2)
DISEASES (2)
DOCUMENT CLUSTERING (2)
EDUCATIONAL INSTITUTIONS (2)
GENE EXPRESSION (2)
GENETIC ALGORITHM (2)
GENETICS (2)
INFORMATION TECHNOLOGY (2)
INTRUSION DETECTION SYSTEM (2)
ITEMSETS (2)
MACHINE LEARNING ALGORITHMS (2)
MOBILE COMPUTING (2)
OPTIMIZATION (2)
PREPROCESSING (2)
PROBABILITY DENSITY FUNCTION (2)
SEARCH ENGINES (2)
SIGNAL PROCESSING ALGORITHMS (2)
SPATIAL DATABASES (2)
STABILITY ANALYSIS (2)
TEXT ANALYSIS (2)
UNSUPERVISED FEATURE SELECTION (2)
WAVELET TRANSFORMS (2)
WEKA TOOL (2)
2D WAVELET TRANSFORM (1)
ABSTRACTING (1)
AGGLOMERATIVE (1)
ALGORITHM FLOW (1)
ALGORITHM OUTPUT GRANULARITY (1)
ANGLES INFORMATION (1)
ANOMALY DETECTION MODEL (1)
ANOMALY INTRUSION DETECTION MODEL (1)
ANT CLUSTERING ALGORITHM (1)
ANT CLUSTERING ALGORITHM OPTIMIZATION (1)
ANT COLONY OPTIMIZATION (ACO) (1)
APRIORI (1)
ARRAYS (1)
ARTIFICIAL BEE COLONY (ABC) (1)
ARTIFICIAL INTELLIGENCE (1)
ASSOCIATION RULES MINING (1)
AUTHENTICATION (1)
AUTOMATIC IDENTIFICATION (1)
AUTOMATION (1)
AUTOMOBILES (1)
AUTONOMOUS DATA SOURCE (1)
BACK PROPAGATION NEURAL NETWORK (1)
BACKGROUND VOCABULARY SUPPORT (1)
BIDIRECTIONAL CONTROL (1)
BIG DATA (1)
more

INFONA - science communication portal

Search results

An effective method determining the initial cluster centers for K-means for clustering gene expression data

A Data Science and Engineering Solution for Fast K-Means Clustering of Big Data

An analysis of random forest algorithm based network intrusion detection system

Expert system for retrieval of documents using evolutionary approaches incorporating clustering

Comparison of applications for educational data mining in Engineering Education

Density-based spatial clustering of application with noise algorithm for the classification of solar radiation time series

Classification of association rules based on K-means algorithm

Crime prediction and forecasting in Tamilnadu using clustering approaches

Disease prediction using hybrid K-means and support vector machine

A novel sample weighting K-means clustering algorithm based on angles information

A bi-directional sampling based on K-means method for imbalance text classification

Recognition and anticipation of cancer and non-cancer prophecy using data mining approach

A classification method to classify high dimensional data

Performance evaluation of enhanced hierarchical and partitioning based clustering algorithm (EPBCA) in data mining

Improvements the HANN-L2F for classification by using k-means

Application of clustering algorithm on TV programmes preference grouping of subscribers

Fetal state classification from cardiotocography based on feature extraction using hybrid K-Means and support vector machine

Research and improve on K-means algorithm based on hadoop

KBB: A hybrid method for intrusion detection

Enhancement of online web recommendation system using a hybrid clustering and pattern matching approach

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options