In this paper we show how the technologies associated with the evolution of Cloud computing to Dew computing can contribute to advancing scientific computational productivity through automation. Amid current developments in the big data paradigm, there is a growing trend towards automating data mining and the other analytical processes involved in data science to increase the productivity of associated applications...
Dynamic data, if used properly, can bring huge benefits to humanity, science, and business. However, properties of dynamic data such as volume, velocity, variety, variation, and veracity render current methods of data analysis ineffective. Dynamic data analysis needs a fusion of data mining methods with those of machine learning. The k-means algorithm is one such method that has existed...
Today, every enterprise generates large volumes of high-dimensional data on a regular basis. Complex data mining and analysis techniques are needed to analyse this data feasibly. Feature selection aids in this by providing a reduced representation of the data while maintaining its integrity. We propose a graph-based feature selection algorithm utilizing feature intercorrelation to construct a weighted...
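The correlation-driven feature selection idea in this abstract can be illustrated with a minimal greedy sketch. This is not the paper's weighted-graph algorithm; the drop criterion (summed absolute Pearson correlation) and all names are illustrative assumptions:

```python
def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy) if sx and sy else 0.0

def select_features(columns, keep):
    """Greedy sketch: repeatedly drop the feature whose summed absolute
    correlation with the remaining features is largest (most redundant),
    until only `keep` features are left."""
    active = list(range(len(columns)))
    while len(active) > keep:
        redundancy = {i: sum(abs(pearson(columns[i], columns[j]))
                             for j in active if j != i)
                      for i in active}
        active.remove(max(redundancy, key=redundancy.get))
    return active
```

With one column an exact multiple of another, the redundant copy is dropped first while an uncorrelated column survives.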
Data quality plays an important role in modern intelligent information systems and is crucial to any data analysis task. Many imperfection-handling techniques avoid overfitting or simply remove offending portions of the data. Data correction, by contrast, can retain and recover as much information as possible from the original data resources. In this paper, we propose a novel technique based on polynomial...
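The general shape of polynomial-based correction (as opposed to deleting bad records) can be sketched with a degree-1 fit; the paper's actual polynomial degree and correction rule are not given here, so the threshold and single-pass strategy below are invented for illustration:

```python
def fit_line(xs, ys):
    """Least-squares line y = a*x + b via the closed-form normal equations."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    a = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    return a, (sy - a * sx) / n

def correct(ys, threshold=2.0):
    """Replace values that deviate from the fitted trend by more than
    `threshold` with the fitted value, keeping everything else intact."""
    xs = list(range(len(ys)))
    a, b = fit_line(xs, ys)
    return [a * x + b if abs(y - (a * x + b)) > threshold else y
            for x, y in zip(xs, ys)]
```

A corrupted point in an otherwise linear series is pulled back toward the trend while clean points are returned unchanged; an iterative refit after correction would tighten the estimate further.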
In today's world, large volumes of medical data are being continuously generated, but their value is severely undermined by our inability to translate them into knowledge and, ultimately, actions. Data mining techniques allow the extraction of previously unknown interesting patterns from large datasets, but their complexity limits their practical diffusion. Data-driven analysis is a multi-step process,...
Distributed data mining techniques, and distributed clustering in particular, have been widely used over the last decade because they deal with very large and heterogeneous datasets that cannot be gathered centrally. Current distributed clustering approaches normally generate global models by aggregating local results obtained on each site. While this approach mines the datasets at their locations...
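The aggregate-local-results pattern described above can be sketched minimally: each site reduces its data to a weighted summary, and a coordinator fuses nearby summaries. The one-centroid-per-site reduction and the greedy merge radius are simplifying assumptions, not the surveyed approaches themselves:

```python
def local_summary(points):
    """Each site reduces its data to one (centroid, weight) pair — a
    stand-in for a richer local clustering step."""
    n = len(points)
    return (tuple(sum(v) / n for v in zip(*points)), n)

def merge(summaries, eps):
    """Coordinator: greedily fuse local centroids closer than eps,
    weighting each fusion by the number of points represented."""
    merged = []
    for c, w in summaries:
        for i, (mc, mw) in enumerate(merged):
            if sum((a - b) ** 2 for a, b in zip(c, mc)) ** 0.5 < eps:
                total = mw + w
                merged[i] = (tuple((a * mw + b * w) / total
                                   for a, b in zip(mc, c)), total)
                break
        else:
            merged.append((c, w))
    return merged
```

Two sites holding overlapping regions collapse into one weighted global cluster, while a distant site stays separate — no raw data ever leaves a site.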
The characterization of optimization problems over continuous parameter spaces plays an important role in optimization. A form of “fitness landscape” analysis is often carried out to describe the problem space in terms of modality, smoothness and variable separability. The outcomes of this analysis can then be used as a measure of problem difficulty and to predict the behaviour of a given algorithm...
With the advent of modern techniques for scientific data collection, large quantities of data are accumulating in various databases. Systematic data analysis methods are necessary to extract useful information from these rapidly growing data banks. Cluster analysis is one of the major data mining methods, and the k-means clustering algorithm is widely used in many practical applications. But the...
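For reference, the standard k-means loop this abstract builds on can be stated in a few lines (random initialization and a fixed iteration budget here are simplifications; production variants use smarter seeding and convergence tests):

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means: alternate nearest-centroid assignment and
    centroid update. `points` is a list of numeric tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            d = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            clusters[d.index(min(d))].append(p)
        # Update step: move each centroid to its cluster mean
        # (an empty cluster keeps its previous centroid).
        for i, cl in enumerate(clusters):
            if cl:
                centroids[i] = tuple(sum(v) / len(cl) for v in zip(*cl))
    return centroids
```

On two well-separated groups the centroids converge to the group means regardless of which points seed them.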
Multidimensional sequences are common, and measuring their similarity is a key to any analysis of such data. There is a wealth of similarity measures for sequences in the literature, but most of them are designed for a special type of sequence and later extended to more general types. These extensions are usually ad hoc, and the extended versions may lose the original conceptual interpretation of...
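One widely used similarity measure that does generalize cleanly to multidimensional sequences is dynamic time warping with a vector point cost; the sketch below is a standard formulation, offered as context rather than as this paper's proposal:

```python
def dtw(a, b):
    """Dynamic time warping distance between two sequences of
    equal-dimension vectors, with Euclidean cost between points."""
    def cost(p, q):
        return sum((x - y) ** 2 for x, y in zip(p, q)) ** 0.5
    n, m = len(a), len(b)
    INF = float("inf")
    d = [[INF] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Best of: step in a, step in b, or step in both.
            d[i][j] = cost(a[i - 1], b[j - 1]) + min(
                d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]
```

A time-stretched copy of a sequence has distance zero, which is exactly the invariance that ad hoc extensions of scalar measures often lose.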
The paper presents a multi-density clustering algorithm based on the grid adjacency relation (GAMD), using the data distribution characteristics within units, which are reflected by the unit density and the center of mass. To determine unit boundaries, the algorithm measures the similarity between units by their relative density and the relative distance between their centers of mass. Goodness of fit is proposed...
The concept lattice is a new mathematical tool for data analysis and knowledge processing. Attribute reduction is very important in concept lattice theory because it makes the discovery of implicit knowledge in data easier and its representation simpler. In this paper the reduction of the concept lattice is investigated. First, we present a close-degree of concepts to measure the close-degree...
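As background for attribute reduction, the formal concepts of a context can be enumerated by brute force: an extent is the set of objects sharing an intent, and an intent is the set of attributes common to that extent. This exponential sketch only illustrates the definition, not the paper's reduction method:

```python
from itertools import combinations

def concepts(objects, attrs, incidence):
    """All formal concepts (extent, intent) of a context, where
    `incidence` is a set of (object, attribute) pairs."""
    found = set()
    for r in range(len(attrs) + 1):
        for subset in combinations(attrs, r):
            # Objects having every attribute in the subset...
            extent = frozenset(o for o in objects
                               if all((o, a) in incidence for a in subset))
            # ...and the attributes shared by all of those objects.
            intent = frozenset(a for a in attrs
                               if all((o, a) in incidence for o in extent))
            found.add((extent, intent))
    return found
```

A three-object, two-attribute context already yields a small lattice of four concepts, and reducing attributes amounts to finding a subset that preserves this structure.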
Identifying outliers is a difficult task in data mining. We adopt the notion of deviants for outliers in data streams: deviants are the data points whose removal from the data sequence minimizes the sum of squared errors (SSE). We present the DDA algorithm to detect deviants over massive data streams. With this algorithm, the histogram can determine the deviants more accurately and greatly reduce error.
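The deviant definition above — points whose removal most reduces SSE — admits a simple greedy batch sketch. DDA itself works over streaming histograms; the exhaustive per-point recomputation below is only to make the definition concrete, and assumes fewer deviants than data points:

```python
def sse(xs):
    """Sum of squared deviations of xs from its own mean."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs)

def find_deviants(xs, k):
    """Greedy sketch: repeatedly drop the point whose removal shrinks
    the SSE of the remainder the most; the dropped points are the
    deviants (requires k < len(xs))."""
    remaining, deviants = list(xs), []
    for _ in range(k):
        best = min(range(len(remaining)),
                   key=lambda i: sse(remaining[:i] + remaining[i + 1:]))
        deviants.append(remaining.pop(best))
    return deviants
```

A single large spike is found first, then the next most extreme value, matching the intuition that removing deviants leaves a tight, low-error summary.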
In this paper, we present a scalable evolutionary algorithm for clustering large and dynamic data sets, called Scalable Evolutionary Clustering with Self Adaptive Genetic Operators (Scalable ECSAGO). The proposed evolutionary clustering algorithm can adapt its genetic operators rate while the evolution leads to the optimal centers of the clusters. The sizes of the clusters are estimated using a hybrid...
Skyline queries attract more and more attention from academia and industry because of their applications in multi-criteria decision support, preference answering, and data analysis. However, it seems unnecessary to recommend all services in the skyline when the number of skyline points is large. The number of services in the skyline is always large because comparability decreases with the...
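For context, the skyline is the set of points not dominated by any other point; the block-nested-loops sketch below is the textbook baseline (minimizing every dimension, distinct points assumed), not this paper's recommendation scheme:

```python
def dominates(p, q):
    """p dominates q if p is no worse in every dimension and strictly
    better in at least one (smaller values are better here)."""
    return (all(a <= b for a, b in zip(p, q))
            and any(a < b for a, b in zip(p, q)))

def skyline(points):
    """Keep the points no other point dominates (assumes distinct points)."""
    return [p for p in points
            if not any(dominates(q, p) for q in points if q != p)]
```

With more dimensions, fewer pairs are comparable under `dominates`, so the skyline grows — exactly the size problem the abstract raises.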
Computer forensics is the collection and processing of the traces an offender leaves in a computer or network system, so that they can serve as legally binding evidence in court proceedings and suspects can be brought to justice. It mainly includes data protection, data collection, data analysis, and evidence presentation; among these processes, data analysis is the key to computer...
Selecting informative genes from microarray gene expression data is the most important task when performing analysis on such large amounts of data. Mining genes that have regulatory relations among thousands of genes is essential. To meet this need, a number of methods have been proposed from various points of view. However, most existing methods focus solely on gene expression values themselves without...
Early decision tree algorithms such as ID3, C4.5, and CART no longer meet the demands of massive data analysis. These algorithms share the same limitations: they cannot handle dynamically updated data sets, and the decision trees they generate need to be pruned. These weaknesses limit the use of the above-mentioned algorithms. So a novel parallel decision...
This paper introduces a novel incremental approach to clustering uncertain categorical data. This so-called Incremental K Belief K-modes Method (IK-BKM) extends the Belief K-modes method to update the cluster partition when new information is available, namely an increase in the desired number of final clusters. The main objective is to update the cluster partition without complete reclustering. Our method will...
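The plain (non-incremental, non-belief) K-modes baseline that IK-BKM extends replaces k-means centroids with per-attribute modes and Euclidean distance with mismatch counts. The naive first-k initialization below is an illustrative simplification:

```python
from collections import Counter

def hamming(a, b):
    """Number of mismatching attribute values between two rows."""
    return sum(x != y for x, y in zip(a, b))

def kmodes(rows, k, iters=10):
    """K-modes sketch for categorical rows: like k-means, but cluster
    centers are per-attribute modes and distance is mismatch count.
    Naive init: the first k rows (assumed distinct)."""
    modes = [list(rows[i]) for i in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for r in rows:
            d = [hamming(r, m) for m in modes]
            clusters[d.index(min(d))].append(r)
        for i, cl in enumerate(clusters):
            if cl:
                # New center: the most frequent value in each column.
                modes[i] = [Counter(col).most_common(1)[0][0]
                            for col in zip(*cl)]
    return modes
```

An incremental extension such as IK-BKM would adjust these modes when new rows or a larger target k arrive, instead of rerunning the loop from scratch.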
Clustering is a method of unsupervised learning and a common technique for statistical data analysis used in many fields, including machine learning, data mining, pattern recognition, image analysis, and bioinformatics. In this paper, a novel algorithm based on clustering to extract rules from neural networks is proposed. After neural networks have been trained and pruned successfully, inner rules are generated by...
Segmentation aims to separate homogeneous areas from sequential data and plays a central role in data mining. It has applications ranging from finance to molecular biology, where bioinformatics tasks such as genome data analysis are active application fields. In this paper, we present a novel application of segmentation to locating genomic regions with coexpressed genes. We aim at automated discovery...
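The classical formulation behind such segmentation tasks — split a sequence into k contiguous homogeneous pieces — can be solved exactly by dynamic programming. The within-segment SSE cost below is a common choice, assumed here for illustration rather than taken from the paper:

```python
def seg_cost(xs, i, j):
    """SSE of the slice xs[i:j] around its own mean."""
    seg = xs[i:j]
    m = sum(seg) / len(seg)
    return sum((x - m) ** 2 for x in seg)

def segment(xs, k):
    """Optimal k-segmentation by dynamic programming: split xs into k
    contiguous segments minimizing total within-segment SSE; returns
    the list of segment start indices."""
    n = len(xs)
    INF = float("inf")
    cost = [[INF] * (k + 1) for _ in range(n + 1)]
    back = [[0] * (k + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for j in range(1, k + 1):
        for end in range(1, n + 1):
            for start in range(j - 1, end):
                c = cost[start][j - 1] + seg_cost(xs, start, end)
                if c < cost[end][j]:
                    cost[end][j], back[end][j] = c, start
    # Recover segment boundaries by walking the back-pointers.
    bounds, end = [], n
    for j in range(k, 0, -1):
        end = back[end][j]
        bounds.append(end)
    return bounds[::-1]
```

On a step-shaped series the recovered boundary lands exactly at the level change, which is the behaviour one wants when hunting for homogeneous genomic regions.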