Search results

chapter

Cut Tree Construction from Massive Graphs

Takuya Akiba, Yoichi Iwata, Yosuke Sameshima, Naoto Mizuno, more

2016 IEEE 16th International Conference on Data Mining (ICDM) > 775 - 780

2016 IEEE 16th International Conference on Data Mining (ICDM)

The construction of cut trees (also known as Gomory-Hu trees) for a given graph enables the minimum-cut size of the original graph to be obtained for any pair of vertices. Cut trees are a powerful back-end for graph management and mining, as they support various procedures related to the minimum cut, maximum flow, and connectivity. However, the crucial drawback with cut trees is the computational...

chapter

Efficient Mining Algorithm of Frequent Itemsets for Uncertain Data Streams

Wang Qianqian, Liu Fang-Ai

2016 9th International Symposium on Computational Intelligence and Design (ISCID) > 2 > 443 - 446

2016 9th International Symposium on Computational Intelligence and Design (ISCID)

With the rapid development of computer technology, web services has been widely used. In these applications, the uncertain data is in the form of streams. In view of this kind of situation, present a new generalized data structure, that is, PSUF - tree, to store uncertain data streams, all itemsets in recent window are contained in global PStree in a condensed format, establish a header table in which...

chapter

Fastbit-radix sort: Optimized version of radix sort

Anthony Vinay Kumar S, Arti Arya

2016 11th International Conference on Computer Engineering & Systems (ICCES) > 305 - 312

2016 11th International Conference on Computer Engineering & Systems (ICCES)

Sorting is applied in daily life from ordering simple lists to real world applications. Sorting presents the data in an ordered fashion which helps in analysis or allows computing data faster. Radix sort is a non-comparative integer sorting algorithm that sorts in a linear time complexity. Radix sort performs modulus operation on each data to extract the digits at a specific position and maintain...

chapter

Top-k utility-based gene regulation sequential pattern discovery

Morteza Zihayat, Heidar Davoudi, Aijun An

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 266 - 273

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Sequential pattern mining has been used in bioinformatics to discover frequent gene regulation sequential patterns based on time course microarray datasets. While mining frequent sequences are important in biological studies for disease treatment, to date, most of the approaches do not consider the importance of the genes with respect to a disease being studied when identifying gene regulation sequential...

chapter

A graph based Feature Selection algorithm utilizing attribute intercorrelation

Arinjoy Basak, Asit Kr. Das

2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) > 1 - 9

2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)

Recently, every enterprise generates large volumes of high dimensional data on a regular basis. Complex data mining and analysis techniques are used to feasibly analyse this data. Feature selection aids in this by providing a reduced representation of this data while maintaining integrity. We propose a graph-based feature selection algorithm utilizing feature intercorrelation to construct a weighted...

chapter

Frequent itemsets mining using random walks for record insertion and deletion

Panita Thusaranon, Worapoj Kreesuradej

2016 8th International Conference on Information Technology and Electrical Engineering (ICITEE) > 1 - 6

2016 8th International Conference on Information Technology and Electrical Engineering (ICITEE)

In Association rules mining, the task of finding frequent itemsets in dynamic database is very important because the updates may not only invalidate some existing rules but also make other rules relevant. In this paper, we propose a new algorithm to maintain frequent itemsets of a dynamic database in the case of record insertion as well as deletion simultaneously. Basically, the proposed algorithm...

chapter

An Empirical Study on the Characteristics of Python Fine-Grained Source Code Change Types

Wei Lin, Zhifei Chen, Wanwangying Ma, Lin Chen, more

2016 IEEE International Conference on Software Maintenance and Evolution (ICSME) > 188 - 199

2016 IEEE International Conference on Software Maintenance and Evolution (ICSME)

Software has been changing during its whole life cycle. Therefore, identification of source code changes becomes a key issue in software evolution analysis. However, few current change analysis research focus on dynamic language software. In this paper, we pay attention to the fine-grained source code changes of Python software. We implement an automatic tool named PyCT to extract 77 kinds of fine-grained...

article

Online Learning from Trapezoidal Data Streams

Qin Zhang, Peng Zhang, Guodong Long, Wei Ding, more

IEEE Transactions on Knowledge and Data Engineering > 2016 > 28 > 10 > 2709 - 2723

In this paper, we study a new problem of continuous learning from doubly-streaming data where both data volume and feature space increase over time. We refer to the doubly-streaming data as trapezoidal data streams and the corresponding learning problem as online learning from trapezoidal data streams. The problem is challenging because both data volume and data dimension increase over time, and existing...

chapter

PPRA: A new pre-fetching and prediction based replication algorithm in data grid

Mahsa Beigrezaei, Abolfazle Toroghi Haghighat, Mohamd Reza Meybodi, Maryam Runiassy

2016 6th International Conference on Computer and Knowledge Engineering (ICCKE) > 257 - 262

2016 6th International Conference on Computer and Knowledge Engineering (ICCKE)

Today, scientific and business applications generate huge amounts of data. Users of data grid, who are distributed all over the grid geographically, need such data. So ensuring the access to this distributed data efficiently is one of the most important challenges in Data grid network. Data replication algorithms are known as the most common method used to overcome this problem. They distribute several...

chapter

Combining Static and Dynamic Features for Multivariate Sequence Classification

Anna Leontjeva, Ilya Kuzovkin

2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA) > 21 - 30

2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

Model precision in a classification task is highly dependent on the feature space that is used to train the model. Moreover, whether the features are sequential or static will dictate which classification method can be applied as most of the machine learning algorithms are designed to deal with either one or another type of data. In real-life scenarios, however, it is often the case that both static...

chapter

Keyframe Extraction from Motion Capture Data for Visualization

Yang Yang, Lanling Zeng, Howard Leung

2016 International Conference on Virtual Reality and Visualization (ICVRV) > 154 - 157

2016 International Conference on Virtual Reality and Visualization (ICVRV)

In this paper, we propose a novel method to extract keyframes from motion capture data for people to better visualize and understand the content of the motion. It first applies a Butterworth filter to remove the noise in the motion capture data, then carries out principal component analysis (PCA) to reduce the dimension. By detecting the zero-crossing points of the velocity in the principal components,...

chapter

Design and implementation of ACO feature selection algorithm for data stream mining

Shivani Harde, Vaishali Sahare

2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT) > 1047 - 1051

2016 International Conference on Automatic Control and Dynamic Optimization Techniques (ICACDOT)

Big data confront many technical challenges that also confront by both academic research communities and commercial IT deployment. Data streams with the curse of dimensionality are founded to be the root sources of Big Data. The commonly used procedure for data sourced from data streams is continuously making batch based model and inducing algorithms which is infeasible for real-time data mining....

chapter

Updating P-dominated and P-dominating sets based on a classification tree

Zhiyong Hong

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 950 - 954

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

Dynamic updating knowledge is a hot study issue in data mining. This paper proposed a method for updating P-dominated and P-dominating sets of Dominance-based Rough Sets Approach (DRSA). Some examples are employed to validate our approach. These examples showed that our approach can simplify computation by avoiding unnecessary computing steps.

chapter

An incremental algorithm for rapidly computing tolerance class of incomplete information system

Mianwei Ding, Tengfei Zhang, Fumin Ma, Dong Yue

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 1296 - 1300

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

The tolerance class is a basic concept in rough set for incomplete information systems. The effective computation of tolerance class is vital for improving the performance of knowledge reduction and other related tasks. For the purpose of speeding up the tolerance class calculation, an improved static algorithm is developed firstly, followed by a novel incremental algorithm, which can update rapidly...

chapter

Adaptive robust models for identification of nonstationary systems in data stream mining tasks

Yevgeniy Bodyanskiy, Olena Vynokurova, Zdzislaw Szymanski, Ilya Kobylin, more

2016 IEEE First International Conference on Data Stream Mining & Processing (DSMP) > 263 - 268

2016 IEEE First International Conference on Data Stream Mining & Processing (DSMP)

In the paper the adaptive robust models for adaptive identification of nonstationary systems are proposed. These proposed models can be used for solving Dynamical Data and Data Stream Mining tasks. These adaptive robust models are characterized by computational simplicity and high speed operation that allow the signal processing in on-line mode.

chapter

Aging data in dynamic graphs: A comparative study

Anita Zakrzewska, David A Bader

2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) > 1055 - 1062

2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

Dynamic graphs are used to represent changing relational data. In order to create a dynamic graph representing relationships or interactions over time, it is necessary to choose a method of adding new data and removing, or otherwise de-emphasizing, past data to decrease its influence. In particular, the question of aging edges is new to dynamic graphs and has not been thoroughly studied. In this work,...

chapter

Association Rules Discovery via Approximate Method from Probabilistic Database

Lihai Nie, Zhiyang Li, Wenyu Qu

2016 IEEE Trustcom/BigDataSE/ISPA > 909 - 914

2016 IEEE Trustcom/BigDataSE/ISPA

Association rules and frequent patterns discovery is always a hot topic in database communities. As real data is often affected by noise, in this paper, we study to find frequent patterns and generate association rules over probabilistic database under the Possible World Semantics. This is technically challenging, since a probabilistic database can have an exponential number of possible worlds. Although...

chapter

Comparative analysis of association rule mining algorithms

S. Vijayarani, S. Sharmila

2016 International Conference on Inventive Computation Technologies (ICICT) > 3 > 1 - 6

2016 International Conference on Inventive Computation Technologies (ICICT)

Data mining is one of the significant research domains in the field of computer science and it is defined as the extraction of hidden knowledge from the large data repositories. Important data mining techniques are classification, clustering, association rule generation, summarization, time series analysis and etc. Association rule is used to determine frequent patterns, association and correlations...

chapter

Malware classification method based on sequence of traffic flow

Hyoyoung Lim, Yukiko Yamaguchi, Hajime Shimada, Hiroki Takakura

2015 International Conference on Information Systems Security and Privacy (ICISSP) > 1 - 8

2015 International Conference on Information Systems Security and Privacy (ICISSP)

Network-based malware classification plays an important role in improving system security than system-based malware classification. The vast majority of malware needs a network activity in order to accomplish its purpose (e.g., downloading malware, connecting to a C&C server, etc.). Many malware classification approaches based on network behavior have thus been proposed. Nevertheless, they merely...

chapter

Mining the big data of residential appliances in the smart grid environment

Jiajia Yang, Junhua Zhao, Fushuan Wen, Weicong Kong, more

2016 IEEE Power and Energy Society General Meeting (PESGM) > 1 - 5

2016 IEEE Power and Energy Society General Meeting (PESGM)

Based on the dynamic time warping (DTW) matching method, a novel appliance identification algorithm for low frequency sampling load data is proposed. First, residential load sequences are segmented into subsequences composed of single appliance load profiles and multi-appliance load profiles. Then, reference load sequences of all candidate appliances, which have identical lengths, are generated before...

INFONA - science communication portal

Search results

Cut Tree Construction from Massive Graphs

Efficient Mining Algorithm of Frequent Itemsets for Uncertain Data Streams

Fastbit-radix sort: Optimized version of radix sort

Top-k utility-based gene regulation sequential pattern discovery

A graph based Feature Selection algorithm utilizing attribute intercorrelation

Frequent itemsets mining using random walks for record insertion and deletion

An Empirical Study on the Characteristics of Python Fine-Grained Source Code Change Types

Online Learning from Trapezoidal Data Streams

PPRA: A new pre-fetching and prediction based replication algorithm in data grid

Combining Static and Dynamic Features for Multivariate Sequence Classification

Keyframe Extraction from Motion Capture Data for Visualization

Design and implementation of ACO feature selection algorithm for data stream mining

Updating P-dominated and P-dominating sets based on a classification tree

An incremental algorithm for rapidly computing tolerance class of incomplete information system

Adaptive robust models for identification of nonstationary systems in data stream mining tasks

Aging data in dynamic graphs: A comparative study

Association Rules Discovery via Approximate Method from Probabilistic Database

Comparative analysis of association rule mining algorithms

Malware classification method based on sequence of traffic flow

Mining the big data of residential appliances in the smart grid environment

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options