Frequent itemset mining is a fundamental step in the analysis of big data where correlation among the raw data is deemed necessary. In the modern era the amount of data available for processing has grown exponentially, making it an increasingly difficult task for mining algorithms to provide solutions in a timely manner. Software implementations are normally not efficient in handling such datasets, hence the focus on parallel...
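The level-wise search these abstracts build on is the classical Apriori pattern: count candidates of size k, keep the frequent ones, join them into size-(k+1) candidates. A minimal single-machine sketch (illustrative only, assuming the database is a list of transaction sets; function and variable names are our own, not any paper's code):

```python
def apriori(db, minsup):
    """Return every itemset with support >= minsup. db is a list of sets."""
    items = sorted({i for t in db for i in t})
    freq, cands = [], [frozenset([i]) for i in items]
    while cands:
        # count support of the current candidates in one pass over db
        level = [c for c in cands if sum(c <= t for t in db) >= minsup]
        freq += level
        # join step: merge frequent k-itemsets into (k+1)-candidates
        cands = list({a | b for a in level for b in level
                      if len(a | b) == len(a) + 1})
    return freq
```

The same level-wise structure is what the parallel and distributed variants below partition across cores or cluster nodes.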
Flatness is one of the most important specifications for strip products in cold rolling processes. Shape control of cold rolled products is often characterized as a complex process with multiple operating conditions, multiple variables, time-varying parameters, strong coupling and nonlinearity. Accurate online shape defect diagnosis is still a difficult task. This paper proposes a frequent pattern mining...
The paper presents a parallel implementation of the Dynamic Itemset Counting (DIC) algorithm for many-core systems, where DIC is a variation of the classical Apriori algorithm. We propose a bit-based internal layout for transactions and itemsets, under the assumption that such a representation of the transaction database fits in main memory. This technique reduces the memory space for storing the transaction...
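The bit-based layout mentioned above can be illustrated with a vertical bitmap: one integer bitmask per item, with bit t set when transaction t contains the item, so an itemset's support is the popcount of the AND of its item columns. This is a hedged sketch of the general technique, not the paper's actual layout (names and in-memory representation are assumptions):

```python
def build_bit_columns(transactions, n_items):
    """One integer bitmask per item; bit t is set iff transaction t has the item."""
    cols = [0] * n_items
    for t, items in enumerate(transactions):
        for i in items:
            cols[i] |= 1 << t
    return cols

def support(cols, itemset):
    """Support of a non-empty itemset = popcount of the AND of its columns."""
    mask = cols[itemset[0]]
    for i in itemset[1:]:
        mask &= cols[i]
    return bin(mask).count("1")
```

The appeal for many-core systems is that the AND-and-popcount kernel is branch-free and vectorizes well.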
Software developers often need to repeat similar modifications in multiple different locations of a system's source code. These repeated similar modifications, or systematic edits, can be both tedious and error-prone to perform manually. While there are tools that can be used to assist in automating systematic edits, it is not straightforward to find out where the occurrences of a systematic edit...
Frequent pattern mining across streaming data is a challenging task. It requires real-time response and incurs great computational complexity. In this paper, we discuss the challenges of developing frequent pattern mining algorithms for streaming data, compare three algorithms proposed in the literature, and explore the scope for improvement in these algorithms. We discuss the suitability of these algorithms according...
Mining high utility itemsets from a transactional database refers to the discovery of itemsets with high utility, such as profit. Although a number of relevant approaches have been proposed in recent years, they incur the problem of producing a large number of candidate itemsets for high utility itemsets. Such a large number of candidate itemsets degrades the mining performance in terms of execution...
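The "utility" notion referred to here is commonly defined as quantity times unit profit, summed over the itemset's items and over every transaction containing the itemset. A minimal sketch of that definition (the representation as a dict of item-to-quantity is our assumption for illustration):

```python
def itemset_utility(db, profits, itemset):
    """Sum quantity * unit profit over all transactions containing the itemset.

    db: list of dicts mapping item -> purchase quantity.
    profits: dict mapping item -> unit profit.
    """
    total = 0
    for trans in db:
        if all(i in trans for i in itemset):
            total += sum(trans[i] * profits[i] for i in itemset)
    return total
```

Unlike plain support, utility is neither monotone nor anti-monotone, which is why candidate pruning is the hard part these approaches wrestle with.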
Frequent pattern mining is playing an increasingly important role in a growing number of real-time data flow scenarios, such as large-scale order stream data, network traffic monitoring, and web access record streams. The continuous, unbounded and high-speed characteristics of massive data streams pose a huge challenge for current frequent pattern mining approaches. The main challenge is...
Methods for cleaning dirty data typically rely on additional information about the data, such as user-specified constraints that specify when a database is dirty. These constraints often involve domain restrictions and illegal value combinations. Traditionally, a database is considered clean if all constraints are satisfied. However, many real-world scenarios only have a dirty database available...
Given a programming problem, there may be many distinct solutions because of the variety of data structures and algorithms that can be applied and the different trade-offs, such as space versus time, to be considered. By comparing their own solution against others' and learning from the distinct solutions, a learner may quickly improve programming skills and gain experience in making trade-offs. Meanwhile, on the...
Due to the large scale and complexity of big data, mining big data on a single personal computer is a difficult problem. With the increase in the size of databases, parallel computing systems can yield considerable advantages in data mining applications through the exploitation of parallel data mining algorithms. Parallelization of association rule mining algorithms is an important task in data mining...
In recent years, big data has become an important resource of the information society, and research across many fields focuses on extracting useful information from big data as effectively as possible. On the other hand, with the increasing complexity of network data, the problem of cyber security becomes more and more serious. Protocol identification technology is an effective...
Frequent Itemset Mining is one of the most investigated fields of data mining. It is expensive to mine frequent itemsets in a large-scale data set. In particular, when some data is added to the data set, it is still time-consuming to re-compute the complete data set from scratch in order to update its frequent itemsets. Aiming to improve the performance of frequent itemset mining for large...
Frequent Itemset Mining (FIM) is the most important and time-consuming step of association rule mining. With the growth of data scale, many efficient single-machine FIM algorithms, such as FP-growth and Apriori, cannot accomplish the computing tasks within a reasonable time. As a result of the limitations of single-machine methods, researchers have presented some distributed algorithms based on MapReduce...
Traditional intrusion detection technology is mostly based on the needs of Web logs, using a single improved data mining algorithm for analysis, which cannot be used in an unknown, zero-knowledge rule-database environment, and whose efficiency in detecting potential threats and abnormal behavior is not significant. Therefore, this paper proposes an intrusion detection system based on data...
With the continuous expansion of data stream applications, frequent pattern mining over data streams is becoming a hot research topic in the field of data mining, and scholars at home and abroad have put forward a large number of data stream frequent itemset mining algorithms. This paper refines the related definitions of frequent itemsets and sliding windows, and classifies sliding windows from data...
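A count-based sliding window, as framed here, makes only the most recent W transactions eligible: counts are incremented on arrival and decremented on expiry. A minimal single-item-count sketch under that assumption (class and method names are illustrative; real algorithms maintain itemset structures, not just item counts):

```python
from collections import deque, Counter

class SlidingWindowCounts:
    def __init__(self, window):
        self.window, self.buf, self.counts = window, deque(), Counter()

    def add(self, transaction):
        self.buf.append(transaction)
        self.counts.update(transaction)
        if len(self.buf) > self.window:      # oldest transaction expires
            self.counts.subtract(self.buf.popleft())

    def frequent(self, minsup):
        return {i for i, c in self.counts.items() if c >= minsup}
```

The deque gives O(1) expiry of the oldest transaction, which is what keeps per-arrival cost bounded on a high-speed stream.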
A maximal frequent itemset is a largest frequent itemset in a database that is not covered by any other frequent itemset. All frequent itemsets can be built up from the maximal ones. Moreover, it is possible to focus on any part of a maximal frequent itemset to supervise data mining. The Bees Algorithm is a simple, robust, population-based stochastic optimization algorithm based on bees' natural foraging...
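The definition above (frequent, with no frequent strict superset) can be checked by brute force on a tiny database. A hedged illustration of the notion only; the paper couples it with the Bees Algorithm, which is not shown here:

```python
from itertools import combinations

def frequent_itemsets(db, minsup):
    """Enumerate every frequent itemset by exhaustive search (tiny db only)."""
    items = sorted({i for t in db for i in t})
    freq = []
    for k in range(1, len(items) + 1):
        for cand in combinations(items, k):
            if sum(set(cand) <= t for t in db) >= minsup:
                freq.append(set(cand))
    return freq

def maximal(freq):
    """Keep only itemsets with no frequent strict superset."""
    return [s for s in freq if not any(s < t for t in freq)]
```

Since every frequent itemset is a subset of some maximal one, returning only the maximal sets is a compact, lossless-in-membership summary of the frequent collection.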
Many existing approaches to data cube computation search for the group-by partitions on a fact table with support greater than some threshold, that is, those that can be obtained from SQL group-by queries with the clause HAVING COUNT(*) >= supp, where supp is a support threshold. Those partitions constitute what is called the iceberg data cube. The present work proposes an efficient method to compute...
Customer retention in the telecom market is a big research challenge in developed as well as developing economies, as the market is almost saturated and highly competitive, with a large number of local and global service providers. It is also well known that, from a business point of view, retaining an existing customer is much less costly than acquiring a new one. Hence, retaining existing customers by making...
Cloud computing offers a policy to users whereby the data to be retrieved can be exchanged between the user and the server. The information given to a third-party server carries confidentiality threats, as users with weak computational power cannot validate the correctness of the data that are grouped. This paper aims at broken itemsets, in which the server is not reliable and outbursts the...
Generally, medical datasets are heterogeneous and high-dimensional, containing millions of patient records. Extracting information from such datasets is a tedious process, which can be made easier by some of the clustering algorithms available in data mining. In this paper, three clustering algorithms, namely Medical Storage Platform for data Mining (MSPM), Homogeneity Similarity based Hierarchical...