Search results

Items from 1 to 20 out of 40 results

chapter

A principal curve-based method for data clustering

Elson Claudio Correa Moraes, Danton Diego Ferreira

2016 International Joint Conference on Neural Networks (IJCNN) > 3966 - 3971

2016 International Joint Conference on Neural Networks (IJCNN)

In this work a new method for data clustering based on principal curves is presented. Principal curves consist of a nonlinear generalization of Principal Component Analysis and may also be regarded as continuous versions of 1-D self-organizing maps. The proposed method divides the principal curves extracted by the k-segments algorithm into two or more curves, according to the number of clusters defined...

chapter

Fuzzy data mining and web intelligence

Venkata Subba Reddy Poli

2015 International Conference on Fuzzy Theory and Its Applications (iFUZZY) > 74 - 79

2015 International Conference on Fuzzy Theory and Its Applications (iFUZZY)

The data mining on web is difficult for online analytic processing (OLAP) with BIG DATA. The data mining is made simple by approximating the databases of BIG DATA for knowledge discovery process particularly MapReducing. The approximate information is fuzzy rather than probability. In this paper, fuzzy web data mining is discussed for BIG DATA for association rules. The query processing is discussed...

chapter

DISC: Efficient Uncertain Frequent Pattern Mining with Tightened Upper Bounds

Richard Kyle MacKinnon, Teagan D. Strauss, Carson Kai-Sang Leung

2014 IEEE International Conference on Data Mining Workshop > 1038 - 1045

2014 IEEE International Conference on Data Mining Workshop (ICDMW)

UF-growth is a tree-based exact algorithm for mining frequent patterns from uncertain data. While it directly calculates the expected support of an item set, it requires a significant amount of storage space to capture all existential probability values among the items. To eliminate the extra space requirement of UF-growth, the CUF-growth algorithm combines nodes with the same item by storing an upper...

chapter

An Enhancement in Clustering for Sequential Pattern Mining through Neural Algorithm Using Web Logs

Sheetal Sahu, Praneet Saurabh, Sandeep Rai

2014 International Conference on Computational Intelligence and Communication Networks > 758 - 764

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

An Organization need to understand their customers' behavior, preferences and future needs which depend upon past behavior. Web Usage Mining is an active research topic in which customers session clustering is done to understand the customers activities. This paper investigates the problem of mining frequent pattern and especially focuses on reducing the number of scans of the database and reflecting...

chapter

An Asynchronous Periodic Sequential Patterns Mining Algorithm with Multiple Minimum Item Supports

Xiangzhan Yu, Haining Yu

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing > 274 - 281

2014 Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC)

Original sequential pattern mining model only considers occurrence frequentness of sequential patterns, disregards their occurrence periodicity. We propose the asynchronous periodic sequential pattern mining model to discover the sequential patterns which are not only occurring frequently, but also appearing periodically. For this mining model, we propose a pattern-growth mining algorithm to mine...

chapter

Database redundant attribute detection using fractal dimension

Bo Liu

2014 IEEE 5th International Conference on Software Engineering and Service Science > 561 - 564

2014 5th IEEE International Conference on Software Engineering and Service Science (ICSESS)

The method for detecting redundant attributes in relational datasets using the fractal ideology is studied. Based on the fractal dimension of a dataset and its variations, an algorithm for detecting redundant attributes is presented. The work has the following features: datasets with numeric and discrete attributes can be processed; an approach based on depth-equal data dimension division(i.e., the...

chapter

Sensitive attribute based non-homogeneous anonymization for privacy preserving data mining

P. Usha, R. Shriram, S. Sathishkumar

International Conference on Information Communication and Embedded Systems (ICICES2014) > 1 - 5

2014 International Conference on Information Communication and Embedded Systems (ICICES)

Data mining is the process of extracting interesting patterns or knowledge from large amount of data. With the development of data mining technology, an increasing number of data can be mined out to reveal some potential information about the user, because of which privacy of the user may be violated easily. Privacy Preserving Data Mining (PPDM) is used to mine the potential valuable knowledge without...

chapter

An adaptive sliding window based continuous Top-K dominating queries

G. Sandhya, S. Kousalya Devi

2013 7th International Conference on Intelligent Systems and Control (ISCO) > 349 - 353

2013 7th International Conference on Intelligent Systems and Control (ISCO)

Top-K dominating query selects k data objects and influences the highest number of objects in a dataset. This is a decision supportable query since it provides data analysts a best way for finding significant objects. This search is not only for the earlier examination of large upper bounds that leads to earlier identification of results, but also eliminates partial dominance relationship between...

chapter

Pattern Mining from Trajectory GPS Data

Xiaoliang Gen, Hiroki Arimura, Takeaki Uno

2012 IIAI International Conference on Advanced Applied Informatics > 60 - 65

2012 IIAI International Conference on Advanced Applied Informatics (IIAIAAI)

In this paper, we consider data mining from large discrete trajectory data. We study closed pattern mining for the class of trajectory envelope patterns. First, we introduce the basic definition of trajectory data. Then, we present a depth-first search algorithm that finds all trajectory envelope patterns in a given database that satisfies constrants on maximum width, minimum length, and minimum frequency...

chapter

A Two-Phase Heuristic Construction of Feature Sets for Classification

Miguel Garcia-Torres, Roberto Ruiz, Belen Meli´n Batista, Jose A. Moreno Perez, more

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence > 1028 - 1031

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence (ICTAI)

The aim of feature selection applied to a classification task is to find a minimal subset of features for being used in the classification. Some researches have focused their effort on selecting a useful set of attributes, others on selecting a relevant and not redundant set of attributes. We proposed a heuristic construction algorithm for selecting a useful and not redundant subset of features. The...

chapter

Weighted MUSE for Frequent Sub-Graph Pattern Finding in Uncertain DBLP Data

Shawana Jamil, Azam Khan, Zahid Halim, A. Rauf Baig

2011 International Conference on Internet Technology and Applications > 1 - 6

2011 International Conference on Internet Technology and Applications (iTAP)

Studies shows that finding frequent sub-graphs in uncertain graphs database is an NP complete problem. Finding the frequency at which these sub-graphs occur in uncertain graph database is also computationally expensive. This paper focus on investigation of mining frequent sub-graph patterns in DBLP uncertain graph data using an approximation based method. The frequent sub-graph pattern mining problem...

chapter

A novel predictor for moving objects

Hongjun Li, Changjie Tang, Shaojie Qiao

The 3rd International Conference on Information Sciences and Interaction Sciences > 220 - 223

2010 3rd International Conference on Information Sciences and Interaction Sciences (ICIS)

Existing trajectory prediction algorithms mainly employ kinematical models to approximate real world routes and always ignore spatial and temporal distance. In order to overcome the drawbacks of existing trajectory prediction approaches, this paper proposes a novel trajectory prediction algorithm. It works as: (1) mining the interesting regions from trajectory data sets; (2) extracting the trajectory...

chapter

Maximizing visibility of objects

Muhammed Miah

2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010) > 289 - 292

2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010)

In recent years, there has been significant interest in the development of ranking functions and efficient top-k retrieval algorithms to help users in ad-hoc search and retrieval in databases (e.g., buyers searching for products in a catalog). We introduce a complementary problem: how to guide a seller in selecting the best attributes of a new tuple (e.g., a new product) to highlight so that it stands...

chapter

A Cost-Effective LSH Filter for Fast Pairwise Mining

Gang Zhao, Yun Xiong, Longbing Cao, Dan Luo, more

2009 Ninth IEEE International Conference on Data Mining > 1088 - 1093

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

The pairwise mining problem is to discover pairwise objects having measures greater than the user-specified minimum threshold from a collection of objects. It is essential in a large variety of database and data-mining applications. Of late, there has been increasing interest in applying a Locality-Sensitive Hashing (LSH) scheme for pairwise mining. LSH-type methods have shown themselves to be simply...

chapter

A Paramount Pair of Cache Replacement Algorithms on L1 and L2 Using Multiple Databases with Security

R. Gupta, S. Tokekar, D.K. Mishra

2009 Second International Conference on Emerging Trends in Engineering&Technology > 346 - 351

2009 2nd International Conference on Emerging Trends in Engineering and Technology (ICETET 2009)

To overpass the speed gap between processor and main memory; cache memory is used. Cache memory is having hierarchical structure, including level 1 cache (L1), level 2 cache (L2) etc. Effective page replacement algorithm will result in effectual utilization of cache. L1 is having rich temporal locality while L2 is having poor temporal locality, thus same replacement algorithms for both the levels...

chapter

Efficient Discovery of Frequent Correlated Subgraph Pairs

Yiping Ke, J. Cheng, J.X. Yu

2009 Ninth IEEE International Conference on Data Mining > 239 - 248

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

The recent proliferation of graph data in a wide spectrum of applications has led to an increasing demand for advanced data analysis techniques. In view of this, many graph mining techniques, such as frequent subgraph mining and correlated subgraph mining, have been proposed. In many applications, both frequency and correlation play an important role. Thus, this paper studies a new problem of mining...

chapter

Duplicate Record Detection for Database Cleansing

M. Rehman, V. Esichaikul

2009 Second International Conference on Machine Vision > 333 - 338

2009 Second International Conference on Machine Vision (ICMV 2009)

Many organizations collect large amounts of data to support their business and decision making processes. The data collected from various sources may have data quality problems in it. These kinds of issues become prominent when various databases are integrated. The integrated databases inherit the data quality problems that were present in the source database. The data in the integrated systems need...

chapter

Feature subset selection using generalized steepest ascent search algorithm

S. Nakariyakul

2009 Eighth International Symposium on Natural Language Processing > 147 - 151

2009 Eighth International Symposium on Natural Language Processing. SNLP 2009

This paper presents a novel generalized steepest ascent algorithm for selecting a subset of features. Our proposed algorithm is an improvement upon the prior steepest ascent algorithm by selecting a better starting search point and performing a more thorough search than the steepest ascent algorithm. For any given criterion function used to evaluate the effectiveness of a selected feature subsets,...

chapter

Faster Estimation of the Correlation Fractal Dimension Using Box-counting

C. Attikos, M. Doumpos

2009 Fourth Balkan Conference in Informatics > 93 - 95

2009 Fourth Balkan Conference in Informatics (BCI 2009)

Fractal dimension is widely adopted in spatial databases and data mining, among others as a measure of dataset skewness. State-of-the-art algorithms for estimating the fractal dimension exhibit linear runtime complexity whether based on box-counting or approximation schemes. In this paper, we revisit a correlation fractal dimension estimation algorithm that redundantly rescans the dataset and, extending...

chapter

Producing synoptic data structure for high speed data streams in telecommunication network management

Jiang Guo-quan, Deng Bo, Zhang Xiao-yi, Ding Kun

2009 4th International Conference on Computer Science&Education > 815 - 818

2009 4th International Conference on Computer Science & Education (ICCSE 2009)

In large telecommunication network management system, substantial data containing the information of network traffic, network element status, device running situation and all other messages are continuously sent from each special network management system to the integrated network management system. This kind of data is typically the stream data. Current network management system employs traditional...

Data set:
ieee
Keywords:
DATA MINING
APPROXIMATION ALGORITHMS
DATABASES

Publication date

Set your own date range

INFONA - science communication portal

Search results

A principal curve-based method for data clustering

Fuzzy data mining and web intelligence

DISC: Efficient Uncertain Frequent Pattern Mining with Tightened Upper Bounds

An Enhancement in Clustering for Sequential Pattern Mining through Neural Algorithm Using Web Logs

An Asynchronous Periodic Sequential Patterns Mining Algorithm with Multiple Minimum Item Supports

Database redundant attribute detection using fractal dimension

Sensitive attribute based non-homogeneous anonymization for privacy preserving data mining

An adaptive sliding window based continuous Top-K dominating queries

Pattern Mining from Trajectory GPS Data

A Two-Phase Heuristic Construction of Feature Sets for Classification

Weighted MUSE for Frequent Sub-Graph Pattern Finding in Uncertain DBLP Data

A novel predictor for moving objects

Maximizing visibility of objects

A Cost-Effective LSH Filter for Fast Pairwise Mining

A Paramount Pair of Cache Replacement Algorithms on L1 and L2 Using Multiple Databases with Security

Efficient Discovery of Frequent Correlated Subgraph Pairs

Duplicate Record Detection for Database Cleansing

Feature subset selection using generalized steepest ascent search algorithm

Faster Estimation of the Correlation Fractal Dimension Using Box-counting

Producing synoptic data structure for high speed data streams in telecommunication network management

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options