Search results

Items from 1 to 20 out of 34 results

chapter

Sequential all frequent itemsets detection: A method to detect all frequent sequential itemsets using LERP-Reduced Suffix Array data structure and ARPaD algorithm

Konstantinos F. Xylogiannopoulos, Panagiotis Karampelas, Reda Alhajj

2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) > 1141 - 1148

2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

Sequential frequent itemsets detection is one of the core problems in data mining. In the current paper we propose a new methodology based on our previous work regarding the detection of all repeated patterns in a string. By analyzing big datasets from FIMI website of up to one million transactions we were able to detect not only the most frequent sequential itemsets but any sequential itemset occurred...

chapter

Mining API Usage Examples from Test Code

Zixiao Zhu, Yanzhen Zou, Bing Xie, Yong Jin, more

2014 IEEE International Conference on Software Maintenance and Evolution > 301 - 310

2014 IEEE International Conference on Software Maintenance and Evolution (ICSME)

Lack of effective usage examples in API documents has been proven to be a great obstacle to API learning. To deal with this issue, several approaches have been proposed to automatically extract usage examples from client code or related web pages, which are unfortunately not available for newly released API libraries. In this paper, we propose a novel approach to mining API usage examples from test...

chapter

E-CVFDT: An improving CVFDT method for concept drift data stream

Gang Liu, Hong-rong Cheng, Zhi-guang Qin, Qiao Liu, more

2013 International Conference on Communications, Circuits and Systems (ICCCAS) > 1 > 315 - 318

2013 International Conference on Communications, Circuits and Systems (ICCCAS)

Distribution of data stream is always changed in the real world. This problem is usually defined as concept drift ^[1]. The state-of-the-art decision tree classification method CVFDT^[2] can solve the concept drift problem well, but the efficiency is debased because of its general method of handling instances in CVFDT without considering the types of concept drift. In this paper, an algorithm called...

chapter

A Classification Algorithm Based on Association Rule Mining

Yang Junrui, Xu Lisha, He Hongde

2012 International Conference on Computer Science and Service System > 2056 - 2059

2012 International Conference on Computer Science and Service System (CSSS)

The main difference of the associative classification algorithms is how to mine frequent item sets, analyze the rules exported and use for classification. This paper presents an associative classification algorithm based on Trie-tree that named CARPT, which remove the frequent items that cannot generate frequent rules directly by adding the count of class labels. And we compress the storage of database...

chapter

QuCOM: K nearest features neighborhood based qualitative spatial co-location patterns mining algorithm

You Wan, Chenghu Zhou

Proceedings 2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services > 54 - 59

2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM)

Spatial Co-location patterns are similar to association rules but explore more relying spatial auto-correlation. They represent subsets of Boolean spatial features whose instances are often located in close geographic proximity. Existing co-location patterns mining researches only concern the spatial attributes, and few of them can handle the huge amount of non-spatial attributes in spatial datasets...

chapter

Parallelizing an Information Theoretic Co-clustering Algorithm Using a Cloud Middleware

V Ramanathan, Wenjing Ma, V T Ravi, Tantan Liu, more

2010 IEEE International Conference on Data Mining Workshops > 186 - 193

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

The emerging cloud environments are well suited for storage and analysis of large datasets, since they can allow on-demand access to resources. However, developing high-performance implementations of data analysis tasks is a challenging problem. In our prior work, we have developed a middleware called FREERIDE (FRamework for Rapid Implementation of Data mining Engines). FREERIDE is based upon the...

chapter

Computation of Intent Reduction Based on Incremental Construction of Concept Lattices

Ge Bin, Meng Xiang-rui

2010 International Conference on Management and Service Science > 1 - 4

2010 International Conference on Management and Service Science (MASS 2010)

Concept lattice is accurate and complete in knowledge representation and is an effective tool for data analysis and knowledge discovery. This paper focuses on incremental computation of intent reduction of concepts. By theoretical analysis of characteristic change of intent reduction of lattice nodes during incremental construction of concept lattice, it advances an incremental algorithm to compute...

chapter

An Improved Apriori Algorithm

Yongge Shi, Yiqun Zhou

2010 IEEE International Conference on Granular Computing > 759 - 762

2010 IEEE International Conference on Granular Computing (GrC-2010)

In order to improve efficiency of excavation in relational database with multi-dimensional association rules, this paper analyzed Apriori algorithm and BUC algorithm based on practice. Then an improved Apriori algorithm-DGP algorithm which based on the multidimensional association rule was presented, it has more efficient and it will be used in the relational database. At last it was applied for analyzing...

chapter

Constructing Classification Model with MapReduce

Xiangxiang Chen, Kaigui Wu, Changze Wu

2010 International Conference on Multimedia Information Networking and Security > 611 - 615

2010 Second International Conference on Multimedia Information Networking and Security (MINES 2010)

Abstract-By analyzing the process of classification and MapReduce computing paradigms, it is found that the parallel and distributed computing model in MapReduce is appropriate for constructing classifier model. This paper presents a MapReduce algorithm for parallel and distributed classification, aiming to reduce the computational time in training process on large scale documents. Our experiment...

chapter

Improving Sensor Subset Selection of Machine Olfaction Using Multi-class SVM

E. Phaisangittisagul

2010 Third International Conference on Knowledge Discovery and Data Mining > 28 - 31

2010 3rd International Conference on Knowledge Discovery and Data Mining (WKDD 2010)

An approach of sensor subset selection is considered one of significant issues in machine olfaction. Basically, each sensor should provide different selectivity profiles over the range of target odor application so that a unique odor pattern is produced from each sensor in the array. However, some or most of the features obtained from an array of sensors in practice are redundant and irrelevant due...

chapter

A New Triangulation Algorithm Based on the Determination of the Polygon's Diagonals

Hai-Ying Sun, Liang Ma

2009 International Conference on Computational Intelligence and Software Engineering > 1 - 3

2009 International Conference on Computational Intelligence and Software Engineering

This paper presents a triangulation algorithm for the general plane polygon, this algorithm does not consider the polygon's concave or convex and its vertices are clockwise or counter clockwise. It first makes the elimination marks for the diagonals outside of the polygon, then determines the diagonals which intersect with the polygon and makes the elimination marks. In order to avoid the long and...

chapter

PBFMCSP: Prefix Based Fast Mining of Closed Sequential Patterns

M. Thilagu, R. Nadarajan, M.S.I. Ahmed, S.S. Bama

2009 International Conference on Advances in Computing, Control, and Telecommunication Technologies > 484 - 488

2009 International Conference on Advances in Computing, Control, & Telecommunication Technologies (ACT 2009)

In recent years, mining of sequential patterns has been studied extensively in various domains. Most of the existing algorithms find patterns in transactional databases by scanning the records whether they contain patterns or not. This paper proposes a novel algorithm to mine closed sequential patterns using an inverted matrix and prefix based sequence element matrix. Inverted matrix minimizes the...

chapter

Text Clustering Based on Key Phrases

Ai Wang, YaoDong Li, Wei Wang

2009 First International Conference on Information Science and Engineering > 986 - 989

2009 1st International Conference on Information Science and Engineering (ICISE 2009)

Text clustering is a hot and essential topic in data mining and information retrieval. This paper proposed a KP-FCM clustering method, which used the key phrases as text features and applied the Fuzzy c-means (FCM) as clustering algorithm. In this method, key phrases were extracted by an algorithm based on suffix array. Experimental results on two standard text clustering benchmark corpuses, OHSUMED...

chapter

Improving Scientific Data Extraction Using Metadata Classification

Yue Shan Chang, Hsuan-Jen Lai, Hsiang-Tai Cheng

2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks > 669 - 673

2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks (ISPAN 2009)

There are large scientific data archives manage and store huge quantities of data, deal with this data throughout its life cycle, and focus on particular scientific domains. Metadata can be used for assisting the information retrieval. Using metadata to represent the file system also reduces the processing required to handle operations. While the number of metadata file is daily incremental with the...

chapter

DOA Detection Method Based on Midamble Sequence

Haiyang Fu, Fan Li, Shixiang Shao, Xiangdong Jia

2009 International Symposium on Computer Network and Multimedia Technology > 1 - 4

2009 International Symposium on Computer Network and Multimedia Technology (CNMT 2009)

One of the core technologies in smart antenna (SA) is DOA estimation. The current DOA estimation methods can be classified into three basic categories: spectrum searching algorithms, subspace algorithms and algorithms for best performance. All of these three categories have some limitations and can not be applied in CDMA system directly. This paper proposes a new simple and practical method for DOA...

chapter

Improved Algorithm for Computing Convex Hull of Plane Point Set

Dong Wei, XingHua Liu

2009 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2009 International Conference on Computational Intelligence and Software Engineering

An algorithm for computing the convex hull of scattered plane point set through the extreme points on the boundary of plane is proposed. According to the extreme points, the plane point set is divided into five zones. The four zones on the boundary contain all convex vertexes. By computing extreme points of subsets in the four marginal zones, a polygon that contains all convex vertexes is obtained...

chapter

Non-digitizing Data Restoration with Using Indirect Data Processing

T.M. Gelfeld, M. Reiner

2009 Computation World: Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns > 220 - 226

2009 Computation World: Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns (ComputationWorld 2009)

In this study, we proposed the method of automatics searching predefined events location in digital images of old paper-tape data recording, which in essence is indirect processing. The main idea of proposed algorithm is isomorphic transformation of paper-tape digital images to the time-serial data. The time-serial data obtained by this transformation is clustered and classified to obtain the positions...

chapter

An online clustering algorithm for Chinese web snippets based on Generalized Suffix Array

Zhang Hui, Wang Han, Yang Gao, Zhou Jingmin

2009 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery > 148 - 154

2009 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery. CyberC 2009

As the information on the Internet increases dramatically, the Web search engine has become an indispensable tool to search and locate the required information. Web snippets clustering can classify the search results and help users to narrow the search scope. This paper presents an online clustering algorithm for Chinese web snippets using common substrings. The algorithm firstly preprocesses the...

chapter

Research on Policy Conflict Based on Layered Policy Representation Framework

Hu Jun, Sun Ye, Wang Bai-yun

2009 Fifth International Conference on Semantics, Knowledge and Grid > 144 - 151

2009 Fifth International Conference on Semantics, Knowledge and Grid (SKG 2009)

Based on a kind of layered policy representation framework, several types of policy conflict were proposed through the research on the characteristics of each layer of the policy and the relationship of policies, then an in-depth analysis on these types of policy conflict were discussed, moreover, a variety of relevant detection and elimination algorithms of policy conflict were put forward. Finally,...

chapter

A Method for Implementing a Statistically Significant Number of Data Classes in the Jenks Algorithm

M.A. North

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 35 - 38

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

The Jenks natural breaks algorithm is a standard method for dividing a dataset into a certain number of homogenous classes. The algorithm is commonly used in geographic information systems (GIS) applications. One major drawback to the use of Jenks in this context is that the number of desired classes must be indicated before the algorithm is applied to the dataset. Without a mechanism for determining...

Data set:
ieee
Keywords:
DATA MINING
ARRAYS
CLASSIFICATION ALGORITHMS

Publication date

Set your own date range

Content availability

Available (33)
None (1)

Keywords

ALGORITHM DESIGN AND ANALYSIS (16)
CLUSTERING ALGORITHMS (7)
PATTERN CLASSIFICATION (7)
FEATURE EXTRACTION (6)
EDUCATIONAL INSTITUTIONS (5)
SIGNAL PROCESSING ALGORITHMS (5)
COMPUTERS (4)
PARALLEL PROCESSING (4)
PATTERN CLUSTERING (4)
SIGNAL PROCESSING (4)
ARRAY SIGNAL PROCESSING (3)
ARTIFICIAL NEURAL NETWORKS (3)
COMPUTER ARCHITECTURE (3)
EQUATIONS (3)
GENETIC ALGORITHMS (3)
IMAGE PROCESSING (3)
INDEXES (3)
INFORMATION RETRIEVAL (3)
ITEMSETS (3)
MATHEMATICAL MODEL (3)
PATTERN RECOGNITION (3)
PRESSES (3)
RANDOM ACCESS MEMORY (3)
SOFTWARE (3)
SOFTWARE ALGORITHMS (3)
TRAINING (3)
ACCURACY (2)
ANALYTICAL MODELS (2)
ASSOCIATION RULES (2)
CHARACTER RECOGNITION (2)
CLASSIFICATION (2)
CLASSIFICATION ALGORITHM (2)
CLOCKS (2)
COMPLEXITY THEORY (2)
COMPUTATIONAL GEOMETRY (2)
COMPUTER LANGUAGES (2)
COMPUTER SCIENCE (2)
CORRELATION (2)
DATA ANALYSIS (2)
DATA HANDLING (2)
DATA MODELS (2)
DATABASES (2)
DECISION TREES (2)
DIRECTION OF ARRIVAL ESTIMATION (2)
DISPLAYS (2)
DISTRIBUTED DATABASES (2)
ESTIMATION (2)
FIELD PROGRAMMABLE GATE ARRAYS (2)
GAIN (2)
IMAGE EDGE DETECTION (2)
IMAGE RECOGNITION (2)
KNOWLEDGE DISCOVERY (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
LIBRARIES (2)
MONITORING (2)
NOISE (2)
PARTITIONING ALGORITHMS (2)
PERFORMANCE ANALYSIS (2)
REAL TIME SYSTEMS (2)
SIGNAL RESOLUTION (2)
SUPPORT VECTOR MACHINE CLASSIFICATION (2)
SUPPORT VECTOR MACHINES (2)
TEXT ANALYSIS (2)
TRAINING DATA (2)
TRANSFORMS (2)
USA COUNCILS (2)
VECTORS (2)
ACCELERATION (1)
ADABOOST (1)
ADAPTATION MODEL (1)
ADAPTIVE ANTENNA ARRAYS (1)
ADDERS (1)
ANALOG-DIGITAL CONVERSION (1)
ANALYTICAL MULTIPATH MODEL (1)
ANTENNA ARRAYS (1)
ANTENNAS (1)
ANTENNAS AND PROPAGATION (1)
API (1)
APPROXIMATION ALGORITHMS (1)
APPROXIMATION METHODS (1)
APPROXIMATION THEORY (1)
APRIORI ALGORITHM (1)
ARGO (1)
ARPAD (1)
ARTIFICIAL INTELLIGENCE (1)
ASH (1)
ASSOCIATIVE CLASSIFICATION (1)
ATMOSPHERIC MEASUREMENTS (1)
ATMOSPHERIC MODELING (1)
ATMOSPHERIC WAVES (1)
ATTRIBUTE DIVISION ALGORITHM (1)
ATTRIBUTE SELECTION (1)
AUTOMATIC TARGET RECOGNITION (1)
AUTOMATION (1)
AVERAGE MEMORY ACCESS TIME (1)
AVERAGE MEMORY ACCESS TIME (AMAT) (1)
BACKSCATTER (1)
more

INFONA - science communication portal

Search results

Sequential all frequent itemsets detection: A method to detect all frequent sequential itemsets using LERP-Reduced Suffix Array data structure and ARPaD algorithm

Mining API Usage Examples from Test Code

E-CVFDT: An improving CVFDT method for concept drift data stream

A Classification Algorithm Based on Association Rule Mining

QuCOM: K nearest features neighborhood based qualitative spatial co-location patterns mining algorithm

Parallelizing an Information Theoretic Co-clustering Algorithm Using a Cloud Middleware

Computation of Intent Reduction Based on Incremental Construction of Concept Lattices

An Improved Apriori Algorithm

Constructing Classification Model with MapReduce

Improving Sensor Subset Selection of Machine Olfaction Using Multi-class SVM

A New Triangulation Algorithm Based on the Determination of the Polygon's Diagonals

PBFMCSP: Prefix Based Fast Mining of Closed Sequential Patterns

Text Clustering Based on Key Phrases

Improving Scientific Data Extraction Using Metadata Classification

DOA Detection Method Based on Midamble Sequence

Improved Algorithm for Computing Convex Hull of Plane Point Set

Non-digitizing Data Restoration with Using Indirect Data Processing

An online clustering algorithm for Chinese web snippets based on Generalized Suffix Array

Research on Policy Conflict Based on Layered Policy Representation Framework

A Method for Implementing a Statistically Significant Number of Data Classes in the Jenks Algorithm

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options