2007 IEEE Symposium on Computational Intelligence and Data Mining

We present a method of learning relationships at the triadic level of a relationship network. The method proposes learning linkages of a particular network using a Support Vector Machine (SVM) classifier trained on the known part of a relationship network. Using features drawn from the topological information of the two degrees of separation of a link a classifier learns whether two people of that...

chapter

Validity of Probabilistic Rules

Marina Sapir, Mikhail Teverovskiy

2007 IEEE Symposium on Computational Intelligence and Data Mining > 6 - 9

2007 First IEEE Symposium on Computational Intelligence and Data Mining

We propose an axiomatic approach to defining of the validity of probabilistic inductive rules E H. The set of rules is evaluated against an available dataset, where the conditions E, H are either true or false for each instance in the dataset. Introduced here are six axioms which formalize common sense dependencies between the validity of rules and their support, confidence, lift and amount of available...

chapter

An Efficient Distance Calculation Method for Uncertain Objects

Lurong Xiao, Edward Hung

2007 IEEE Symposium on Computational Intelligence and Data Mining > 10 - 17

2007 First IEEE Symposium on Computational Intelligence and Data Mining

Recently the academic communities have paid more attention to the queries and mining on uncertain data. In the tasks such as clustering or nearest-neighbor queries, expected distance is often used as a distance measurement among uncertain data objects. Traditional database systems store uncertain objects using their expected (average) location in the data space. Distances can be calculated easily...

chapter

K2GA: Heuristically Guided Evolution of Bayesian Network Structures from Data

Eli Faulkner

2007 IEEE Symposium on Computational Intelligence and Data Mining > 18 - 25

2007 First IEEE Symposium on Computational Intelligence and Data Mining

We present K2GA, an algorithm for learning Bayesian network structures from data. K2GA uses a genetic algorithm to perform stochastic search, while employing a modified version of the K2 heuristic to score proposed networks and improve future generations. We show each component of K2GA, a combination of these components to form the basic algorithm, extensions to the algorithm for improved accuracy,...

chapter

Extracting Borderline Associations

Wei Kian Chen, Dustin Baumgartner, Ryan Millikin

2007 IEEE Symposium on Computational Intelligence and Data Mining > 26 - 30

2007 First IEEE Symposium on Computational Intelligence and Data Mining

In this paper, we present an extension of the well known algorithm for association mining, Apriori. This extended algorithm, ApriorBL, considers associations between items which occur together - focusing solely on the borderline cases. These borderline cases occur often enough to provide valuable information; however, there are currently no algorithms that target them. We discuss how the AprioriBL...

chapter

Selecting the Right Peer Schools for AACSB Accreditation - A Data Mining Application

M.Y. Kiang, S.A. Fisher, D.M. Fisher, R.T. Chi

2007 IEEE Symposium on Computational Intelligence and Data Mining > 31 - 34

2007 First IEEE Symposium on Computational Intelligence and Data Mining

For a business school, the selection of its peer schools is an important component of its International Association for Management Education (AACSB) (re)accreditation process. A school typically compares itself with other institutions having similar structural and identity-based attributes. The identification of peer schools is critical and can have a significant impact on a business school's accreditation...

chapter

Structure Prediction in Temporal Networks using Frequent Subgraphs

Mayank Lahiri, Tanya Y. Berger-Wolf

2007 IEEE Symposium on Computational Intelligence and Data Mining > 35 - 42

2007 First IEEE Symposium on Computational Intelligence and Data Mining

There are several types of processes which can be modeled explicitly by recording the interactions between a set of actors over time. In such applications, a common objective is, given a series of observations, to predict exactly when certain interactions will occur in the future. We propose a representation for this type of temporal data and a generic, streaming, adaptive algorithm to predict the...

chapter

An Analytical Evaluation of Objective Measures Behavior for Generalized Association Rules

Veronica Oliveira de Carvalho, Solange Oliveira Rezende, Mario de Castro

2007 IEEE Symposium on Computational Intelligence and Data Mining > 43 - 50

2007 First IEEE Symposium on Computational Intelligence and Data Mining

The association rule mining task identifies all the intrinsic associations among the items contained in data and leads to only specialized knowledge. To overcome this problem the generalized association rules appeared. This type of rule associates not only the items contained in data, but also some items encoded into a given taxonomy. Therefore, the techniques used to obtain generalized association...

chapter

Versatile and Efficient Meta-Learning Architecture: Knowledge Representation and Management in Computational Intelligence

K. Grabczewski, N. Jankowski

2007 IEEE Symposium on Computational Intelligence and Data Mining > 51 - 58

2007 First IEEE Symposium on Computational Intelligence and Data Mining

There are many data mining systems derived from machine learning, neural network, statistics and other fields. Most of them are dedicated to some particular algorithms or applications. Unfortunately, their architectures are still too naive to provide satisfactory background for advanced meta-learning problems. In order to efficiently perform sophisticated meta-level analysis, we need a very versatile,...

chapter

Query-sensitive Feature Selection for Lazy Learners

Xin Tong, Mingyang Gu

2007 IEEE Symposium on Computational Intelligence and Data Mining > 59 - 65

2007 First IEEE Symposium on Computational Intelligence and Data Mining

Feature selection contributes to increasing many learners' accuracy by identifying and removing irrelevant features in multidimensional datasets. Conventional feature selection methods determine the optimal feature subset independently from and prior to the introduction of a new query. In general, some features will be relevant only in certain tasks. We argue that a query, as an indicator of the attention...

chapter

Comparison of Classifiers Efficiency on Missing Values Recovering: Application in a Marketing Database with Massive Missing Data

B.M. Nogueira, T.R.A. Santos, L.E. Zarate

2007 IEEE Symposium on Computational Intelligence and Data Mining > 66 - 72

2007 First IEEE Symposium on Computational Intelligence and Data Mining

Missing data in databases are considered to be one of the biggest problems faced on data mining application. This problem can be aggravated when there is massive missing data in the presence of imbalanced databases. Several techniques as samples deletion, values imputation, values prediction through classifiers and approximation of patterns have been proposed and compared, but these comparisons do...

chapter

Manifold Learning using Growing Locally Linear Embedding

Junsong Yin, Dewen Hu, Zongtan Zhou

2007 IEEE Symposium on Computational Intelligence and Data Mining > 73 - 80

2007 First IEEE Symposium on Computational Intelligence and Data Mining

Locally Linear Embedding (LLE) is an effective nonlinear dimensionality reduction method for exploring the intrinsic characteristics of high dimensional data. This paper mainly proposes a hierarchical framework manifold learning method, based on LLE and Growing Neural Gas(GNG), named Growing Locally Linear Embedding(GLLE). First, we address the major limitations of the original LLE: intrinsic dimensionality...

chapter

A Novel Complex-Valued Counterpropagation Network

Prem K. Kalra, Deepak Mishra, Kanishka Tyagi

2007 IEEE Symposium on Computational Intelligence and Data Mining > 81 - 87

2007 First IEEE Symposium on Computational Intelligence and Data Mining

The Counterpropagation network is a combination of competitive network (Kohonen layer) and Grossberg outstar structure. In this paper we have proposed a complex valued representation on conventional forward only counterpropagation network. Many researchers have investigated the computational capabilities of neuron models for real values only. The novel part of the paper is, while considering the complex...

chapter

A Prototype-driven Framework for Change Detection in Data Stream Classification

Hamed Valizadegan, Pang-Ning Tan

2007 IEEE Symposium on Computational Intelligence and Data Mining > 88 - 95

2007 First IEEE Symposium on Computational Intelligence and Data Mining

This paper presents a prototype-driven framework for classifying evolving data streams. Our framework uses cluster prototypes to summarize the data and to determine whether the current model is outdated. This strategy of rebuilding the model only when significant changes are detected helps to reduce the computational overhead and the amount of labeled examples needed. To improve its accuracy, we also...

chapter

Evolutionary Optimization of Three-Photon Absorption in Molecular Iodine

R. Burbidge, J.J. Rowland, R.D. King, N.T. Form, more

2007 IEEE Symposium on Computational Intelligence and Data Mining > 96 - 100

2007 First IEEE Symposium on Computational Intelligence and Data Mining

We report on the application of an evolutionary algorithm to a noisy, dynamic optimization problem in chemistry: the maximization of three-photon absorption in molecular iodine. An evolution strategy is used in real-time in a closed loop experiment to search the space of physically realizable phase-modulated femtosecond laser pulses. The probability of three-photon absorption is estimated by measuring...

chapter

Induction Tree methods to classify M. tuberculosis spoligotypes

Georges Valetudie

2007 IEEE Symposium on Computational Intelligence and Data Mining > 101 - 106

2007 First IEEE Symposium on Computational Intelligence and Data Mining

In this paper we compared and analyzed four graph induction methods to automatically classify spoligotypes. A spoligotype is a sequence of 43 binary values provided by a DNA analysis technique. This method is known to be useful and efficient to many supervised learning problems. We found it interesting to use these techniques especially for sequential data, in order to create a classifier based on...

chapter

Data Clustering and Fuzzy Neural Network for Sales Forecasting in Printed Circuit Board Industry

Pei-Chann Chang, Chen-Hao Liu, Chin-Yuan Fan, Hsiao-Ching Chang

2007 IEEE Symposium on Computational Intelligence and Data Mining > 107 - 113

2007 First IEEE Symposium on Computational Intelligence and Data Mining

Reliable prediction of sales can improve the quality of business strategy. This research develops a hybrid model by integrating K-mean cluster and Fuzzy Back Propagation Network (KFBPN) to forecast the future sales of a printed circuit board factory. Based on the K-mean clustering technique, the historic data can be classified into different clusters, thus the noise of the original data can be reduced...

Publication date

Set your own date range

Keywords

DATA MINING (16)
PATTERN CLUSTERING (6)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (3)
DATA MINING APPLICATION (2)
DATABASE MANAGEMENT SYSTEMS (2)
EVOLUTIONARY COMPUTATION (2)
NEURAL NETWORKS (2)
PATTERN CLASSIFICATION (2)
PATTERN MATCHING (2)
PATTERN RECOGNITION (2)
PRINCIPAL COMPONENT ANALYSIS (2)
ROUGH SETS (2)
SUPPORT VECTOR MACHINES (2)
TREE DATA STRUCTURES (2)
ACCREDITATION (1)
ADAPTIVE CLUSTERING (1)
AGGLOMERATIVE INFORMATION BOTTLENECK (1)
AIR POLLUTION (1)
ANATOMY PHRASES (1)
BACK-PROPAGATION NEURAL NETWORK (BPN) (1)
BACKGROUND APPLICATION ACTIVITY (1)
BACKPROPAGATION (1)
BACKPROPAGATION NEURAL NETWORK (1)
BANDWIDTH ALLOCATION (1)
BANDWIDTH MANAGEMENT SYSTEM (1)
BEHAVIOR SCORING (1)
BIOLOGY COMPUTING (1)
BIPARTITE GRAPH (1)
BROWSING BEHAVIOR MINING (1)
BUSINESS SCHOOL (1)
CANDIDATE CLUSTER MINING (1)
CANONICAL FORM (1)
CARTOGRAPHY (1)
CLICK STEAM DATA (1)
CLIMATE (1)
CLUSTERING (1)
CLUSTERING ANALYSIS (1)
CLUSTERING SAMPLES (1)
COLLABORATIVE FILTERING (1)
COMPUTATIONAL INTELLIGENCE (1)
COMPUTER BEHAVIOR (1)
COMPUTER CONFIGURATIONS (1)
COMPUTER MEASUREMENTS (1)
COMPUTER WORM DETECTION (1)
CONSTRAINED QUESTION FUNCTIONS (1)
CONSTRAINED QUESTION VARIABLES (1)
CONTENT-BASED IMAGE RETRIEVAL (1)
CRYPTOGRAPHY (1)
CUSTOMER ACCESS SEQUENCE (1)
CUSTOMER FINANCE-AIDED SERVICE (1)
D-LIST DATA STRUCTURES (1)
DATA ANALYSIS (1)
DATA ANALYSIS SYSTEM (1)
DATA CLUSTERING (1)
DATA MINING SYSTEMS (1)
DATA PROCESSING (1)
DATA SEQUENCES (1)
DATA STREAM APPLICATIONS (1)
DATA VISUALISATION (1)
DATABASE PREPROCESSING (1)
DATABASES (1)
DATASET PATTERNS (1)
DECISION TREE CLASSIFICATION (1)
DECISION TREES (1)
DENSITY (1)
DIMENSIONALITY REDUCTION (1)
DISCRETE WAVELET TRANSFORM (DWT) (1)
DISTRIBUTED DOCUMENT CLUSTERING (1)
DISTRIBUTED INFORMATION BOTTLENECK (1)
DISTRIBUTED PROCESSING (1)
DOCUMENT HANDLING (1)
DYNAMIC GEO-SPATIAL QUERY PROBLEMS (1)
DYNAMIC PROGRAMMING (1)
EDUCATIONAL ADMINISTRATIVE DATA PROCESSING (1)
ELECTROENCEPHALOGRAM (EEG) (1)
ENCRYPTED INTERNET VOICE TRAFFIC (1)
ENCRYPTED VOIP (1)
EO-1 SATELLITE (1)
EVOLUTIONARY ARTIFICIAL NEURAL NETWORKS (1)
EVOLUTIONARY NEURAL NETWORKS (1)
EVOLUTIONARY OPTIMIZATION (1)
EVOLUTIONARY SEARCH (1)
EXCHANGE RATE (1)
EXTENDED RESOURCE-AWARE CLUSTER (1)
EYE TRACKING SYSTEMS (1)
FACILITY ESTABLISHMENT (1)
FACILITY LOCATION (1)
FACILITY LOCATIONS (1)
FAST UPDATED FP-TREE STRUCTURE (1)
FAULT DETECTION (1)
FEATURE SELECTION (1)
FEATURE VECTOR (1)
FIELD-BASED SERVICES (1)
FINANCIAL ENGINEERING (1)
FREQUENT SEQUENTIAL DATA STREAM PATTERN MINER (1)
FREQUENT SEQUENTIAL PATTERN DISCOVERY (1)
FREQUENT SEQUENTIAL PATTERNS (1)
FREQUENT SUBTREE MINING (1)
FSP-TREE DATA STRUCTURES (1)
FUFP-TREE CONSTRUCTION (1)
more

INFONA - science communication portal

2007 IEEE Symposium on Computational Intelligence and Data Mining

General CIDM Co-chairs' Welcome Letter

Program Committee

IEEE Symposium on Computational Intelligence and Data Mining (CIDM 2007)

Link Analysis of Incomplete Relationship Networks

Validity of Probabilistic Rules

An Efficient Distance Calculation Method for Uncertain Objects

K2GA: Heuristically Guided Evolution of Bayesian Network Structures from Data

Extracting Borderline Associations

Selecting the Right Peer Schools for AACSB Accreditation - A Data Mining Application

Structure Prediction in Temporal Networks using Frequent Subgraphs

An Analytical Evaluation of Objective Measures Behavior for Generalized Association Rules

Versatile and Efficient Meta-Learning Architecture: Knowledge Representation and Management in Computational Intelligence

Query-sensitive Feature Selection for Lazy Learners

Comparison of Classifiers Efficiency on Missing Values Recovering: Application in a Marketing Database with Massive Missing Data

Manifold Learning using Growing Locally Linear Embedding

A Novel Complex-Valued Counterpropagation Network

A Prototype-driven Framework for Change Detection in Data Stream Classification

Evolutionary Optimization of Three-Photon Absorption in Molecular Iodine

Induction Tree methods to classify M. tuberculosis spoligotypes

Data Clustering and Fuzzy Neural Network for Sales Forecasting in Printed Circuit Board Industry

Filter options

Publication date

Keywords

INFONA - science communication portal

2007 IEEE Symposium on Computational Intelligence and Data Mining $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2007 IEEE Symposium on Computational Intelligence and Data Mining