2008 IEEE International Conference on Data Mining Workshops

Items from 101 to 120 out of 139 results

chapter

The Set Classification Problem and Solution Methods

Xia Ning, G. Karypis

2008 IEEE International Conference on Data Mining Workshops > 720 - 729

2008 IEEE International Conference on Data Mining Workshops

This paper focuses on developing classification algorithms for problems in which there is a need to predict the class based on multiple observations (examples) of the same phenomenon (class). These problems give rise to a new classification problem, referred to as set classification, that requires the prediction of a set of instances given the prior knowledge that all the instances of the set belong...

chapter

If Constraint-Based Mining is the Answer: What is the Constraint? (Invited Talk)

J.-F. Boulicaut

2008 IEEE International Conference on Data Mining Workshops > 730

2008 IEEE International Conference on Data Mining Workshops

Constraint-based mining has been proven to be extremely useful. It has been applied not only to many pattern discovery settings (e.g., for sequential pattern mining) but also, recently, on classification and clustering tasks (see, e.g., ). It appears as a key technology for an inductive database perspective on knowledge discovery in databases (KDD), and constraint-based mining is indeed an answer...

chapter

Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification

Shiliang Sun

2008 IEEE International Conference on Data Mining Workshops > 731 - 735

2008 IEEE International Conference on Data Mining Workshops

For multi-view learning, existing methods usually exploit originally provided features for classifier training, which ignore the latent correlation between different views. In this paper, semantic features integrating information from multiple views are extracted for pattern representation. Canonical correlation analysis is used to learn the representation of semantic spaces where semantic features...

chapter

Ontology-Based Protein-Protein Interactions Extraction from Literature Using the Hidden Vector State Model

Yulan He, K. Nakata, Deyu Zhou

2008 IEEE International Conference on Data Mining Workshops > 736 - 743

2008 IEEE International Conference on Data Mining Workshops

This paper proposes a novel framework of incorporating protein-protein interactions (PPI) ontology knowledge into PPI extraction from biomedical literature in order to address the emerging challenges of deep natural language understanding. It is built upon the existing work on relation extraction using the hidden vector state (HVS) model. The HVS model belongs to the category of statistical learning...

chapter

Keyword Extraction Based on Lexical Chains and Word Co-occurrence for Chinese News Web Pages

Xinghua Li, Xindong Wu, Xuegang Hu, Fei Xie, more

2008 IEEE International Conference on Data Mining Workshops > 744 - 751

2008 IEEE International Conference on Data Mining Workshops

This paper presents a new keyword extraction algorithm for Chinese news Web pages using lexical chains and word co-occurrence combined with frequency features, cohesion features, and corelation features. A lexical chain is an external performance consistency by semantically related words of a text, and is the representation of the semantic content of a portion of the text. Word co-occurrence distribution...

chapter

OntoDM: An Ontology of Data Mining

P. Panov, S. Dzeroski, L. Soldatova

2008 IEEE International Conference on Data Mining Workshops > 752 - 760

2008 IEEE International Conference on Data Mining Workshops

Motivated by the need for unification of the field of data mining and the growing demand for formalized representation of outcomes of research, we address the task of constructing an ontology of data mining. The proposed ontology, named OntoDM, is based on a recent proposal of a general framework for data mining, and includes definitions of basic data mining entities, such as datatype and dataset,...

chapter

Semantic Annotation and Services for KDD Tools Sharing and Reuse

C. Diamantini, D. Potena

2008 IEEE International Conference on Data Mining Workshops > 761 - 770

2008 IEEE International Conference on Data Mining Workshops

Active KDD research groups typically make their software tools at disposal of others through the net. However, integration and reuse of these tools typically require a considerable amount of time to understand software scope and use, install it, transform data in a format compatible with the required input. This paper introduces a semantic based, service-oriented framework for tools sharing and reuse,...

chapter

Exploiting Data Semantics to Discover, Extract, and Model Web Sources

J.L. Ambite, C.A. Knoblock, K. Lerman, A. Plangprasopchok, more

2008 IEEE International Conference on Data Mining Workshops > 771 - 779

2008 IEEE International Conference on Data Mining Workshops

We describe Deimos, a system that automatically discovers and models new sources of information.The system exploits four core technologies developed by our group that makes an end-to-end solution to this problem possible. First, given an example source, Deimos finds other similar sources online. Second, it invokes and extracts data from these sources. Third, given the syntactic structure of a source,...

chapter

Using Contextual Information to Decrease the Cost of Incorrect Predictions in On-line Customer Behavior Modeling

M. Gorgoglione, C. Palmisano, S. Lombardi

2008 IEEE International Conference on Data Mining Workshops > 780 - 788

2008 IEEE International Conference on Data Mining Workshops

The performance of user profiling models depends on both the predictive accuracy and the cost of incorrect predictions. In this paper we study whether including contextual information leads to a decrease in the misclassification cost. Several experimental analyses were done by varying the cost ratio, the market granularity and the granularity of context. The experimental results show that context...

chapter

Semantic Analysis Method for Unstructured Data in Telecom Services

M. Iwashita, K. Nishimatsu, S. Shimogawa

2008 IEEE International Conference on Data Mining Workshops > 789 - 795

2008 IEEE International Conference on Data Mining Workshops

A variety of services have recently been provided depending on highly developed networks and personal equipment. With these advances, connecting this equipment has become increasingly more complicated. Problems such as an increase in no-connection and determining the cause have become difficult in some cases because software is often updated to keep up with advancements in services or security. Telecom...

chapter

Using Contextual Information in Transactional Segmentation: An Empirical Study in E-Commerce

M.F. Faraone, M. Gorgoglione, C. Palmisano

2008 IEEE International Conference on Data Mining Workshops > 796 - 805

2008 IEEE International Conference on Data Mining Workshops

The growing complexity and variability characterizing markets have induced scholars and marketers to propose new segmentation approaches. Recent research has shown that including the context in which a transaction occurs in customer behavior models, improves the ability of predicting their behavior. However, no systematic research has studied whether contextual information really matters in market...

chapter

Simultaneous Co-segmentation and Predictive Modeling for Large, Temporal Marketing Data

M. Deodhar, J. Ghosh

2008 IEEE International Conference on Data Mining Workshops > 806 - 815

2008 IEEE International Conference on Data Mining Workshops

Several marketing problems involve prediction of customer purchase behavior and forecasting future preferences. We consider predictive modeling of large scale, bi-modal or multimodal temporal marketing data, for instance, datasets consisting of customer spending behavior over time. Such datasets are characterized by variability in purchase patterns across different customer subgroups and shifting...

chapter

Title-Composing Support System for Reaching New Audiences

Y. Nishihara, W. Sunayama

2008 IEEE International Conference on Data Mining Workshops > 816 - 822

2008 IEEE International Conference on Data Mining Workshops

This paper proposes a support system for composing good titles for research papers in order to reach new audiences. Our system takes titles as input. The system evaluates title understandability and interest level of a title. The system ranks titles and outputs a title list. Users are able to recompose their titles by referring to the list and each evaluation value. Using the system, users can obtain...

chapter

Innovation Game as Workplace for Sensing Values in Design and Market

Y. Ohsawa, Y. Maeno, A. Takaichi, Y. Nishihara

2008 IEEE International Conference on Data Mining Workshops > 823 - 828

2008 IEEE International Conference on Data Mining Workshops

The "value" in this paper can be dealt with as a new variable which business workers create from their interaction with the dynamic environment, on which they redesign products and the market sustainably. Here we first show how data mining and data visualization can provide useful tools for aiding marketerspsila/designerspsila sensitivity of emerging values of consumers/users. By visualizing...

chapter

Character String Analysis and Customer Path in Stream Data

K. Yada

2008 IEEE International Conference on Data Mining Workshops > 829 - 836

2008 IEEE International Conference on Data Mining Workshops

This purpose of this study is to propose a knowledge-discovery system that can abstract helpful information from character strings representing shopper visits to product sections associated with positive and negative purchasing events by applying character string parsing technologies to stream data describing customer purchasing behavior inside a store. Taking data that traced customers' movements...

chapter

Combining Behavioral and Social Network Data for Online Advertising

A. Bagherjeiran, R. Parekh

2008 IEEE International Conference on Data Mining Workshops > 837 - 846

2008 IEEE International Conference on Data Mining Workshops

There are two main requirements for effective advertising in social networks. The first is that links in the social network are relevant to the targeted ads. The second is that social information can be easily incorporated with existing targeting methods to predict response rates. Our purpose in this paper is to investigate these requirements. We measure the relevance of a social network, the Yahoo!...

chapter

Semantic Concept Learning through Massive Internet Video Mining

Peijiang Yuan, Bo Zhang, Jianmin Li

2008 IEEE International Conference on Data Mining Workshops > 847 - 853

2008 IEEE International Conference on Data Mining Workshops

Semantic concept learning is one of the most challenging problems in video retrieval. The key barrier for semantic concept learning is lack of annotated training data. Internet videos are different from ordinary videos: massive, rich information, customized, non-uniform format, uneven quality, little descriptive text, only a few shots with limited length etc. Therefore, Internet is a potential repository...

chapter

Video^M: Multi-video Synopsis

Teng Li, Tao Mei, In-So Kweon, Xian-Sheng Hua

2008 IEEE International Conference on Data Mining Workshops > 854 - 861

2008 IEEE International Conference on Data Mining Workshops

Conventional video representation methods focus predominantly on a single video, aiming at reducing the space-time redundancy as much as possible, while this paper describes a novel approach to simultaneously presenting dynamics of multiple videos, aiming at a less intrusive viewing experience. Given a main video and multiple supplementary videos, the proposed approach automatically constructs a synthesized...

chapter

Human Action Recognition by Radon Transform

Yan Chen, Qiang Wu, Xiangjian He

2008 IEEE International Conference on Data Mining Workshops > 862 - 868

2008 IEEE International Conference on Data Mining Workshops

A new feature description is used for human behaviour representation and recognition. The feature is based on Radon transforms of extracted silhouettes. Key postures are selected based on the Radon transform. Key postures are combined to construct an action template for each sequence. Linear discriminant analysis (LDA) is applied to the set of key postures to obtain low dimensional feature vectors...

chapter

A New Method for Multi-view Face Clustering in Video Sequence

Panpan Huang, Yunhong Wang, Ming Shao

2008 IEEE International Conference on Data Mining Workshops > 869 - 873

2008 IEEE International Conference on Data Mining Workshops

In the problem of face clustering with multi-views, the similarity between faces of different persons with similar pose is usually greater than the similarity between multi-view faces of the same person. This may exert a tremendous impact on the clustering result that sent back to the user. To solve this problem, we should do pose clustering first and then within each dasiapose grouppsila, clustering...

Publication date

Set your own date range

Keywords

DATA MINING (86)
CLASSIFICATION ALGORITHMS (29)
DATABASES (23)
DATA MODELS (19)
LEARNING (ARTIFICIAL INTELLIGENCE) (19)
CLUSTERING ALGORITHMS (18)
TRAINING (18)
DISTANCE MEASUREMENT (17)
FEATURE EXTRACTION (16)
ACCURACY (15)
PATTERN CLUSTERING (15)
ALGORITHM DESIGN AND ANALYSIS (14)
CONFERENCES (14)
PATTERN CLASSIFICATION (14)
ASSOCIATION RULES (13)
QUERY PROCESSING (11)
INTERNET (10)
ITEMSETS (10)
INDEXES (8)
KNOWLEDGE DISCOVERY (8)
PREDICTIVE MODELS (8)
STATISTICAL ANALYSIS (8)
COMPUTATIONAL MODELING (7)
CORRELATION (7)
DATABASE MANAGEMENT SYSTEMS (7)
DECISION TREES (7)
ESTIMATION (7)
GRAPH THEORY (7)
KERNEL (7)
MATHEMATICAL MODEL (7)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (7)
TEXT ANALYSIS (7)
TRAINING DATA (7)
OPTIMIZATION (6)
RELIABILITY (6)
VISUAL DATABASES (6)
WEB SERVICES (6)
BIOLOGICAL SYSTEM MODELING (5)
CLASSIFICATION (5)
CLUSTERING (5)
FILTERING (5)
GRAPH MINING (5)
MACHINE LEARNING (5)
MARKETING (5)
MATRIX ALGEBRA (5)
MERGING (5)
ONTOLOGIES (5)
PROTEINS (5)
SPATIAL DATABASES (5)
APPROXIMATION METHODS (4)
BIOLOGY (4)
BUILDINGS (4)
BUSINESS (4)
CITIES AND TOWNS (4)
DATA ANALYSIS (4)
ENGINES (4)
EQUATIONS (4)
HIDDEN MARKOV MODELS (4)
HUMANS (4)
IMAGE CLASSIFICATION (4)
LABELING (4)
LEARNING SYSTEMS (4)
METEOROLOGY (4)
NOISE (4)
PEDIATRICS (4)
PROBABILITY (4)
PROPOSALS (4)
REDUNDANCY (4)
REGRESSION ANALYSIS (4)
REMOTE SENSING (4)
SET THEORY (4)
SOCIAL NETWORK SERVICES (4)
SOFTWARE (4)
SUPPORT VECTOR MACHINES (4)
TIME SERIES ANALYSIS (4)
WEB PAGES (4)
AGRICULTURE (3)
AMINO ACIDS (3)
ANALYTICAL MODELS (3)
ANOMALY DETECTION (3)
ATMOSPHERIC MEASUREMENTS (3)
BENCHMARK TESTING (3)
CLASSIFICATION TREE ANALYSIS (3)
COMPANIES (3)
COMPLEXITY THEORY (3)
COMPUTER SCIENCE (3)
CONSUMER BEHAVIOUR (3)
DATA HANDLING (3)
DATA VISUALISATION (3)
DATA VISUALIZATION (3)
DELAY (3)
DISTRIBUTED DATABASES (3)
EVOLUTION (BIOLOGY) (3)
GEOGRAPHIC INFORMATION SYSTEMS (3)
GRAPHICS (3)
IMAGE COLOR ANALYSIS (3)
IMAGE SEQUENCES (3)
INFORMATION EXTRACTION (3)
IP NETWORKS (3)
KNOWLEDGE ENGINEERING (3)
more

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops

The Set Classification Problem and Solution Methods

If Constraint-Based Mining is the Answer: What is the Constraint? (Invited Talk)

Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification

Ontology-Based Protein-Protein Interactions Extraction from Literature Using the Hidden Vector State Model

Keyword Extraction Based on Lexical Chains and Word Co-occurrence for Chinese News Web Pages

OntoDM: An Ontology of Data Mining

Semantic Annotation and Services for KDD Tools Sharing and Reuse

Exploiting Data Semantics to Discover, Extract, and Model Web Sources

Using Contextual Information to Decrease the Cost of Incorrect Predictions in On-line Customer Behavior Modeling

Semantic Analysis Method for Unstructured Data in Telecom Services

Using Contextual Information in Transactional Segmentation: An Empirical Study in E-Commerce

Simultaneous Co-segmentation and Predictive Modeling for Large, Temporal Marketing Data

Title-Composing Support System for Reaching New Audiences

Innovation Game as Workplace for Sensing Values in Design and Market

Character String Analysis and Customer Path in Stream Data

Combining Behavioral and Social Network Data for Online Advertising

Semantic Concept Learning through Massive Internet Video Mining

Video^M: Multi-video Synopsis

Human Action Recognition by Radon Transform

A New Method for Multi-view Face Clustering in Video Sequence

Filter options

Publication date

Keywords

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2008 IEEE International Conference on Data Mining Workshops