Data Warehousing and Knowledge Discovery
11th International Conference, DaWaK 2009 Linz, Austria, August 31–September 2, 2009 Proceedings

Extend UDF Technology for Integrated Analytics

Running analytics computation inside database engines through the use of UDFs (User Defined Functions) has been extensively investigated, but has not yet become a scalable approach due to two major limitations. One limitation is that existing UDFs are not relation-in, relation-out and schema-aware, cannot model complex applications, and cannot be composed with relational operators in a SQL...

chapter

High Performance Analytics with the R³-Cache

Todd Eavis, Ruhan Sayeed

Lecture Notes in Computer Science > Data Warehousing and Knowledge Discovery > Analytics > 271-286

Contemporary data warehouses now represent some of the world’s largest databases. As these systems grow in size and complexity, however, it becomes increasingly difficult for brute force query processing approaches to meet the performance demands of end users. Certainly, improved indexing and more selective view materialization are helpful in this regard. Nevertheless, with warehouses moving into...

chapter

Open Source BI Platforms: A Functional and Architectural Comparison

Matteo Golfarelli

Lecture Notes in Computer Science > Data Warehousing and Knowledge Discovery > Analytics > 287-297

While in the past the BI market was strictly dominated by closed source and commercial tools, the last few years were characterized by the birth of open source solutions: first as single BI tools, and later as complete BI platforms. An Open Source BI platform provides a full spectrum of BI capabilities within a unified system that reduces the overhead for the development and management of each application,...

chapter

Ontology-Based Exchange and Immediate Application of Business Calculation Definitions for Online Analytical Processing

Matthias Kehlenbeck, Michael H. Breitner

Lecture Notes in Computer Science > Data Warehousing and Knowledge Discovery > Analytics > 298-311

Business users define calculated facts based on the dimensions and facts contained in a data warehouse. These business calculation definitions contain necessary knowledge regarding quantitative relations for deep analyses and for the production of meaningful reports. The business calculation definitions are implementation-independent and largely organization-independent. But no automated procedures facilitating...

chapter

Dynamic Clustering-Based Estimation of Missing Values in Mixed Type Data

Vadim V. Ayuyev, Joseph Jupin, Philip W. Harris, Zoran Obradovic

Lecture Notes in Computer Science > Data Warehousing and Knowledge Discovery > Clustering > 366-377

The appropriate choice of a method for imputation of missing data becomes especially important when the fraction of missing values is large and the data are of mixed type. The proposed dynamic clustering imputation (DCI) algorithm relies on similarity information from shared neighbors, where mixed type variables are considered together. When evaluated on a public social science dataset of 46,043 mixed...

chapter

The PDG-Mixture Model for Clustering

M. Julia Flores, José A. Gámez, Jens D. Nielsen

Lecture Notes in Computer Science > Data Warehousing and Knowledge Discovery > Clustering > 378-389

Within data mining, clustering can be considered the most important unsupervised learning problem, which deals with finding structure in a collection of unlabeled data. Generally, clustering refers to the process of organizing objects into groups whose members are similar. Among clustering approaches, those methods based on probabilistic models have been extensively developed, such as Naïve Bayes...

chapter

Clustering for Video Retrieval

Petr Chmelar, Ivana Rudolfova, Jaroslav Zendulka

Lecture Notes in Computer Science > Data Warehousing and Knowledge Discovery > Clustering > 390-401

The paper deals with an application of clustering that we used as one of the data reduction methods in processing the huge amount of video data provided for TRECVid evaluations. The problem we solved by means of clustering was to partition the local feature descriptor space so that thousands of partitions represent visual words, which may be effectively employed in video retrieval using classical information...

Data Warehouse Modeling

Data Mining

Physical Design

Invited Talk

Data Mining Applications

Pattern Mining

Spatio-Temporal Mining

Data Streams

Clustering

Rule Mining

Analytics

Data Cubes

OLAP Recommendation

Extend UDF Technology for Integrated Analytics

High Performance Analytics with the R³-Cache

Open Source BI Platforms: A Functional and Architectural Comparison

Ontology-Based Exchange and Immediate Application of Business Calculation Definitions for Online Analytical Processing

Dynamic Clustering-Based Estimation of Missing Values in Mixed Type Data

The PDG-Mixture Model for Clustering

Clustering for Video Retrieval
