2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

chapter

Querying Communities in Relational Databases

Lu Qin, J.X. Yu, Lijun Chang, Yufei Tao

2009 IEEE 25th International Conference on Data Engineering > 724 - 735

Keyword search on relational databases provides users with insights that they can not easily observe using the traditional RDBMS techniques. Here, an l-keyword query is specified by a set of l keywords, {k₁, k₂, ···, k_l}. It finds how the tuples that contain the keywords are connected in a relational database via the possible foreign key references. Conceptually, it is to find some...

chapter

Integrating and Ranking Uncertain Scientific Data

L. Detwiler, W. Gatterbauer, B. Louie, Dan Suciu, et al.

2009 IEEE 25th International Conference on Data Engineering > 1235 - 1238

Mediator-based data integration systems resolve exploratory queries by joining data elements across sources. In the presence of uncertainties, such multiple expansions can quickly lead to spurious connections and incorrect results. The BioRank project investigates formalisms for modeling uncertainty during scientific data integration and for ranking uncertain query results. Our motivating application...

chapter

On the Efficiency of Provenance Queries

A. Kementsietsidis, Min Wang

2009 IEEE 25th International Conference on Data Engineering > 1223 - 1226

While models for data provenance have been extensively studied in the literature, the efficient evaluation of the resulting provenance queries remains an open problem. Traditional query optimization techniques, like the use of general-purpose indexes, or the materialization of provenance data, fail on different fronts to address the problem. Provenance-specific optimization techniques, like the use...

chapter

Leveraging COUNT Information in Sampling Hidden Databases

A. Dasgupta, Nan Zhang, G. Das

2009 IEEE 25th International Conference on Data Engineering > 329 - 340

A large number of online databases are hidden behind form-like interfaces which allow users to execute search queries by specifying selection conditions in the interface. Most of these interfaces return restricted answers (e.g., only top-k of the selected tuples), while many of them also accompany each answer with the COUNT of the selected tuples. In this paper, we propose techniques which leverage...

chapter

Semantics of Ranking Queries for Probabilistic Data and Expected Ranks

G. Cormode, Feifei Li, Ke Yi

2009 IEEE 25th International Conference on Data Engineering > 305 - 316

When dealing with massive quantities of data, top-k queries are a powerful technique for returning only the k most relevant tuples for inspection, based on a scoring function. The problem of efficiently answering such ranking queries has been studied and analyzed extensively within traditional database settings. The importance of the top-k is perhaps even greater in probabilistic databases, where...

chapter

Efficient Indices Using Graph Partitioning in RDF Triple Stores

Ying Yan, Chen Wang, Aoying Zhou, Weining Qian, et al.

2009 IEEE 25th International Conference on Data Engineering > 1263 - 1266

With the advance of the semantic Web, varying RDF data were increasingly generated, published, queried, and reused via the Web. For example, the DBpedia, a community effort to extract structured data from Wikipedia articles, broke 100 million RDF triples in its latest release. Initiated by Tim Berners-Lee, likewise, the Linking Open Data (LOD) project has published and interlinked many open licence...

chapter

Extracting Web Query Interfaces Based on Form Structures and Semantic Similarity

Jun Hong, Zhongtian He, D.A. Bell

2009 IEEE 25th International Conference on Data Engineering > 1259 - 1262

Web databases are now pervasive. Such a database can be accessed via its query interface (usually HTML query form) only. Extracting Web query interfaces is a critical step in data integration across multiple Web databases, which creates a formal representation of a query form by extracting a set of query conditions in it. This paper presents a novel approach to extracting Web query interfaces. In...

chapter

BioNav: Effective Navigation on Query Results of Biomedical Databases

A. Kashyap, V. Hristidis, M. Petropoulos, S. Tavoulari

2009 IEEE 25th International Conference on Data Engineering > 1287 - 1290

Search queries on biomedical databases like PubMed often return a large number of results, only a small subset of which is relevant to the user. Ranking and categorization, which can also be combined, have been proposed to alleviate this information overload problem. Results categorization for biomedical databases is the focus of this work. A natural way to organize biomedical citations is according...

chapter

Supporting Database Applications as a Service

Mei Hui, Dawei Jiang, Guoliang Li, Yuan Zhou

2009 IEEE 25th International Conference on Data Engineering > 832 - 843

Multi-tenant data management is a form of software as a service (SaaS), whereby a third party service provider hosts databases as a service and provides its customers with seamless mechanisms to create, store and access their databases at the host site. One of the main problems in such a system, as we shall discuss in this paper, is scalability, namely the ability to serve an increasing number of...

chapter

GraphSig: A Scalable Approach to Mining Significant Subgraphs in Large Graph Databases

S. Ranu, A.K. Singh

2009 IEEE 25th International Conference on Data Engineering > 844 - 855

Graphs are being increasingly used to model a wide range of scientific data. Such widespread usage of graphs has generated considerable interest in mining patterns from graph databases. While an array of techniques exists to mine frequent patterns, we still lack a scalable approach to mine statistically significant patterns, specifically patterns with low p-values, that occur at low frequencies. We...

chapter

Incremental Reverse Nearest Neighbor Ranking

H.-P. Kriegel, P. Kroger, M. Renz, A. Zufle, et al.

2009 IEEE 25th International Conference on Data Engineering > 1560 - 1567

In this paper, we formalize the novel concept of incremental reverse nearest neighbor ranking and suggest an original solution for this problem. We propose an efficient approach for reporting the results incrementally without the need to restart the search from scratch. Our approach can be applied to a multi-dimensional feature database which is hierarchically organized by any R-tree like index structure...

chapter

Galaxy: Encouraging Data Sharing among Sources with Schema Variants

P. Mork, L. Seligman, A. Rosenthal, M. Morse, et al.

2009 IEEE 25th International Conference on Data Engineering > 1551 - 1554

This demonstration presents Galaxy, a schema manager that facilitates easy and correct data sharing among autonomous but related, evolving data sources. Galaxy reduces heterogeneity by helping database developers identify, reuse, customize, and advertise related schema components. The central idea is that as schemata are customized, Galaxy maintains a derivation graph, and exploits it for data exchange,...

chapter

Efficient and Robust Database Support for Data-Intensive Applications in Dynamic Environments

J.O. Hauglid, K. Norvag, N.H. Ryeng

2009 IEEE 25th International Conference on Data Engineering > 1547 - 1550

Requirements from new types of applications call for new database system solutions. Computational science applications performing distributed computations on grid networks with requirements for efficient storage and query solutions are now emerging. For this purpose we have developed DASCOSA-DB, a P2P-based distributed database system, which in addition to providing location-transparent storage and...

chapter

XBLEND: Visual XML Query Formulation Meets Query Processing

Zhou Yong, S.S. Bhowmick, E. Leonardi, K.G. Widjanarko

2009 IEEE 25th International Conference on Data Engineering > 1535 - 1538

Due to the complexity of XML query languages, the need for visual query interfaces that can reduce the burden of query formulation is fundamental to the spreading of XML to a wider community. We present an RDBMS-based XML query evaluation system, called XBLEND, that takes a novel and non-traditional approach to improving query performance by blending visual query formulation and query processing. It...

chapter

Weighted Set Similarity: Queries and Updates

D. Srivastava

2009 IEEE 25th International Conference on Data Engineering > 1559

Summary form only given. Consider a universe of items, each of which is associated with a weight, and a database consisting of subsets of these items. Given a query set, a weighted set similarity query identifies either (i) all sets in the database whose normalized similarity to the query set is above a pre-specified threshold, or (ii) the sets in the database with the k highest similarity values...

chapter

Chameleon: Context-Awareness inside DBMSs

H.G. Elmongui, W.G. Aref, M.F. Mokbel

2009 IEEE 25th International Conference on Data Engineering > 1335 - 1338

Context is any information used to characterize the situation of an entity. Examples of contexts include time, location, identity, and activity of a user. This paper proposes a general context-aware DBMS, named Chameleon, that will eliminate the need for having specialized database engines, e.g., spatial DBMS, temporal DBMS, and Hippocratic DBMS, since space, time, and identity can be treated as contexts...

chapter

Efficient Processing of Warping Time Series Join of Motion Capture Data

Yueguo Chen, Gang Chen, Ke Chen, Beng Chin Ooi

2009 IEEE 25th International Conference on Data Engineering > 1048 - 1059

Discovering non-trivial matching subsequences from two time series is very useful in synthesizing novel time series. This can be applied to applications such as motion synthesis where smooth and natural motion sequences are often required to be generated from existing motion sequences. We first address this problem by defining it as a problem of l-ε-join over two time series. Given two time series,...

chapter

An Integrated Approach to Performance Monitoring for Autonomous Tuning

A. Thiem, K.-U. Sattler

2009 IEEE 25th International Conference on Data Engineering > 1671 - 1678

With an ever growing complexity and data volume, the administration of today's relational database management systems has become one of the most important cost factors in their operation. Dynamic workloads and shifting demands require continuous effort from the DBA to deliver adequate performance. The goal of a modern DBMS must be to support the DBA's work with automated processes and workflows that...

chapter

Holistically Twig Matching in Probabilistic XML

Yawen Li, Guoren Wang, Junchang Xin, Ende Zhang, et al.

2009 IEEE 25th International Conference on Data Engineering > 1649 - 1656

Traditional databases manage only deterministic information, but now many applications that use databases involve uncertain data. For example, it is infeasible for a sensor database to contain only the exact value of each sensor at all points in time. The uncertainty is inherent in these systems due to measurement and sampling errors, and resource limitations. This paper aims at the query processing...

chapter

Parallel Skyline Computation on Multicore Architectures

Sungwoo Park, Taekyung Kim, Jonghyun Park, Jinha Kim, et al.

2009 IEEE 25th International Conference on Data Engineering > 760 - 771

With the advent of multicore processors, it has become imperative to write parallel programs if one wishes to exploit the next generation of processors. This paper deals with skyline computation as a case study of parallelizing database operations on multicore architectures. We compare two parallel skyline algorithms: a parallel version of the branch-and-bound algorithm (BBS) and a new parallel algorithm...
