19th International Conference on Scientific and Statistical Database Management (SSDBM 2007)

High energy physics scientists analyze large amounts of data looking for interesting events when particles collide. These analyses are easily expressed using complex queries that filter events. We developed a cost model for aggregation operators and other functions used in such queries and show that it substantially improves performance. However, the query optimizer still produces suboptimal plans...

chapter

What Constitutes a Scientific Database?

J.L. Pfaltz

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 2

2007 International Conference on Scientific and Statistical Database Management

We propose that a scientific database should be inherently different from, say a business database. The difference is based on the nature of science itself, in which hypotheses, or logical implications, form an essential part of the discipline. Empirical observations give rise to tentative hypotheses. Individual hypotheses are then tested, refuted or refined, by further empirical observation. In the...

chapter

Efficient Evaluation of Inbreeding Queries on Pedigree Data

B. Elliott, S.F. Akgul, S. Mayes, Z.M. Ozsoyoglu

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 3

2007 International Conference on Scientific and Statistical Database Management

We consider pedigree data structured in the form of a directed acyclic graph, and use an encoding scheme, called NodeCodes, for expediting the evaluation of queries on pedigree graph structures. Inbreeding is the quantitative measure of the genetic relationship between two individuals. The inbreeding coefficient is related to the probability that both copies of any given gene are received from the...

chapter

Managing Scientific Data: New Challenges for Database Research

Marianne Winslett

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 4

2007 International Conference on Scientific and Statistical Database Management

The database research community's appetite for new applications has led to increased interest in the data management needs of scientists. This area encompasses a huge range of applications, extending from public repositories of observational data such as the popular Sloan Digital Sky Survey to one-of-a-kind runs of simulation codes crafted by individual scientists. In this talk, we will survey the...

chapter

Maintaining K-Anonymity against Incremental Updates

Jian Pei, Jian Xu, Zhibin Wang, Wei Wang, more

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 5

2007 International Conference on Scientific and Statistical Database Management

K-anonymity is a simple yet practical mechanismto protect privacy against attacks of re-identifying individuals by joining multiple public data sources. All existing methods achieving k-anonymity assume implicitly that the data objects to be anonymized are given once and fixed. However, in many applications, the real world data sources are dynamic. In this paper, we investigate the problem of maintaining...

chapter

MAMCost: Global and Local Estimates leading to Robust Cost Estimation of Similarity Queries

G.B. Baioco, A.J.M. Traina, C. Traina

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 6

2007 International Conference on Scientific and Statistical Database Management

This paper presents an effective cost model to estimate the number of disk accesses (I/O cost) and the number of distance calculations (CPU cost) to process similarity queries over data indexed by metric access methods. Two types of similarity queries were taken into consideration: range and k-nearest neighbor queries. The main point of the cost model is considering not only global parameters of the...

chapter

On Exploring Complex Relationships of Correlation Clusters

E. Achtert, C. Bohm, H.-P. Kriegel, P. Kroger, more

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 7

2007 International Conference on Scientific and Statistical Database Management

In high dimensional data, clusters often only exist in arbitrarily oriented subspaces of the feature space. In addition, these so-called correlation clusters may have complex relationships between each other. For example, a correlation cluster in a 1-D subspace (forming a line) may be enclosed within one or even several correlation clusters in 2-D superspaces (forming planes). In general, such relationships...

chapter

Component-based Data Layout for Efficient Slicing of Very Large Multidimensional Volumetric Data

Jusub Kim, J. JaJa

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 8

2007 International Conference on Scientific and Statistical Database Management

In this paper, we introduce a new efficient data layout scheme to efficiently handle out-of-core axis-aligned slicing queries of very large multidimensional volumetric data. Slicing is a very useful dimension reduction tool that removes or reduces occlusion problems in visualizing 3D/4D volumetric data sets and that enables fast visual exploration of such data sets. We show that the data layouts based...

chapter

Information-Aware 2^n-Tree for Efficient Out-of-Core Indexing of Very Large Multidimensional Volumetric Data

Jusub Kim, J. JaJa

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 9

2007 International Conference on Scientific and Statistical Database Management

We discuss a new efficient out-of-core multidimensional indexing structure, information-aware 2ⁿ-tree, for indexing very large multidimensional volumetric data. Building a series of (n-1)-Dimensional indexing structures on n-Dimensional data causes a scalability problem in the situation of continually growing resolution in every dimension. However, building a single n-Dimensional indexing structure...

chapter

Boosting k-Nearest Neighbor Queries Estimating Suitable Query Radii

M.R. Vieira, C. Traina, A.J.M. Traina, A. Arantes, more

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 10

2007 International Conference on Scientific and Statistical Database Management

This paper proposes novel and effective techniques to estimate a radius to answer k-nearest neighbor queries. The first technique targets datasets where it is possible to learn the distribution about the pairwise distances between the elements, generating a global estimation that applies to the whole dataset. The second technique targets datasets where the first technique cannot be employed, generating...

chapter

Efficient Approximation of Spatial Network Queries using the M-Tree with Road Network Embedding

K. Shaw, E. Ioup, J. Sample, M. Abdelguerfi, more

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 11

2007 International Conference on Scientific and Statistical Database Management

Spatial networks, such as road systems, operate differently from normal geospatial systems because objects are constrained to locations on the network. Performing queries on spatial networks demands entirely different solutions. Most spatial queries make use of an R-Tree to process them efficiently. The M-Tree is a data tree index which is capable of indexing data in any metric space. The M-Tree index...

chapter

On Efficient Processing of Subspace Skyline Queries on High Dimensional Data

Wen Jin, A.K.H. Tung, M. Ester, Jiawei Han

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 12

2007 International Conference on Scientific and Statistical Database Management

Recent studies on efficiently answering subspace skyline queries can be separated into two approaches. The first focused on pre-materializing a set of skylines points in various subspaces while the second focus on dynamically answering the queries by using a set of anchors to prune off skyline points through spatial reasoning. Despite effort to compress the pre-materialized subspace skylines through...

chapter

MonetDB/SQL Meets SkyServer: the Challenges of a Scientific Database

M. Ivanova, N. Nes, R. Goncalves, M. Kersten

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) > 13

2007 International Conference on Scientific and Statistical Database Management

This paper presents our experiences in porting the Sloan Digital Sky Survey(SDSS)/ SkyServer to the state-of- the-art open source database system MonetDB/SQL. SDSS acts as a well-documented benchmark for scientific database management. We have achieved a fully functional prototype for the personal SkyServer, to be downloaded from our site. The lessons learned are 1) the column store approach of MonetDB...

Publication date

Set your own date range

Keywords

QUERY PROCESSING (18)
DATABASE INDEXING (8)
DATA MINING (5)
DATABASE MANAGEMENT SYSTEMS (5)
TREE DATA STRUCTURES (5)
WIRELESS SENSOR NETWORKS (5)
DATA ANALYSIS (4)
BIOLOGY COMPUTING (3)
GRAPH THEORY (3)
VISUAL DATABASES (3)
DATA HANDLING (2)
GENETICS (2)
GEOGRAPHIC INFORMATION SYSTEMS (2)
MEDICAL COMPUTING (2)
RELATIONAL DATABASES (2)
SCIENTIFIC DATABASE (2)
SCIENTIFIC INFORMATION SYSTEMS (2)
SENSOR NETWORKS (2)
SQL (2)
VERY LARGE DATABASES (2)
VERY LARGE MULTIDIMENSIONAL VOLUMETRIC DATA (2)
2D SUPERSPACES (1)
ABSTRACT DATA TYPES (1)
ABSTRACT TREE STRUCTURE (1)
ACQUISITIONAL PROTOCOLS (1)
ADAPTIVE MULTIRESERVOIR SAMPLING ALGORITHM (1)
ADAPTIVE SIGNAL PROCESSING (1)
ADAPTIVE WAVELET DENSITY ESTIMATORS (1)
ADAPTIVE-SIZE RESERVOIR SAMPLING (1)
AGGREGATE MONITORING (1)
AGGREGATE QUERY (1)
AGGREGATION OPERATORS (1)
APPROXIMATE QUANTILES (1)
APPROXIMATE QUERY EVALUATION (1)
ARBITRARY-SIZE STREAMS (1)
ARCHIVED HISTORICAL STREAM DATA (1)
AUTOMATED METABOLIC PATHWAY CATEGORIZATION (1)
BIOCHEMISTRY (1)
BIOLOGICAL DATA SOURCES (1)
BIOLOGICAL PATHWAYS (1)
BITMAP BASED TECHNIQUE (1)
BITMAP INDEX (1)
BITMAP INDEXING (1)
BLOCK-WISE MERGE (1)
BOOSTING K-NEAREST NEIGHBOR QUERIES (1)
BUFFER ALLOCATION (1)
CACHE MEMORY SIZE (1)
CACHE STORAGE (1)
CACHE-CONSCIOUS INDEXING (1)
CELLULAR BIOPHYSICS (1)
CELLULAR MECHANISMS (1)
CLUSTER HIERARCHY VISUALIZATION (1)
COLLISION-FREE COMMUNICATION (1)
COMPLEX QUERIES (1)
COMPLEX SCIENTIFIC QUERIES (1)
COMPONENT-BASED DATA LAYOUT (1)
COMPRESSED INFORMATION (1)
COMPUTATIONAL COMPLEXITY (1)
COMPUTATIONAL COST (1)
COMPUTERISED MONITORING (1)
CONTINUOUS EVALUATION (1)
CONTINUOUS K-NEAREST-NEIGHBOR QUERY (1)
CORRELATION CLUSTER COMPLEX RELATIONSHIPS (1)
CORRELATION FRACTAL DIMENSION (1)
COST-BASED OPTIMIZATION (1)
COST-BASED OPTIMIZATION FRAMEWORK (1)
CSR+-TREE (1)
DATA ACQUISITION (1)
DATA AGGREGATION (1)
DATA COLLECTION (1)
DATA COMPRESSION (1)
DATA DISTRIBUTION (1)
DATA INCONSISTENCY (1)
DATA INTEGRITY (1)
DATA MINING TASK (1)
DATA MONITORING (1)
DATA PRIVACY (1)
DATA REDUNDANCY (1)
DATA REPRESENTATION (1)
DATA RESILIENT ALGORITHM (1)
DATA SET (1)
DATA SLICING (1)
DATA STREAM (1)
DATA STRUCTURES (1)
DATA VISUALISATION (1)
DATA-DRIVEN MEMORY MANAGEMENT SCHEME (1)
DATABASE MANAGEMENT SYSTEM (1)
DATASET TRACKING (1)
DBMS OPTIMIZATION (1)
DEEP DATA INTEGRATION (1)
DETERMINISTIC ERROR BOUNDS (1)
DETERMINISTIC SCHEDULES (1)
DIRECTED ACYCLIC GRAPH (1)
DIRECTED GRAPHS (1)
DISEASE DIAGNOSIS (1)
DISK ACCESSES NUMBER (1)
DISTANCE CALCULATIONS NUMBER (1)
DISTANCE COMPUTATION SCHEME (1)
DISTRIBUTED ALGORITHM (1)
DISTRIBUTED DATABASES (1)
more

INFONA - science communication portal

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007)

19th International Conference on Scientific and Statistical Database Management-Cover

19th International Conference on Scientific and Statistical Database Management-Title

19th International Conference on Scientific and Statistical Database Management-Copyright

19th International Conference on Scientific and Statistical Database Management-TOC

Foreword by the General Chair

Foreword by the Program Chair

Committees

Cost-based Optimization of Complex Scientific Queries

What Constitutes a Scientific Database?

Efficient Evaluation of Inbreeding Queries on Pedigree Data

Managing Scientific Data: New Challenges for Database Research

Maintaining K-Anonymity against Incremental Updates

MAMCost: Global and Local Estimates leading to Robust Cost Estimation of Similarity Queries

On Exploring Complex Relationships of Correlation Clusters

Component-based Data Layout for Efficient Slicing of Very Large Multidimensional Volumetric Data

Information-Aware 2^n-Tree for Efficient Out-of-Core Indexing of Very Large Multidimensional Volumetric Data

Boosting k-Nearest Neighbor Queries Estimating Suitable Query Radii

Efficient Approximation of Spatial Network Queries using the M-Tree with Road Network Embedding

On Efficient Processing of Subspace Skyline Queries on High Dimensional Data

MonetDB/SQL Meets SkyServer: the Challenges of a Scientific Database

Filter options

Publication date

Keywords

INFONA - science communication portal

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

19th International Conference on Scientific and Statistical Database Management (SSDBM 2007)