2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

Items from 1 to 20 out of 37 results

chapter

EGNAT: A Fully Dynamic Metric Access Method for Secondary Memory

R.U. Paredes, G. Navarro

2009 Second International Workshop on Similarity Search and Applications > 57 - 64

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

We introduce a novel metric space search data structure called EGNAT, which is fully dynamic and designed for secondary memory. The EGNAT is based on Brin's GNAT static index, and partitions the space according to hyperplanes. The EGNAT implements deletions using a novel technique dubbed Ghost Hyperplanes, which is of independent interest for other metric space indexes. We show experimentally that...

chapter

Metric Index: An Efficient and Scalable Solution for Similarity Search

D. Novak, M. Batko

2009 Second International Workshop on Similarity Search and Applications > 65 - 73

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

Metric space as a universal and versatile model of similarity can be applied in various areas of non-text information retrieval. However, a general, efficient and scalable solution for metric data management is still a resisting research challenge. We introduce a novel indexing and searching mechanism called metric index (M-Index), that employs practically all known principles of metric space partitioning,...

chapter

Author Index

2009 Second International Workshop on Similarity Search and Applications > 167

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

chapter

Using Tuneable Fuzzy Similarity in Non-metric Search

P. Vojtas, A. Eckhardt

2009 Second International Workshop on Similarity Search and Applications > 163 - 164

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

We propose an alternate method for indexing data for answering queries in non-metric spaces. The traditional use of distance and triangle inequality is substituted with the use of fuzzy similarity fulfilling the transitivity property with a tuneable fuzzy conjunctor. In a non-metric space it is still possible that there is a fuzzy conjunctor such that transitivity holds and usual indexing techniques...

chapter

Title Page i

2009 Second International Workshop on Similarity Search and Applications > i

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

The following topics are dealt with: similarity query search; data analysis; query-evaluation algorithm; distributed and parallel index structure and image retrieval.

chapter

Curse of Dimensionality in Pivot Based Indexes

I. Volnyansky, V. Pestov

2009 Second International Workshop on Similarity Search and Applications > 39 - 46

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

We offer a theoretical validation of the curse of dimensionality in the pivot based indexing of datasets for similarity search, by proving, in the framework of statistical learning, that in high dimensions no pivot based indexing scheme can essentially outperform the linear scan. A study of the asymptotic performance of pivot based indexing schemes is performed on a sequence of datasets modeled as...

chapter

Dynamic Spatial Approximation Trees for Massive Data

G. Navarro, N. Reyes

2009 Second International Workshop on Similarity Search and Applications > 81 - 88

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

Metric space searching is an emerging technique to address the problem of efficient similarity searching in many applications, including multimedia databases and other repositories handling complex objects. Although promising, the metric space approach is still immature in several aspects that are well established in traditional databases. In particular, most indexing schemes are not dynamic, that...

chapter

Organizing and Program Committees

2009 Second International Workshop on Similarity Search and Applications > ix

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

chapter

Cover Art

2009 Second International Workshop on Similarity Search and Applications > C1

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

chapter

2009 Second International Workshop on Similarity Search and Applications > v - vi

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

chapter

External Reviewers

2009 Second International Workshop on Similarity Search and Applications > x

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

chapter

Principles of Information Filtering in Metric Spaces

P. Ciaccia, M. Patella

2009 Second International Workshop on Similarity Search and Applications > 99 - 106

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

The traditional problem of similarity search requires to find, within a set of points, those that are closer to a query point q, according to a distance function d. In this paper we introduce the novel problem of metric filtering: in this scenario, each data point x_i possesses its own distance function d_i and the task is to find those points that are close enough, according to d_i, to a query point...

chapter

Combinatorial Framework for Similarity Search

Y. Lifshits

2009 Second International Workshop on Similarity Search and Applications > 11 - 17

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

We present an overview of combinatorial framework for similarity search. An algorithm is combinatorial if only direct comparisons between two pairwise similarity values are allowed. Namely, the input dataset is represented by a comparison oracle that given any three points X,Y,Z answers whether Y or Z is closer to X. We assume that the similarity order of the dataset satisfies the four variations...

chapter

Optimal Pivots to Minimize the Index Size for Metric Access Methods

L.G. Ares, N.R. Brisaboa, M.F. Esteller, O. Pedreira, more

2009 Second International Workshop on Similarity Search and Applications > 74 - 80

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

We consider the problem of similarity search in metric spaces with costly distance functions and large databases. There is a trade-off between the amount of information stored in the index and the reduction in the number of comparisons for solving a query. Pivot-based methods clearly outperform clustering-based ones in number of comparisons, but their space requirements are higher and this can prevent...

chapter

Efficient Similarity Search by Reducing I/O with Compressed Sketches

A.J. Muller-Molina, T. Shinohara

2009 Second International Workshop on Similarity Search and Applications > 30 - 38

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

Sketches are compact bit string representations of objects. Objects that have the same sketch are stored in the same database bucket. By calculating the Hamming distance of the sketches, an estimation of the similarity of their respective objects can be obtained. Objects that are close to each other are expected to have sketches with small hamming distance values. This estimation helps to schedule...

chapter

Structural Entropic Difference: A Bounded Distance Metric for Unordered Trees

R. Connor, F. Simeoni, M. Iakovos

2009 Second International Workshop on Similarity Search and Applications > 21 - 29

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

We show a new metric for comparing unordered, tree-structured data. While such data is increasingly important in its own right, the methodology underlying the construction of the metric is generic and may be reused for other classes of ordered and partially ordered data. The metric is based on the information content of the two values under consideration, which is measured using Shannon's entropy...

chapter

CoPhIR Image Collection under the Microscope

M. Batko, P. Kohoutkova, D. Novak

2009 Second International Workshop on Similarity Search and Applications > 47 - 54

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

The content-based photo image retrieval (CoPhIR) dataset is the largest available database of digital images with corresponding visual descriptors. It contains five MPEG-7 global descriptors extracted from more than 106 million images from Flickr photo-sharing system. In this paper, we analyze this dataset focusing on 1) efficiency of similarity-based indexing and searching and on 2) expressiveness...

chapter

Speeding Up Permutation Based Indexing with Indexing

K. Figueroa, K. Frediksson

2009 Second International Workshop on Similarity Search and Applications > 107 - 114

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

A recent probabilistic approach for searching in high dimensional metric spaces is based on predicting the distances between database elements according to how they order their distances towards some set of distinguished elements, called permutants. In the preprocessing phase a set of permutants is chosen, and are sorted (permuted) by their distances against every database element. The permutations...

chapter

MUFIN: A Multi-feature Indexing Network

M. Batko, V. Dohnal, D. Novak, J. Sedmidubsky

2009 Second International Workshop on Similarity Search and Applications > 158 - 159

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

By exploiting and extending the research achievements in metric searching technology over the last ten years, MUFIN proves the expected extensibility and scalability properties of this technology. The scalability has been demonstrated by an interactive image retrieval system over 280-dimensional vectors, which is one order of magnitude higher than what most of the literature considers to be the dimensionality...

chapter

Title Page iii

2009 Second International Workshop on Similarity Search and Applications > iii

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

Publication date

Set your own date range

Content availability

Available (36)
None (1)

Keywords

DATA MINING (23)
SEARCH PROBLEMS (15)
EXTRATERRESTRIAL MEASUREMENTS (12)
INDEXING (11)
SIMILARITY SEARCH (11)
INDEXES (10)
IMAGE RETRIEVAL (8)
QUERY PROCESSING (8)
CONTENT-BASED RETRIEVAL (7)
DATA STRUCTURES (6)
DATABASE INDEXING (6)
IMAGE COLOR ANALYSIS (5)
PROBABILITY DENSITY FUNCTION (5)
COMPLEXITY THEORY (4)
METRIC SPACE (4)
ROUTING (4)
SEARCH ENGINES (4)
VISUALIZATION (4)
APPROXIMATION METHODS (3)
CONSTRUCTION INDUSTRY (3)
DATA STRUCTURE (3)
FEATURE EXTRACTION (3)
HISTOGRAMS (3)
METRIC SPACE SEARCHING (3)
NEAREST NEIGHBOR SEARCHES (3)
TRANSFORM CODING (3)
TREE DATA STRUCTURES (3)
$K$ NEAREST NEIGHBOR (2)
ALGORITHM DESIGN AND ANALYSIS (2)
COMPUTATIONAL COMPLEXITY (2)
CONTENT-BASED SEARCH (2)
DISTANCE FUNCTION (2)
GRAPH THEORY (2)
INFORMATION FILTERING (2)
JAVA (2)
METRIC SPACES (2)
PATTERN CLUSTERING (2)
PEER TO PEER COMPUTING (2)
PEER-TO-PEER COMPUTING (2)
SCALABILITY (2)
SECONDARY MEMORY (2)
SERVERS (2)
STATISTICAL ANALYSIS (2)
STORAGE MANAGEMENT (2)
TRIANGLE INEQUALITY (2)
VERY LARGE DATABASES (2)
VISUAL DATABASES (2)
ADAPTIVE QUERY-ROUTING ALGORITHM (1)
ALGORITHM THEORY (1)
AMINO ACIDS (1)
ANSWERING QUERIES (1)
APPROXIMATE L_1/L₂ DISTANCE (1)
APPROXIMATION (1)
APPROXIMATION ALGORITHM (1)
APPROXIMATION ALGORITHMS (1)
APPROXIMATION THEORY (1)
ARF TECHNIQUE (1)
ARTIFICIAL DISTANCE VALUE (1)
ARTIFICIAL NEURAL NETWORKS (1)
ASYMPTOTIC ANALYSIS (1)
ASYMPTOTIC ANALYSIS CONCEPT (1)
AUTO RELEVANCE FEEDBACK (1)
AVERAGE PRECISION (1)
B⁺-TREE (1)
BAESA (1)
BIOINFORMATICS (1)
BIOLOGICAL SYSTEM MODELING (1)
BOUNDED DISTANCE METRIC (1)
BUILT-IN OBJECT CACHE (1)
C#/.NET APPLICATION (1)
CACHE STORAGE (1)
CLUSTERING ALGORITHMS (1)
CLUSTERING-BASED METHOD (1)
CODE GENERATION (1)
COMBINATORIAL FRAMEWORK (1)
COMBINATORIAL MATHEMATICS (1)
COMBINATORIAL NET (1)
COMPACT BIT STRING REPRESENTATION (1)
COMPACT CLUSTERING (1)
COMPETITIVE STRATEGY (1)
COMPRESSED SKETCH (1)
COMPRESSION (1)
CONCENTRATION OF MEASURE (1)
CONTAINERS (1)
CONTEMPORARY STRUCTURAL BIOINFORMATICS (1)
CONTENT BASE IMAGE RETRIEVAL APPLICATION (1)
CONTENT BASED IMAGE RETRIEVAL (1)
CONTENT BASED RETRIEVAL (1)
CONTENT SEARCHING (1)
CONTENT-BASED IMAGE RETRIEVAL SYSTEM (1)
CONTENT-BASED PHOTO IMAGE RETRIEVAL (1)
COPHIR DATASET (1)
COPHIR IMAGE COLLECTION (1)
CPU TIME (1)
CURSE OF DIMENSIONALITY (1)
DATA ANALYSIS (1)
DATA COMPRESSION (1)
DATA REPRESENTATION (1)
DATABASE QUERY OBJECT (1)
DATABASE SORTING (1)
more

INFONA - science communication portal

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)

EGNAT: A Fully Dynamic Metric Access Method for Secondary Memory

Metric Index: An Efficient and Scalable Solution for Similarity Search

Author Index

Using Tuneable Fuzzy Similarity in Non-metric Search

Title Page i

Curse of Dimensionality in Pivot Based Indexes

Dynamic Spatial Approximation Trees for Massive Data

Organizing and Program Committees

Cover Art

Table of Contents

External Reviewers

Principles of Information Filtering in Metric Spaces

Combinatorial Framework for Similarity Search

Optimal Pivots to Minimize the Index Size for Metric Access Methods

Efficient Similarity Search by Reducing I/O with Compressed Sketches

Structural Entropic Difference: A Bounded Distance Metric for Unordered Trees

CoPhIR Image Collection under the Microscope

Speeding Up Permutation Based Indexing with Indexing

MUFIN: A Multi-feature Indexing Network

Title Page iii

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2009 Second International Workshop on Similarity Search and Applications (SISAP 2009)