Search results for: D. Srivastava

Items from 1 to 4 out of 4 results

chapter

Discovery of complex glitch patterns: A novel approach to Quantitative Data Cleaning

L Berti-Equille, T Dasu, D Srivastava

2011 IEEE 27th International Conference on Data Engineering (ICDE 2011), pp. 733-744

Quantitative Data Cleaning (QDC) is the use of statistical and other analytical techniques to detect, quantify, and correct data quality problems (or glitches). Current QDC approaches focus on addressing each category of data glitch individually. However, in real-world data, different types of data glitches co-occur in complex patterns. These patterns and interactions between glitches offer valuable...

chapter

Weighted Set Similarity: Queries and Updates

D. Srivastava

2009 IEEE 25th International Conference on Data Engineering (ICDE 2009), p. 1559

Summary form only given. Consider a universe of items, each of which is associated with a weight, and a database consisting of subsets of these items. Given a query set, a weighted set similarity query identifies either (i) all sets in the database whose normalized similarity to the query set is above a pre-specified threshold, or (ii) the sets in the database with the k highest similarity values...
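The two query types in this summary can be illustrated with a minimal sketch. The summary does not say which normalization is used, so weighted Jaccard similarity is an assumption here, and the function names, item weights, and the small database are hypothetical. The sketch scans the whole database and is not the indexing approach of the talk.

```python
import heapq
from typing import Dict, FrozenSet, List, Tuple


def weighted_jaccard(a: FrozenSet[str], b: FrozenSet[str],
                     weight: Dict[str, float]) -> float:
    """Weight of the intersection divided by weight of the union (assumed normalization)."""
    union = sum(weight[x] for x in a | b)
    if union == 0:
        return 0.0
    return sum(weight[x] for x in a & b) / union


def threshold_query(db: Dict[str, FrozenSet[str]], query: FrozenSet[str],
                    weight: Dict[str, float], tau: float) -> List[Tuple[str, float]]:
    """Query type (i): all sets whose similarity to the query is above tau."""
    scored = ((name, weighted_jaccard(items, query, weight))
              for name, items in db.items())
    return [(name, s) for name, s in scored if s > tau]


def top_k_query(db: Dict[str, FrozenSet[str]], query: FrozenSet[str],
                weight: Dict[str, float], k: int) -> List[Tuple[str, float]]:
    """Query type (ii): the k sets with the highest similarity values."""
    scored = [(name, weighted_jaccard(items, query, weight))
              for name, items in db.items()]
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Hypothetical universe of weighted items and a database of subsets.
weights = {"a": 1.0, "b": 2.0, "c": 3.0, "d": 1.0}
database = {
    "s1": frozenset({"a", "b"}),
    "s2": frozenset({"b", "c"}),
    "s3": frozenset({"c", "d"}),
}
query_set = frozenset({"b", "c"})

above = threshold_query(database, query_set, weights, tau=0.4)
best_two = top_k_query(database, query_set, weights, k=2)
```

A linear scan like this is only practical for small collections; it is meant to pin down the query semantics, not to suggest an efficient evaluation strategy.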

chapter

Fast Indexes and Algorithms for Set Similarity Selection Queries

M. Hadjieleftheriou, A. Chandel, N. Koudas, D. Srivastava

2008 IEEE 24th International Conference on Data Engineering (ICDE '08), pp. 267-276

Data collections often have inconsistencies that arise due to a variety of reasons, and it is desirable to be able to identify and resolve them efficiently. Set similarity queries are commonly used in data cleaning for matching similar data. In this work we concentrate on set similarity selection queries: Given a query set, retrieve all sets in a collection with similarity greater than some threshold...

article

Efficient Processing of Top-k Queries in Uncertain Databases with x-Relations

Ke Yi, Feifei Li, G. Kollios, D. Srivastava

IEEE Transactions on Knowledge and Data Engineering, 2008, vol. 20, no. 12, pp. 1669-1682

This work introduces novel polynomial algorithms for processing top-k queries in uncertain databases under the generally adopted model of x-relations. An x-relation consists of a number of x-tuples, and each x-tuple randomly instantiates into one tuple from one or more alternatives. Our results significantly improve the best known algorithms for top-k query processing in uncertain databases, in terms...
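To make the x-relation model concrete, the following sketch enumerates every possible world of a tiny x-relation and reports the most probable top-k answer. This is a brute-force illustration with exponential cost, not the polynomial algorithms the abstract refers to. The representation (each x-tuple as a list of `(score, probability)` alternatives whose probabilities sum to 1), the choice of "most probable top-k vector" as the answer semantics, and the function name are all assumptions made for the example.

```python
from collections import defaultdict
from itertools import product
from math import prod
from typing import Dict, List, Tuple

Alternative = Tuple[float, float]   # (score, probability); hypothetical encoding
XTuple = List[Alternative]          # exactly one alternative is instantiated


def most_probable_top_k(x_relation: List[XTuple], k: int) -> Tuple[Tuple[float, ...], float]:
    """Enumerate all possible worlds and return the likeliest top-k score vector."""
    totals: Dict[Tuple[float, ...], float] = defaultdict(float)
    # Each possible world picks one alternative from every x-tuple.
    for world in product(*x_relation):
        world_prob = prod(p for _, p in world)
        top = tuple(sorted((score for score, _ in world), reverse=True)[:k])
        totals[top] += world_prob
    best = max(totals, key=totals.get)
    return best, totals[best]


# Hypothetical x-relation: the first x-tuple has two alternatives, the second has one.
x_relation = [
    [(10.0, 0.4), (3.0, 0.6)],
    [(8.0, 1.0)],
]
answer, probability = most_probable_top_k(x_relation, k=1)
```

With these numbers there are two possible worlds: one containing scores 10 and 8 (probability 0.4) and one containing scores 3 and 8 (probability 0.6), so the tuple with score 8 is the likelier top-1 answer.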

INFONA - science communication portal
