Search results for: D. Srivastava

Items from 1 to 5 out of 5 results

chapter

Discovery of complex glitch patterns: A novel approach to Quantitative Data Cleaning

L Berti-Equille, T Dasu, D Srivastava

2011 IEEE 27th International Conference on Data Engineering > 733 - 744

2011 27th IEEE International Conference on Data Engineering (ICDE 2011)

Quantitative Data Cleaning (QDC) is the use of statistical and other analytical techniques to detect, quantify, and correct data quality problems (or glitches). Current QDC approaches focus on addressing each category of data glitch individually. However, in real-world data, different types of data glitches co-occur in complex patterns. These patterns and interactions between glitches offer valuable...

chapter

Forward Decay: A Practical Time Decay Model for Streaming Systems

G. Cormode, V. Shkapenyuk, D. Srivastava, Bojian Xu

2009 IEEE 25th International Conference on Data Engineering > 138 - 149

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

Temporal data analysis in data warehouses and datastreaming systems often uses time decay to reduce the importance of older tuples, without eliminating their influence, on the results of the analysis. While exponential time decay is commonly used in practice, other decay functions (e.g. polynomial decay) are not, even though they have been identified as useful. We argue that this is because the usual...

chapter

Efficient Table Anonymization for Aggregate Query Answering

C.M. Procopiuc, D. Srivastava

2009 IEEE 25th International Conference on Data Engineering > 1291 - 1294

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

Privacy protection is a major concern when microdata is released for ad hoc analyses. Anonymization schemes have to guarantee privacy goals, as well as preserve sufficient information to support reasonably accurate answers to ad hoc queries. In this paper, we focus on the case when the sensitive attributes are numerical (e.g., salary) for which (k,e)-anonymity was shown to be an appropriate privacy...

chapter

Exploring a Few Good Tuples from Text Databases

A. Jain, D. Srivastava

2009 IEEE 25th International Conference on Data Engineering > 616 - 627

2009 IEEE 25th International Conference on Data Engineering. ICDE 2009

Information extraction from text databases is a useful paradigm to populate relational tables and unlock the considerable value hidden in plain-text documents. However, information extraction can be expensive, due to various complex text processing steps necessary in uncovering the hidden data. There are a large number of text databases available, and not every text database is necessarily relevant...

chapter

Randomized Synopses for Query Assurance on Data Streams

Ke Yi, Feifei Li, M. Hadjieleftheriou, G. Kollios, more

2008 IEEE 24th International Conference on Data Engineering > 416 - 425

2008 IEEE 24th International Conference on Data Engineering (ICDE '08)

The overwhelming flow of information in many data stream applications forces many companies to outsource to a third-party the deployment of a data stream management system (DSMS) for performing desired computations. Remote computations intrinsically raise issues of trust, making query execution assurance on data streams a problem with practical implications. Consider a client observing the same data...

INFONA - science communication portal

Search results for: D. Srivastava

Discovery of complex glitch patterns: A novel approach to Quantitative Data Cleaning

Forward Decay: A Practical Time Decay Model for Streaming Systems

Efficient Table Anonymization for Aggregate Query Answering

Exploring a Few Good Tuples from Text Databases

Randomized Synopses for Query Assurance on Data Streams

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: D. Srivastava

Discovery of complex glitch patterns: A novel approach to Quantitative Data Cleaning

Forward Decay: A Practical Time Decay Model for Streaming Systems

Efficient Table Anonymization for Aggregate Query Answering

Exploring a Few Good Tuples from Text Databases

Randomized Synopses for Query Assurance on Data Streams

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options