ICDM 2008. Eighth IEEE International Conference on Data Mining

Items from 1 to 4 out of 4 results

chapter

Estimating Aggregates over Multiple Sets

E. Cohen, H. Kaplan

2008 Eighth IEEE International Conference on Data Mining > 761 - 766

ICDM 2008. Eighth IEEE International Conference on Data Mining

Many datasets, including market basket data, text or hypertext documents, and measurement data collected in different nodes or time periods, are modeled as a collection of sets over a ground set of (weighted) items. We consider the problem of estimating basic aggregates such as the weight or selectivity of a subpopulation of the items. We extend classic summarization techniques based on sampling to...

chapter

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs

L. Akoglu, M. McGlohon, C. Faloutsos

2008 Eighth IEEE International Conference on Data Mining > 701 - 706

ICDM 2008. Eighth IEEE International Conference on Data Mining

How do real, weighted graphs change over time? What patterns, if any, do they obey? Earlier studies focus on unweighted graphs, and, with few exceptions, they focus on static snapshots. Here, we report patterns we discover on several real, weighted, time-evolving graphs. The reported patterns can help in detecting anomalies in natural graphs, in making link prediction and in providing more criteria...

chapter

Discovering Flow Anomalies: A SWEET Approach

J.M. Kang, S. Shekhar, C. Wennen, P. Novak

2008 Eighth IEEE International Conference on Data Mining > 851 - 856

ICDM 2008. Eighth IEEE International Conference on Data Mining

Given a percentage-threshold and readings from a pair of consecutive upstream and downstream sensors, flow anomaly discovery identifies dominant time intervals where the fraction of time instants of significantly mis-matched sensor readings exceed the given percentage-threshold. Discovering flow anomalies (FA) is an important problem in environmental flow monitoring networks and early warning detection...

chapter

DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining

S. Papadimitriou, Jimeng Sun

2008 Eighth IEEE International Conference on Data Mining > 512 - 521

ICDM 2008. Eighth IEEE International Conference on Data Mining

Huge datasets are becoming prevalent; even as researchers, we now routinely have to work with datasets that are up to a few terabytes in size. Interesting real-world applications produce huge volumes of messy data. The mining process involves several steps, starting from pre-processing the raw data to estimating the final models. As data become more abundant, scalable and easy-to-use tools for distributed...

Filter options

Keywords:
IP NETWORKS

Publication date

Set your own date range

Keywords

PROBABILITY DENSITY FUNCTION (3)
AGGREGATES (1)
APPROXIMATE QUERY PROCESSING (1)
BIOINFORMATICS (1)
BLOGS (1)
CO-CLUSTERING (1)
COLLABORATIVE FILTERING (1)
CONDITION MONITORING (1)
DATA MODELS (1)
DATA STORAGE (1)
DISCO (1)
DISTRIBUTED CO-CLUSTERING (1)
DISTRIBUTED DATA PRE-PROCESSING (1)
DISTRIBUTED DATABASES (1)
DISTRIBUTED PROCESSING (1)
DOCUMENT HANDLING (1)
EARLY WARNING DETECTION SYSTEMS (1)
EIGENVALUES AND EIGENFUNCTIONS (1)
ENVIRONMENTAL FLOW MONITORING NETWORKS (1)
ENVIRONMENTAL SCIENCE COMPUTING (1)
ESTIMATION (1)
EXECUTION ENGINE (1)
FLOW ANOMALY DISCOVERY (1)
GAIN (1)
GENERATORS (1)
GRAPH GENERATORS (1)
GRAPH MINING (1)
GRAPH THEORY (1)
GRAPHS (1)
HADOOP (1)
HYPERMEDIA (1)
HYPERTEXT DOCUMENTS (1)
KRONECKER PRODUCT (1)
LINK PREDICTION (1)
MAPREDUCE (1)
MARKET BASKET DATA (1)
MEASUREMENT DATA COLLECTED (1)
MESSY DATA (1)
MINING PROCESS (1)
MULTIPLE SETS (1)
OPEN SOURCE MAP-REDUCE IMPLEMENTATION (1)
PATTERN CLUSTERING (1)
PETABYTE-SCALE END-TO-END MINING (1)
POWER LAWS (1)
PROGRAMMING (1)
PRUNING TECHNIQUES (1)
RADIATION DETECTORS (1)
REAL DATASET (1)
REAL-WORLD APPLICATIONS (1)
RECURSIVE ESTIMATION (1)
RECURSIVE GENERATOR (1)
RTM (1)
SAMPLING (1)
SCALABILITY (1)
SEARCH SPACE (1)
SENSOR PHENOMENA AND CHARACTERIZATION (1)
SENSORS (1)
SIMILARITY (1)
SKETCHING (1)
SMART COUNTER (1)
SMART WINDOW ENUMERATION AND EVALUATION OF PERSISTENCE-THRESHOLDS METHOD (1)
STORAGE MANAGEMENT (1)
SUMMARIZATION TECHNIQUES (1)
SYNTHETIC GRAPH GENERATORS (1)
TENSILE STRESS (1)
TENSORS (1)
TEXT MINING (1)
TIME SERIES ANALYSIS (1)
TRANSIENT ANALYSIS (1)
UNWEIGHTED GRAPHS (1)
WATER BODIES (1)
WATER QUALITY (1)
WATER QUALITY PROBLEMS (1)
WEB PAGES (1)
WEIGHTED TIME-EVOLVING GRAPHS (1)
more

INFONA - science communication portal

ICDM 2008. Eighth IEEE International Conference on Data Mining $("#expandableTitles").expandable();

Estimating Aggregates over Multiple Sets

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs

Discovering Flow Anomalies: A SWEET Approach

DisCo: Distributed Co-clustering with Map-Reduce: A Case Study towards Petabyte-Scale End-to-End Mining

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICDM 2008. Eighth IEEE International Conference on Data Mining