Search results for: Odej Kao

Items from 1 to 7 out of 7 results

chapter

Ellis: Dynamically Scaling Distributed Dataflows to Meet Runtime Targets

Lauritz Thamsen, Ilya Verbitskiy, Jossekin Beilharz, Thomas Renner, et al.

2017 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) > 146 - 153

Distributed dataflow systems like MapReduce, Spark, and Flink help users in analyzing large datasets with a set of cluster resources. Performance modeling and runtime prediction are then used for automatically allocating resources for specific performance goals. However, the actual performance of distributed dataflow jobs can vary significantly due to factors like interference with co-located workloads,...

chapter

Scheduling Recurring Distributed Dataflow Jobs Based on Resource Utilization and Interference

Lauritz Thamsen, Benjamin Rabier, Florian Schmidt, Thomas Renner, et al.

2017 IEEE International Congress on Big Data (BigData Congress) > 145 - 152

Resource management systems like YARN or Mesos enable users to share cluster infrastructures by running analytics jobs in temporarily reserved containers. These containers are typically not isolated to achieve high degrees of overall resource utilizations despite the often fluctuating resource usage of single analytic jobs. However, some combinations of jobs utilize the resources better and interfere...

chapter

CoLoc: Distributed data and container colocation for data-intensive applications

Thomas Renner, Lauritz Thamsen, Odej Kao

2016 IEEE International Conference on Big Data (Big Data) > 3008 - 3015

The performance of scalable analytic frameworks supporting data-intensive parallel applications often depends significantly on the time it takes to read input data. Therefore, existing frameworks like Spark and Flink try to achieve a high degree of data locality by scheduling tasks on nodes where the input data resides. However, the set of nodes running a job and its tasks is chosen by a cluster resource...

chapter

Selecting resources for distributed dataflow systems according to runtime targets

Lauritz Thamsen, Ilya Verbitskiy, Florian Schmidt, Thomas Renner, et al.

2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC) > 1 - 8

Distributed dataflow systems like Spark or Flink enable users to analyze large datasets. Users create programs by providing sequential user-defined functions for a set of well-defined operations, select a set of resources, and the systems automatically distribute the jobs across these resources. However, selecting resources for specific performance needs is inherently difficult and users consequently...

chapter

The Internet of Things Resource Management Challenge

Andreas Kliem, Odej Kao

2015 IEEE International Conference on Data Science and Data Intensive Systems (DSDIS) > 483 - 490

Caused by the proliferation of the Internet of Things (IoT) and its related application domains such as Building Automation or E-Health, users face a continuously increasing number of heterogeneous sensors and devices deployed in their environment. As a result, a large variety of protocols, data formats and physical sensing resources needs to be managed in order to benefit from the deployed devices. This raises...

chapter

Network-aware resource management for scalable data analytics frameworks

Thomas Renner, Lauritz Thamsen, Odej Kao

2015 IEEE International Conference on Big Data (Big Data) > 2793 - 2800

Sharing cluster resources between multiple frameworks, applications and datasets is important for organizations doing large scale data analytics. It improves cluster utilization, avoids standalone clusters running only a single framework and allows data scientists to choose the best framework for each analysis task. Current systems for cluster resource management like YARN or Mesos achieve resource...

chapter

The Device Cloud - Applying Cloud Computing Concepts to the Internet of Things

Thomas Renner, Andreas Kliem, Odej Kao

2014 IEEE 11th Intl Conf on Ubiquitous Intelligence and Computing and 2014 IEEE 11th Intl Conf on Autonomic and Trusted Computing and 2014 IEEE 14th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom) > 396 - 401

The pervasiveness of connected embedded devices and Internet of Things (IoT) related application domains like smart cities, e-Health, or transportation leads to a constantly increasing amount of data, compute, and storage resources surrounding us. However, there is currently a gap between data acquisition and processing, usually bridged by gateway-based approaches that integrate the devices and forward...

INFONA - science communication portal
