This paper proposes a fully distributed scheduling algorithm to process MapReduce data-intensive applications across geo-distributed clusters in federated clouds. The proposed algorithm, called FedSCD, takes advantage of data locality while reducing both VM cost and inter-cluster data transfer cost, subject to a deadline constraint. This work is compared to conventional partially distributed scheduling...
Hadoop is a commonly used framework for applications that deal with large volumes of data. Most current-day applications require large amounts of storage and computation. Hadoop jobs are executed in the cloud because the cloud environment provides flexible provisioning, maintenance, and scalability of resources. The Hadoop framework can be improved in terms of parameter automation and MapReduce tasks...
Network traffic measurement is important for network security and network management. As network bandwidth increases and Internet applications diversify, network big data brings new challenges for network traffic measurement. Because existing network traffic measurement mainly processes traffic data with centralized methods, it is very difficult to meet the application needs of massive...
Cloud computing is a paradigm for processing big data that provides convenient, on-demand network access to a shared pool of configurable computing resources. The cost of cloud data centers has become a hot topic in recent years. This paper studies how to minimize the bandwidth cost of uploading deferrable big data to a cloud computing platform, based on the MapReduce framework. We study the deficiency...
The Hadoop framework has been developed to effectively process data-intensive MapReduce applications. Hadoop users specify the application's computation logic in terms of a map and a reduce function; such programs are often termed MapReduce applications. The Hadoop Distributed File System stores the MapReduce application data on Hadoop cluster nodes called Data nodes, whereas the Name node is a control...
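The programming model this abstract describes, where the user supplies only a map and a reduce function and the framework handles grouping, can be sketched conceptually in plain Python. This is a hedged illustration of the model, not the actual Hadoop Java API; the names `map_fn`, `reduce_fn`, and `run_mapreduce` are hypothetical:

```python
from collections import defaultdict

# A user supplies only these two functions; the framework does the rest.
def map_fn(line):
    # Emit (word, 1) pairs for each word in one input line.
    for word in line.split():
        yield (word, 1)

def reduce_fn(key, values):
    # Sum the counts collected for one word.
    return (key, sum(values))

def run_mapreduce(lines):
    # Shuffle phase: group all map outputs by key before reducing.
    groups = defaultdict(list)
    for line in lines:
        for key, value in map_fn(line):
            groups[key].append(value)
    return dict(reduce_fn(k, v) for k, v in groups.items())

print(run_mapreduce(["a b a", "b c"]))  # {'a': 2, 'b': 2, 'c': 1}
```

In real Hadoop deployments the map and reduce phases run as parallel tasks on Data nodes, and intermediate results are persisted and shuffled over the network rather than held in one in-memory dictionary.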
Cloud computing is based on agreements between the service provider and the service consumer, and a cloud data center is a cloud computing environment consisting of hardware and software components. This paper studies how to minimize the bandwidth cost of uploading deferrable big data to a cloud computing platform, based on the MapReduce framework. We first analyze the shortcomings of the bandwidth of data...
In recent years, cloud computing systems have matured considerably and their applications have become more widespread. Microsoft, Google, IBM, and Amazon have all developed applications for the cloud computing environment. The cloud computing environment resembles a large pool of resources, and MapReduce distributes work across this resource pool to realize cloud computing. Hadoop MapReduce is a...
MapReduce is by far one of the most successful realizations of large-scale data-intensive cloud computing platforms. MapReduce automatically parallelizes computation by running multiple map and/or reduce tasks over distributed data across multiple machines. Hadoop is an open source implementation of MapReduce. When Hadoop schedules reduce tasks, it neither exploits data locality nor addresses partitioning...
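The partitioning weakness this abstract points at can be made concrete with a small sketch (plain Python standing in for Hadoop's behavior, not Hadoop code): the default scheme hashes each key to a fixed reducer, so a skewed key distribution overloads one reducer, and data locality plays no role in the assignment. The key set below is invented for illustration:

```python
from collections import Counter
from zlib import crc32

def hash_partition(key, num_reducers):
    # Mimics the idea of Hadoop's default HashPartitioner: key hash
    # modulo reducer count, with no regard for data locality or for
    # how many records carry each key.
    return crc32(key.encode()) % num_reducers

# Skewed key frequencies: one "hot" key dominates the map output.
map_output = ["hot"] * 90 + ["k1", "k2", "k3"] * 3 + ["k4"]

# Count how many intermediate records each of 4 reducers would receive.
load = Counter(hash_partition(k, 4) for k in map_output)

# All 90 "hot" records land on one reducer, so that reducer processes
# ~90% of the data no matter how the remaining keys are spread.
print(load)
```

A locality- and skew-aware scheduler, like the one the paper argues for, would instead consider where each key's intermediate data resides and how much of it there is before assigning reduce tasks.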
Placing data as close as possible to computation is a common practice of data intensive systems, commonly referred to as the data locality problem. By analyzing existing production systems, we confirm the benefit of data locality and find that data have different popularity and varying correlation of accesses. We propose DARE, a distributed adaptive data replication algorithm that aids the scheduler...
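As a rough, hypothetical sketch of the popularity-driven replication idea mentioned above (not the DARE algorithm itself; the function name, scaling rule, and sample access counts are all invented for illustration), more replicas can be budgeted for blocks that are accessed more often, which gives the scheduler more nodes where a popular block is local:

```python
def replica_counts(access_counts, min_replicas=1, max_replicas=5):
    # Scale each block's replica count with its share of total accesses,
    # clamped to the [min_replicas, max_replicas] budget.
    total = sum(access_counts.values())
    plan = {}
    for block, hits in access_counts.items():
        share = hits / total if total else 0.0
        plan[block] = max(min_replicas,
                          min(max_replicas, round(share * max_replicas * 2)))
    return plan

# A popular block earns more replicas than a rarely accessed one.
print(replica_counts({"blk_hot": 80, "blk_warm": 15, "blk_cold": 5}))
```

A production system would additionally adapt these counts over time as access patterns shift, which is the "adaptive" and "distributed" part of the approach the abstract describes.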