2014 Second International Conference on Advanced Cloud and Big Data

Items from 1 to 10 out of 10 results

chapter

Scaling Information-Theoretic Text Clustering: A Sampling-based Approximate Method

Zhexi Xu, Zhiang Wu, Jie Cao, Hengnong Xuan

2014 Second International Conference on Advanced Cloud and Big Data > 18 - 25

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Info-Kmeans, a K-means clustering method employing KL-divergence as the proximity function, is one of the representative methods in information-theoretic clustering. With the explosive growth of online texts such as online reviews and user-generated content, the text is becoming more sparse and much bigger, which poses significant challenges on both effectiveness and efficiency issues of text clustering...

chapter

A Score Based Approach towards Improving Bayesian Network Structure Learning

Yan Tang, Zhuoming Xu

2014 Second International Conference on Advanced Cloud and Big Data > 39 - 44

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

In big data research, an important field is the big data graph algorithm. The Bayesian Network (BN) is a very powerful graph model for causal relationship modeling and probabilistic reasoning. One key process of building a BN is discovering its structure -- a directed acyclic graph (DAG). In the literature, numerous Bayesian network structure learning algorithms are proposed to discover BN structure...

chapter

A Parallel Algorithm to Mine Abnormal Patterns from Satellite Data

Yuhang Xu, Dechang Pi

2014 Second International Conference on Advanced Cloud and Big Data > 53 - 59

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Mining abnormal patterns is important in many areas. With the prevalence of big data, in order to ensure efficiency, an algorithm named PPSpan (JOMP-based parallel Prefix Span) is proposed under the research of traditional serial sequential pattern mining methods. Firstly, redundant parameters are eliminated with grey correlation analysis. Secondly, outlier information is extracted according to the...

chapter

Data Placement and Task Scheduling Optimization for Data Intensive Scientific Workflow in Multiple Data Centers Environment

Mingjun Wang, Jinghui Zhang, Fang Dong, Junzhou Luo

2014 Second International Conference on Advanced Cloud and Big Data > 77 - 84

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Running data-intensive scientific workflow across multiple data centers faces massive data transfer problem which leads to low efficiency in actual workflow application for scientists. By considering data size and data dependency, we propose a k-means algorithm based initial data placement strategy that places the most related initial data sets into the same data center at workflow preparation stage...

chapter

Efficient Auction Mechanism with Group Price for Resource Allocation in Clouds

Yiyi Ma, Bin Li, Yonglong Zhang, Junwu Zhu

2014 Second International Conference on Advanced Cloud and Big Data > 85 - 92

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

With the rapid grows of cloud-based internet application, a need for efficient resource allocation, load balance and cost management increases. In this paper, we propose a group-auction based mechanism for the cloud instance market to efficiently allocate resources. In the market system, resource providers offer resources in the form of virtual machine. Users submit their bids. The proposed system...

chapter

Multi-Q: Multiple Queries Optimization Based on MapReduce in Cloud

Ding Ding, Fang Dong, Junzhou Luo

2014 Second International Conference on Advanced Cloud and Big Data > 100 - 107

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

With the explosion of data in the past decade, big data is becoming a research hotspot in the information field. Many cloud-based distributed data processing platforms have been proposed to provide efficient and cost effective solutions for big data query processing, such as Hadoop, Hive, Pig, etc. However, most of the current research works are focus on improving the performance of query processing...

chapter

Near-Optimal Approximate Duplicate-Detection in Data Streams Over Sliding Windows for the Uniform Query Frequency or Membership Likelihood

Xiujun Wang, Xiao Zheng, Zhe Dang, Xuangou Wu, more

2014 Second International Conference on Advanced Cloud and Big Data > 122 - 127

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Approximate duplicate-detection (or membership query) in data streams answers the question of whether an element from a large universe U (a query element) is present in a small subsequence of a data stream or not. It is an important query that has many Internet applications, such as web crawling, social networks and so on. Existing approximate duplicatedetection methods in the sliding window model...

chapter

MyBSP: An Iterative Processing Framework Based on the Cloud Platform for Graph Data

Chao Liu, Hong Yao, Deze Zeng, Qingzhong Liang, more

2014 Second International Conference on Advanced Cloud and Big Data > 128 - 135

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Massive cloud-based data-intensive applications (e.g., iterative MapReduce-based) could involve graph data processing. How to effectively analyze and process large-scale graph data is an unsolved challenging problem. We present a parallel computation framework, named MyBSP, which is inspired by Google's Pregel system. MyBSP supports and implements the Bulk Synchronous Parallel (BSP) programming model,...

chapter

SSNF: Shared Datacenter Mechanism for Inter-datacenter Bulk Transfer

Yang Yu, Wang Rong, Wang Zhijun

2014 Second International Conference on Advanced Cloud and Big Data > 184 - 189

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Cloud service providers (CSP) usually deploy geographically distributed data centers to improve QoS for colocated customers. Inter-Data center traffic constitutes almost half of the data center's export traffic and occupies significant part of the operational cost. Many store-and-forward mechanisms have been proposed to improve the efficiency of inter-data center transfer. However, existing store-and-forward...

chapter

A Node-to-Set Disjoint Path Routing Algorithm in DCell Networks

Xi Wang, Jianxi Fan

2014 Second International Conference on Advanced Cloud and Big Data > 196 - 200

2014 Second International Conference on Advanced Cloud and Big Data (CBD)

Data center networks become increasingly important with the growth of cloud computing. For any integers k &#x2265; 0 and n &#x2265; 2, the k-dimensional DCell, Dk,n, has been proposed for one of the most important data center networks as a server-centric data center network structure. In this paper, we propose an efficient algorithm for finding disjoint paths in node-to-set routing...

Filter options

Keywords:
ALGORITHM DESIGN AND ANALYSIS

Publication date

Set your own date range

Keywords

BIG DATA (4)
CLOUD COMPUTING (3)
DATA MODELS (3)
APPROXIMATION ALGORITHMS (2)
CLUSTERING ALGORITHMS (2)
COMPUTATIONAL MODELING (2)
DATA TRANSFER (2)
SERVERS (2)
ABNORMAL PATTERNS (1)
AEROSPACE ELECTRONICS (1)
ALGORITHM (1)
BANDWIDTH (1)
BAYES METHODS (1)
BAYESIAN NETWORK STRUCTURE LEARNING (1)
BSP MODEL (1)
BULK TRANSFERS (1)
CLOUD COMPUTING; GROUP AUCTION; DCOP; FACTOR (1)
CLOUD PLATFORM (1)
CLUSTERING METHODS (1)
COMPLEXITY THEORY (1)
CORRELATION (1)
COST ACCOUNTING (1)
DATA CENTER (1)
DATA COMMUNICATION (1)
DATA MINING (1)
DATA PLACEMENT (1)
DATA PROCESSING (1)
DATA STRUCTURES (1)
DATABASES (1)
DCELL (1)
DISJOINT PATH ROUTING (1)
DISTRIBUTED DATABASES (1)
ELECTRONIC MAIL (1)
ENTROPY (1)
FAULT TOLERANCE (1)
FAULT TOLERANT SYSTEMS (1)
GRAPH DATA PROCESSING (1)
GRAPHIC MODE (1)
GRAPHICS (1)
GREY CORRELATION ANALYSIS (1)
HEURISTIC ALGORITHMS (1)
INDEXES (1)
INFERENCE ALGORITHMS (1)
INFORMATION ENTROPY (1)
INTER-DATACENTER TRAFFIC (1)
INTERNET (1)
ITERATIVE PROCESSING FRAMEWORK (1)
K-MEANS (1)
KL-DIVERGENCE (1)
LINEAR PROGRAMMING (1)
MESSAGE PASSING (1)
MULTI-QUERIES RESULT REUSE (1)
MULTILEVEL TASK REPLICATION (1)
NODE-TO-SET (1)
OPTIMIZATION (1)
PARTIAL ORDER (1)
PARTITIONING ALGORITHMS (1)
PEER-TO-PEER COMPUTING (1)
PPSPAN (1)
PROCESS CONTROL (1)
PROCESSOR SCHEDULING (1)
QUERY OPTIMIZATION (1)
QUERY PROCESSING (1)
RANDOM (1)
RESOURCE MANAGEMENT (1)
ROUTING (1)
SCHEDULING (1)
SCIENTIFIC WORKFLOWS (1)
SCORE FUNCTION (1)
SHARED DATACENTER (1)
SYNCHRONIZATION (1)
SYSTEMS ARCHITECTURE (1)
TEXT CLUSTERING (1)
TIME COMPLEXITY (1)
UPLINK (1)
VIRTUAL MACHINING (1)
WIRELESS APPLICATION PROTOCOL (1)
XENON (1)
more

INFONA - science communication portal

2014 Second International Conference on Advanced Cloud and Big Data $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 Second International Conference on Advanced Cloud and Big Data