Search results

Items from 81 to 100 out of 928 results

chapter

Next-gen tools for big scientific data: ARM data center example

Ranjeet Devarakonda, Kyle Dumas, Sheman Beus, Everett Rush, more

2016 IEEE International Conference on Big Data (Big Data) > 3968 - 3970

2016 IEEE International Conference on Big Data (Big Data)

The Atmospheric Radiation Measurement (ARM) Climate Research Facility (www.arm.gov) provides atmospheric observations from diverse climatic regimes around the world. Currently, ARM archives over 22 million user assessable data files, primarily stored in NetCDF file format, with total data volumes close to one Petabyte. In this paper, we will discuss how ARM is currently storing, distributing, cataloging...

chapter

Addressing the big-earth-data variety challenge with the hierarchical triangular mesh

Michael L. Rilee, Kwo-Sen Kuo, Thomas Clune, Amidu Oloso, more

2016 IEEE International Conference on Big Data (Big Data) > 1006 - 1011

2016 IEEE International Conference on Big Data (Big Data)

We have implemented an updated Hierarchical Triangular Mesh (HTM) as the basis for a unified data model and an indexing scheme for geoscience data to address the variety challenge of Big Earth Data. In the absence of variety, the volume challenge of Big Data is relatively easily addressable with parallel processing. The more important challenge in achieving optimal value with a Big Data solution for...

chapter

Robust K-subspaces recovery with combinatorial initialization

Jun He, Yue Zhang, Jiye Wang, Nan Zeng, more

2016 IEEE International Conference on Big Data (Big Data) > 3573 - 3582

2016 IEEE International Conference on Big Data (Big Data)

In this paper we propose a two-stage algorithm for robust K-subspaces recovery. In the first stage, a large number of local candidate subspaces are generated by probabilistic farthest insertion, and then the initial near-optimal K-subspaces are solved by combinatorial selection with randomized greedy method. In the second stage, the K-subspaces are further refined by assigning each data vector to...

chapter

DERIV: Distributed In-Memory Brand Perception Tracking Framework

Manu Shukla, Andrew Fong, Raimundo Dos Santos, Chang-Tien Lu

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 387 - 393

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Social media captures voice of customers at a rapid pace. Consumer perception of a brand is crucial to its success. Current techniques for measuring brand perception using lengthy surveys of handpicked users in person, by mail, phone or online are time consuming and increasingly inadequate. A more effective technique to measure brand perception is to interpret customer voice directly from social media...

chapter

Exploring Controlled RDF Distribution

Raqueline R.M. Penteado, Rebeca Scroeder, Carmem S. Hara

2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) > 160 - 167

2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom)

RDF datasets have increased rapidly over the last few years. In order to process SPARQL queries on these large datasets, much effort has been spent on developing horizontally scalable techniques, which involve data partitioning and parallel query processing. While distribution may provide storage scalability, it may also incur high communication costs for processing queries. In this paper, we present...

chapter

A Scalable Privacy Preserving System for Open Data

Chao-Chun Yeh, Pang-Chieh Wang, Yu-Hsuan Pan, Ming-Chih Kao, more

2016 International Computer Symposium (ICS) > 312 - 317

2016 International Computer Symposium (ICS)

The citizen considers that data source collecting by the government can be released for more diversity usage. However, to archive the open data dream, sensitive data potentially could be published after the proper privacy preserving processing. In this paper, we present a scalable privacy preserving system for open/big data which leverages K-anonymity algorithm and Hadoop framework. We use an experiment...

chapter

Linked data platform for building cloud-based smart applications and connecting API access points with data discovery techniques

Holly Ferguson, Charles Vardeman, Jarek Nabrzyski

2016 IEEE International Conference on Big Data (Big Data) > 3016 - 3025

2016 IEEE International Conference on Big Data (Big Data)

Globalization and cloud computing have allowed major strides forward in terms of communication possibilities, but it is also illuminating how many different resource options and formats exist access to which would dramatically increase the accuracy and reliability of choices made as a result of computational output. As a result, there is increasing need for methods resolving levels of data translations...

chapter

Big data analytics as-a-service: Issues and challenges

Claudio A. Ardagna, Paolo Ceravolo, Ernesto Damiani

2016 IEEE International Conference on Big Data (Big Data) > 3638 - 3644

2016 IEEE International Conference on Big Data (Big Data)

Big Data domain is one of the most promising ICT sectors with substantial expectations both on the side of market growing and design shift in the area of data storage managment and analytics. However, today, the level of complexity achieved and the lack of standardisation of Big Data management architectures represent a huge barrier towards the adoption and execution of analytics especially for those...

chapter

Data Distribution and Encryption Modelling for PaaS-enabled Cloud Security

Yiannis Verginadis, Ioannis Patiniotakis, Gregoris Mentzas, Simeon Veloudis, more

2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) > 497 - 502

2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom)

Some of the most valuable business benefits that accompany the cloud adoption cannot be exploited without addressing, first, new data security challenges posed by cloud computing distributed nature. A promising approach for alleviating these risks is to provide a security-by-design framework that will assist cloud application developers in defining appropriate context-driven policies that enhance...

chapter

DRASH: A Data Replication-Aware Scheduler in Geo-Distributed Data Centers

Moise W. Convolbo, Jerry Chou, Shihyu Lu, Yeh Ching Chung

2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) > 302 - 309

2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom)

Driven by the trends of BigData and Cloud computing, there is a growing demand for processing and analyzing data that are generated and stored across geo-distributed data centers. However, due to the limited network bandwidth between data centers and the growing data volume spread across different locations, it has become increasingly inefficient to aggregate data and to perform computations at a...

chapter

Brawler to CFAM: Incorporating stochastic engagement-level data in deterministic campaign models

Benjamin R. Mayo, Todd J. Paciencia, Daniel P. Croghan

2016 Winter Simulation Conference (WSC) > 484 - 489

2016 Winter Simulation Conference (WSC)

Headquarters Air Force Studies, Analyses, and Assessments (AF/A9) supports Force Structure decisions by integrating analysis at various levels of resolution. The Combat Forces Assessment Model (CFAM), is a mixed integer program incorporating results from higher-resolution models to identify an optimal force mix within Air Force resources. CFAM is a deterministic model, but some input models are stochastic,...

chapter

An evaluation of data replication for bioinformatics workflows on NoSQL systems

Iasmini Lima, Matheus Oliveira, Diego Kieckbusch, Maristela Holanda, more

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 896 - 901

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Many research projects in bioinformatics may be viewed as scientific workflows. Biologists often run multiple times the same workflow with different parameters in order to refine their data analysis. These executions generate a large volume of files with different formats, which need to be stored for future evaluations. New database models, like NoSQL systems, could be considered to deal with large...

chapter

Study on hotel revenue management without explicitly incorporating competition

N.A. Masruroh, H. N. Absari, Y.P Mulyani

2016 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM) > 557 - 561

2016 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)

This paper analyzes the possibility of applying model that is not explicitly incorporate competition in hotel Revenue Management. Three scenarios are evaluated; (i) each seller understands how its own price affects its own demand but does not directly account for how its competitor's price does, (ii) each seller knows the total market size, and tries to learn how its own price affects demand, while...

chapter

Towards Real-Time and Temporal Information Services in Vehicular Networks via Multi-Objective Optimization

Penglin Dai, Kai Liu, Liang Feng, Qingfeng Zhuge, more

2016 IEEE 41st Conference on Local Computer Networks (LCN) > 671 - 679

2016 IEEE 41st Conference on Local Computer Networks (LCN)

Real-time and temporal information services are intrinsic characteristics in vehicular networks, where the timeliness of data dissemination and the maintenance of data quality interplay with each other and influence overall system performance. In this work, we present the system architecture where multiple road side units (RSUs) are cooperated to provide information services, and the vehicles can...

chapter

Privacy Preserving in Distributed SVM Data Mining on Vertical Partitioned Data

Mohammed Z. Omer, Hui Gao, Faisal Sayed

2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI) > 84 - 89

2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI)

Data mining algorithms tacitly quite access to the data either at centralized or distributed form. Distributed data becomes a big challenge and cannot handle by a classical analytic tool. Cloud Computing can solve the issues of processing, storing, and analyzing the data at distributing locations within the cloud. However, a significant problem that is preventing free sharing of data is privacy and...

chapter

Minimizing the cost of designing fault-tolerant CDN data centers

S. Vignesh, Rakesh Tripathi, Venkatesh Tamarapalli

2016 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS) > 1 - 3

2016 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS)

With an increase in the usage of data centers to power content distribution networks (CDN), minimizing the cost of deployment while handling fault-tolerance has become an important research issue. In this work, we demonstrate the importance of cost-aware capacity provisioning in fault-tolerant CDN data centers (that can tolerate failure at a single site). We propose an optimization model that exploits...

chapter

Extending a Message Passing Runtime to Support Partitioned, Global Logical Address Spaces

D. Brian Larkins, James Dinan

2016 First International Workshop on Communication Optimizations in HPC (COMHPC) > 11 - 16

2016 First International Workshop on Communication Optimizations in HPC (COMHPC)

Partitioned Global Address Space (PGAS) parallel programming models can provide an efficient mechanism for managing shared data stored across multiple nodes in a distributed memory system. However, these models are traditionally directly addressed and, for applications with loosely-structured or sparse data, determining the location of a given data element within a PGAS can incur significant overheads...

chapter

A Multi-Objective Optimization Model for Data-Intensive Workflow Scheduling in Data Grids

Mahshid Helali Moghadam, Seyyed Morteza Babamir, Meghdad Mirabi

2016 IEEE 41st Conference on Local Computer Networks Workshops (LCN Workshops) > 25 - 33

2016 IEEE 41st Conference on Local Computer Networks Workshops (LCN Workshops)

The concept of workflow is used for modeling many of the data-intensive scientific applications executed on data grids. A Workflow is a series of interdependent tasks during which data is processed by different tasks. Scheduling the workflows in the grids is the process of assigning tasks to appropriate resources with the aim of achieving goals such as reducing workflow completion time while considering...

chapter

Teaching MPI from Mental Models

Victor Eijkhout

2016 Workshop on Education for High-Performance Computing (EduHPC) > 14 - 18

2016 Workshop on Education for High-Performance Computing (EduHPC)

The Message Passing Interface (MPI) is the de facto standard for programming large scale parallelism, with up to millions of individual processes. Its dominant paradigm of Single Program Multiple Data (SPMD) programming is different from threaded and multicore parallelism, to an extent that students have a hard time switching models. In contrast to threaded programming, which allows for a view of...

chapter

In-Staging Data Placement for Asynchronous Coupling of Task-Based Scientific Workflows

Qian Sun, Melissa Romanus, Tong Jin, Hongfeng Yu, more

2016 Second International Workshop on Extreme Scale Programming Models and Middlewar (ESPM2) > 2 - 9

2016 Second International Workshop on Extreme Scale Programming Models and Middleware (ESPM2)

Coupled application workflows composed of applications implemented using task-based models present new coupling and data exchange challenges, due to the asynchronous interaction and coupling behaviors between tasks of the component applications. In this paper, we present an adaptive data placement approach that addresses these challenges by dynamically adjusting to the asynchronous coupling patterns...

Keywords:
DATA MODELS
DISTRIBUTED DATABASES

Publication date

Set your own date range

Content availability

Available (920)
None (8)

Publication type

book (816)
article (112)

Keywords

COMPUTATIONAL MODELING (217)
DATA MINING (141)
SERVERS (115)
CLOUD COMPUTING (108)
BIG DATA (79)
ALGORITHM DESIGN AND ANALYSIS (70)
ANALYTICAL MODELS (70)
COMPUTER ARCHITECTURE (61)
GRID COMPUTING (61)
INTERNET (60)
XML (54)
MONITORING (53)
DATABASES (51)
DISTRIBUTED PROCESSING (49)
MAPREDUCE (47)
INDEXES (46)
QUERY PROCESSING (46)
PROTOCOLS (43)
OPTIMIZATION (42)
WEB SERVICES (42)
BANDWIDTH (41)
MIDDLEWARE (40)
SCALABILITY (39)
COMPUTERS (38)
DATA INTEGRATION (37)
SOFTWARE (37)
BIOLOGICAL SYSTEM MODELING (36)
DATA PROCESSING (35)
GEOGRAPHIC INFORMATION SYSTEMS (35)
PREDICTIVE MODELS (35)
RELATIONAL DATABASES (35)
CLUSTERING ALGORITHMS (34)
ONTOLOGIES (34)
BUSINESS (33)
DATA ANALYSIS (33)
SPATIAL DATABASES (33)
WIRELESS SENSOR NETWORKS (33)
MATHEMATICAL MODEL (32)
RESOURCE MANAGEMENT (32)
SEMANTICS (32)
AVAILABILITY (30)
OBJECT ORIENTED MODELING (30)
ORGANIZATIONS (30)
PEER-TO-PEER COMPUTING (30)
DATA HANDLING (29)
MEMORY (29)
PROGRAMMING (29)
REAL-TIME SYSTEMS (29)
TRAINING (29)
HADOOP (28)
HEURISTIC ALGORITHMS (28)
DATA STRUCTURES (27)
DATA VISUALIZATION (27)
SECURITY (27)
DATA PRIVACY (26)
STANDARDS (26)
PARALLEL PROCESSING (25)
SCHEDULING (25)
ACCURACY (24)
COLLABORATION (24)
DISTRIBUTED COMPUTING (24)
LIBRARIES (24)
REAL TIME SYSTEMS (24)
SYNCHRONIZATION (24)
PEER TO PEER COMPUTING (23)
DATABASE SYSTEMS (22)
ESTIMATION (22)
LOAD MODELING (22)
RESOURCE DESCRIPTION FRAMEWORK (22)
DATA GRID (21)
CORRELATION (20)
DISTRIBUTED SYSTEMS (20)
META DATA (20)
SENSORS (20)
VECTORS (20)
CONTEXT (19)
EDUCATIONAL INSTITUTIONS (19)
FILE SYSTEMS (19)
ONTOLOGY (19)
REMOTE SENSING (19)
RUNTIME (19)
ARRAYS (18)
DATA MANAGEMENT (18)
MOBILE COMMUNICATION (18)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (18)
CONFERENCES (17)
DELAY (17)
DISTRIBUTED DATABASE (17)
ENGINES (17)
EQUATIONS (17)
NOSQL (17)
QUALITY OF SERVICE (17)
SECURITY OF DATA (17)
RELIABILITY (16)
TIME FACTORS (16)
UNIFIED MODELING LANGUAGE (16)
COMMUNITIES (15)
DATA COMMUNICATION (15)
more

Data set

ieee (927)
Springer (1)

INFONA - science communication portal

Search results

Next-gen tools for big scientific data: ARM data center example

Addressing the big-earth-data variety challenge with the hierarchical triangular mesh

Robust K-subspaces recovery with combinatorial initialization

DERIV: Distributed In-Memory Brand Perception Tracking Framework

Exploring Controlled RDF Distribution

A Scalable Privacy Preserving System for Open Data

Linked data platform for building cloud-based smart applications and connecting API access points with data discovery techniques

Big data analytics as-a-service: Issues and challenges

Data Distribution and Encryption Modelling for PaaS-enabled Cloud Security

DRASH: A Data Replication-Aware Scheduler in Geo-Distributed Data Centers

Brawler to CFAM: Incorporating stochastic engagement-level data in deterministic campaign models

An evaluation of data replication for bioinformatics workflows on NoSQL systems

Study on hotel revenue management without explicitly incorporating competition

Towards Real-Time and Temporal Information Services in Vehicular Networks via Multi-Objective Optimization

Privacy Preserving in Distributed SVM Data Mining on Vertical Partitioned Data

Minimizing the cost of designing fault-tolerant CDN data centers

Extending a Message Passing Runtime to Support Partitioned, Global Logical Address Spaces

A Multi-Objective Optimization Model for Data-Intensive Workflow Scheduling in Data Grids

Teaching MPI from Mental Models

In-Staging Data Placement for Asynchronous Coupling of Task-Based Scientific Workflows

Filter options

Publication date

Content availability

Publication type

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options