In the era of big data, new scientific applications such as those used in astronomy [1] are emerging and challenging High Performance Computing (HPC) systems and software. Traditionally, HPC applications were compute-bound, making light use of I/O capabilities only at the start and end of execution. In contrast, emerging applications exhibit data-intensive behaviors, raising several new challenges...
Stateful data analytics frameworks have emerged to provide fresh, low-latency results for big data processing. At present, a fine-grained data model is desirable in mainstream data processing frameworks such as Spark. However, Spark adopts a coarse-grained data model to facilitate parallelization, which makes fine-grained data access in stateful data analytics very challenging....
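The tension described in this abstract can be illustrated with a minimal sketch in plain Python (not the Spark API): a coarse-grained model applies one transformation to an entire partition in bulk, while stateful analytics needs to read and update state for individual keys.

```python
# Coarse-grained access: one bulk transformation over a whole partition.
def coarse_grained_map(partition, fn):
    """Apply fn to every record in a partition as one bulk operation."""
    return [fn(record) for record in partition]

# Fine-grained access: per-key mutable state, the pattern that is hard to
# express when the engine only exposes whole-partition transformations.
class FineGrainedState:
    def __init__(self):
        self._state = {}

    def update(self, key, value):
        self._state[key] = self._state.get(key, 0) + value

    def get(self, key):
        return self._state.get(key, 0)

# Coarse-grained: double every reading in the partition.
partition = [1, 2, 3]
doubled = coarse_grained_map(partition, lambda x: x * 2)  # [2, 4, 6]

# Fine-grained: running total per key across a stream of events.
state = FineGrainedState()
for key, value in [("a", 1), ("b", 2), ("a", 3)]:
    state.update(key, value)
total_a = state.get("a")  # 4
```

The sketch shows why the two models clash: the bulk operator has no natural place to hang per-key reads and writes, so fine-grained state must be bolted on outside the coarse-grained dataflow.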
Computational and statistical assessment of data has become the most valuable resource in every field, including geoscience, for making accurate decisions. Analysis of soil and earth structures requires complex mathematical and numerical simulation, which demands significant expertise and resources, making the tasks costly and tedious. The unavailability of proper open-source tools and technical resources...
Data analytics has become not only an essential part of day-to-day decision making, but also reinforces long-term strategic decisions. Whether it is real-time fraud detection, resource management, tracking and prevention of disease outbreaks, natural disaster management, or intelligent traffic management, the extraction and exploitation of insightful information from unparalleled quantities of data...
Distributed applications from different domains such as health care, e-commerce, science, and social networks tend to generate large volumes of heterogeneous data that grow exponentially over time, leading to big data sets. Descriptive analytics on big data sets poses a great challenge for traditional data analytics tools, since it must be performed on the full data set, unlike predictive...
Big data can be defined as large data sets generated from different sources such as social media, audio, imaging, and website logs. A need exists to process and analyze this huge amount of data to extract meaningful information, which can be a challenging task. Big data exceeds the processing capability of traditional databases to capture, manage, and process the voluminous...
Data analytics is becoming increasingly important in big data applications. Adaptively subsetting large amounts of data to extract interesting events, such as the centers of hurricanes or thunderstorms, and then statistically analyzing and visualizing the subsets, is an effective way to analyze ever-growing data. This is particularly crucial for analyzing Earth Science data, such as extreme weather. The Hadoop...
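The adaptive subsetting this abstract describes can be sketched in a few lines of plain Python. The field name and threshold here are hypothetical, not taken from the paper: the grid cell with minimum sea-level pressure serves as a crude proxy for a storm center, and a fixed-size window around it is extracted for further analysis.

```python
def find_minimum(grid):
    """Return (row, col) of the smallest value in a 2-D list of numbers."""
    best = (0, 0)
    for i, row in enumerate(grid):
        for j, val in enumerate(row):
            if val < grid[best[0]][best[1]]:
                best = (i, j)
    return best

def subset_around(grid, center, radius):
    """Clip a (2*radius+1)-wide window around center, staying in bounds."""
    i0 = max(center[0] - radius, 0)
    j0 = max(center[1] - radius, 0)
    i1 = min(center[0] + radius + 1, len(grid))
    j1 = min(center[1] + radius + 1, len(grid[0]))
    return [row[j0:j1] for row in grid[i0:i1]]

# Hypothetical sea-level pressure field (hPa); the 985 cell is the storm core.
pressure = [
    [1012, 1010, 1011],
    [1009,  985, 1008],
    [1011, 1007, 1010],
]
center = find_minimum(pressure)            # (1, 1): the low-pressure core
window = subset_around(pressure, center, 1)  # 3x3 subset around the event
```

Only the extracted window, rather than the full field, would then be passed to the statistical analysis and visualization steps, which is what makes the approach scale to ever-growing data.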
More varied data channels, increasingly diverse analytic methods, and new deployment models--along with some fundamental technology shifts--will significantly impact the next generation of big data systems.
Beehive is a parallel programming framework designed for cluster-based computing environments in cloud data centers. It is specifically targeted at graph data analysis problems. The Beehive framework provides the abstraction of a key-value-based global object store, which is maintained in the memory of the cluster nodes. Its computation model is based on optimistic concurrency control in executing concurrent...
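The combination the abstract describes, an in-memory key-value store with optimistic concurrency control, can be sketched in plain Python (this is illustrative, not the Beehive API): each value carries a version, a commit succeeds only if no other task has changed that version, and a losing task re-reads and retries.

```python
import threading

class VersionedStore:
    """In-memory key-value store where each value carries a version number."""
    def __init__(self):
        self._data = {}          # key -> (version, value)
        self._lock = threading.Lock()

    def read(self, key):
        with self._lock:
            return self._data.get(key, (0, None))

    def commit(self, key, expected_version, new_value):
        """Compare-and-swap: apply only if the version is unchanged."""
        with self._lock:
            version, _ = self._data.get(key, (0, None))
            if version != expected_version:
                return False     # conflict detected: caller must retry
            self._data[key] = (version + 1, new_value)
            return True

def optimistic_increment(store, key):
    """Read-compute-validate loop typical of optimistic concurrency control."""
    while True:
        version, value = store.read(key)
        if store.commit(key, version, (value or 0) + 1):
            return               # validation passed, update committed

store = VersionedStore()
threads = [threading.Thread(target=optimistic_increment, args=(store, "hits"))
           for _ in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# All eight increments land despite conflicts: read("hits") -> (8, 8)
```

The design choice is that no task holds a lock while computing; conflicts are detected only at commit time, which works well when, as in many graph workloads, concurrent tasks rarely touch the same vertex.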
As data volumes in scientific applications have grown exponentially, new scientific methods to analyze and organize the data are required. MapReduce programming is driving Internet services, and those services operate in cloud environments. Hence, resources must be provisioned efficiently to handle diverse MapReduce applications. In this paper we show the Hadoop application with...
Current workflow net models of WS-BPEL are feature-complete, but almost all of them lack data information, so they cannot be used to detect data access exceptions in WS-BPEL. To overcome this shortcoming, this paper presents a new type of Petri net, DWFN (data workflow nets), which makes data flow modeling and exception detection for WS-BPEL possible. Then the DWFN model of corresponding...
Requirements modeling is a crucial step in the software development process and plays an important role in requirements engineering. Requirements models are used to discover and clarify the functional and data requirements for software systems. They are the basis for understanding user requirements and designing information systems. This paper describes an entire process of building a software requirements...
This paper focuses on applying the non-redundant rules (MVNR) algorithm to data analysis of forest inventory. By establishing a data mining model, the MVNR algorithm is applied to analyze the relations among species, origin, age, chest circumference, height, and canopy density of trees. The results provide valuable information for forestation programming management...
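The abstract does not give the MVNR algorithm itself, but the quantities any association-rule mining method builds on can be sketched briefly. This is an illustrative example over made-up forest-inventory records, showing how support and confidence are computed for a candidate rule; the attribute names and values are hypothetical.

```python
# Hypothetical forest-inventory records (attribute -> categorical value).
records = [
    {"species": "pine",  "origin": "natural", "age_class": "mature"},
    {"species": "pine",  "origin": "planted", "age_class": "young"},
    {"species": "pine",  "origin": "natural", "age_class": "mature"},
    {"species": "birch", "origin": "natural", "age_class": "mature"},
]

def support(records, items):
    """Fraction of records containing all given (attribute, value) pairs."""
    hits = sum(1 for r in records if all(r.get(k) == v for k, v in items))
    return hits / len(records)

def confidence(records, antecedent, consequent):
    """support(antecedent + consequent) / support(antecedent)."""
    return support(records, antecedent + consequent) / support(records, antecedent)

# Candidate rule: species=pine AND origin=natural => age_class=mature
rule_lhs = [("species", "pine"), ("origin", "natural")]
rule_rhs = [("age_class", "mature")]
sup = support(records, rule_lhs + rule_rhs)       # 0.5 (2 of 4 records)
conf = confidence(records, rule_lhs, rule_rhs)    # 1.0 (both matches are mature)
```

A non-redundant mining method such as MVNR would then prune rules whose support and confidence are already implied by shorter rules, keeping only the most informative ones.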