Items from 1 to 20 out of 20 results

chapter

Dynamic Load Balancing for High-Performance Simulations of Combustion in Engine Applications

L Antonelli, P D'Ambra

2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing > 133 - 140

19th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2011)

The chemical task in internal combustion engine simulations concerns the solution of a non-linear stiff system of Ordinary Differential Equations (ODEs) for each cell of a discretization grid representing engine geometry. The computational cost of the above task, when a detailed kinetic scheme is used, dominates engine simulations. Due to local physical-chemical conditions, each system...

chapter

An open electronic system level multi-SPARC virtual platform and its toolchain

Pin-Hao Fang, Yu-Lin Wang, Zhong-Ho Chen, A W Y Su, more

2010 International Computer Symposium (ICS2010) > 478 - 482

2010 International Computer Symposium (ICS 2010)

We present a multi-core virtual platform which follows single-core architecture, SPARC v8, available as an open source development suite. The proposed multi-SPARC system operates at electronic system level to accelerate its simulation speed. TLM channels are devised to connect the processors. To simplify the use of the proposed virtual platform, we define some specific APIs for data transaction and...

chapter

A MapReduce Style Framework for Computations on Trees

A Sarje, S Aluru

2010 39th International Conference on Parallel Processing > 343 - 352

39th International Conference on Parallel Processing (ICPP 2010)

The emergence of cloud computing and Google's MapReduce paradigm is renewing interest in the development of broadly applicable high level abstractions as a means to deliver easy programmability and cyber resources to the user, while hiding complexities of system architecture, parallelism and algorithms, heterogeneity, and fault-tolerance. In this paper, we present a high-level framework for computations...

chapter

Optimization of applications with non-blocking neighborhood collectives via multisends on the Blue Gene/P supercomputer

Sameer Kumar, Philip Heidelberger, Dong Chen, Michael Hines

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) > 1 - 11

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

We explore the multisend interface as a data mover interface to optimize applications with neighborhood collective communication operations. One of the limitations of the current MPI 2.1 standard is that the vector collective calls require counts and displacements (zero and non-zero bytes) to be specified for all the processors in the communicator. Further, all the collective calls in MPI 2.1 are...

chapter

Power-aware MPI task aggregation prediction for high-end computing systems

Dong Li, Dimitrios S Nikolopoulos, Kirk Cameron, Bronis R de Supinski, more

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) > 1 - 12

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Emerging large-scale systems have many nodes with several processors per node and multiple cores per processor. These systems require effective task distribution between cores, processors and nodes to achieve high levels of performance and utilization. Current scheduling strategies distribute tasks between cores according to a count of available cores, but ignore the execution time and energy implications...

chapter

A Cluster-Based Implementation of a Fault Tolerant Parallel Reduction Algorithm Using Swarm-Array Computing

Blesson Varghese, Gerard McKee, Vassil Alexandrov

2010 Sixth International Conference on Autonomic and Autonomous Systems > 30 - 36

2010 Sixth International Conference on Autonomic and Autonomous Systems (ICAS 2010)

Recent research in multi-agent systems incorporates fault tolerance concepts. However, the research does not explore the extension and implementation of such ideas for large-scale parallel computing systems. The work reported in this paper investigates a swarm-array computing approach, namely 'Intelligent Agents'. In the approach considered, a task to be executed on a parallel computing system is decomposed...

chapter

A Light-weight API for Portable Multicore Programming

Christopher G Baker, Michael A Heroux, H Carter Edwards, Alan B Williams

2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing > 601 - 606

18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2010)

Multicore nodes have become ubiquitous in just a few years. At the same time, writing portable parallel software for multicore nodes is extremely challenging. Widely available programming models such as OpenMP and Pthreads are not useful for devices such as graphics cards, and more flexible programming models such as RapidMind are only available commercially. OpenCL represents the first truly portable...

chapter

Experiments in running a parallel MPI financial simulator on a cluster and a dedicated grid

A Ibrahim

2010 The 7th International Conference on Informatics and Systems (INFOS) > 1 - 4

2010 7th International Conference on Informatics and Systems (INFOS 2010)

Many computing-intensive applications in different domains are benefiting from new High Performance Computing (HPC) architectures such as clusters and computational grids. This experimental study shows the performance results of parallelizing a computing-intensive risk management financial simulator application using the Message Passing Interface (MPI) and running it on two configurations of a dedicated...

chapter

Flat MPI vs. Hybrid: Evaluation of Parallel Programming Models for Preconditioned Iterative Solvers on T2K Open Supercomputer

K. Nakajima

2009 International Conference on Parallel Processing Workshops > 73 - 80

2009 38th International Conference on Parallel Processing Workshops (ICPPW 2009)

In this work, parallel preconditioning methods based on "hierarchical interface decomposition (HID)" and hybrid parallel programming models were applied to finite-element based simulations of linear elasticity problems in media with heterogeneous material properties. Reverse Cuthill-McKee reordering with cyclic multicoloring (CM-RCM) was applied for parallelism through OpenMP. The developed code...

chapter

A Task-Pool Parallel I/O Paradigm for an I/O Intensive Application

Jianjiang Li, Lin Yan, Zhe Gao, Dan Hei

2009 IEEE International Symposium on Parallel and Distributed Processing with Applications > 679 - 684

2009 IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA)

For applications like 3D seismic migration, it is quite important to improve the I/O performance within a cluster computing system. Such seismic data processing applications are I/O-intensive. For example, a large 3D data volume cannot be held entirely in computer memory. Therefore the input data files have to be divided into many fine-grained chunks. Intermediate results...

chapter

PSINS: An Open Source Event Tracer and Execution Simulator

Mustafa M Tikir, Michael A Laurenzano, Laura Carrington, Allan Snavely

2009 DoD High Performance Computing Modernization Program Users Group Conference > 444 - 449

DoD High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC 2009)

As the size of today's supercomputers grows exponentially in numbers of processors, the applications that run on these systems scale to larger processor counts. The majority of these applications use the Message Passing Interface (MPI); a trace of these MPI communication events is an important input to the tools that visualize, simulate for performance modeling, or enable tuning of parallel applications...

chapter

Utilizing MPJ Express Software in Parallel DNA Sequence Alignment

M. Nordin, A. Rahman

2009 International Conference on Future Computer and Communication > 567 - 571

2009 International Conference on Future Computer and Communication (ICFCC)

DNA sequence alignment is the most common application in computational biology. It is an essential prerequisite of many other operations in computational biology applications. Optimal alignment for a large-scale DNA sequence dataset is a known example of a time- and space-consuming task. The Smith-Waterman algorithm is a well-recognized technique to produce optimal alignment between DNA sequences....

chapter

Characterizing communication patterns of NAS-MPI benchmark programs

I. Lee

IEEE Southeastcon 2009 > 158 - 163

IEEE SoutheastCon 2009

Scientific computing algorithms on parallel computing environments are widely used to simulate scientific and engineering phenomena in place of physical experiments. The performance of these applications on parallel computing environments depends on the communication delay between processors. To reduce the delay, communication patterns have been studied by many research scientists. The communication...

chapter

GiFT: Automating FTPA Implementation for MPI Programs

Hongyi Fu, Yunfei Du, Panfeng Wang, Jia Jia, more

2008 14th IEEE International Conference on Parallel and Distributed Systems > 91 - 98

2008 14th IEEE International Conference on Parallel and Distributed Systems

Fault tolerance is a critical issue in the arena of large-scale computing. The fault-tolerant parallel algorithm (FTPA) is an application-level technique for tolerating hardware failures. FTPA achieves fast failure recovery making use of parallel recomputing. However, it complicates the coding of the application program. This paper uses compiler technology to automate the design of FTPA, and introduces...

chapter

Concrete Partial Evaluation in Ruby

A. Keep, A. Chauhan

2008 IEEE Fourth International Conference on eScience > 346 - 347

2008 IEEE Fourth International Conference on eScience

Modern scientific research is a collaborative process, with researchers from many disciplines and institutions working toward a common goal. Dynamic languages, like Ruby, provide a platform for quickly developing simulation and analysis tools, freeing researchers to focus on research instead of spending time developing infrastructure. Ruby is a particularly good fit, allowing incorporation of existing...

chapter

EasyGrid Enabling of Iterative Tightly-Coupled Parallel MPI Applications

A. Sena, A. Nascimento, C. Boeres, V. Rebello

2008 IEEE International Symposium on Parallel and Distributed Processing with Applications > 199 - 206

2008 IEEE International Symposium on Parallel and Distributed Processing with Applications

This paper addresses the challenge of how to permit tightly coupled parallel applications, optimised for uniform, stable, static environments, to execute equally efficiently in environments which exhibit the complete opposite characteristics. Using the N-body problem as a case study, both the traditional and proposed grid-enabled MPI implementations of the popular ring algorithm are analysed. Results...

chapter

An Efficient Time Management Scheme for Large-Scale Distributed Simulation Based on JXTA Peer-to-Peer Network

A. Boukerche, Ming Zhang, Hengheng Xie

2008 12th IEEE/ACM International Symposium on Distributed Simulation and Real-Time Applications > 167 - 172

2008 12th IEEE International Symposium on Distributed Simulation and Real-Time Applications

As an emerging technology, P2P is spreading to the distributed simulation area, and many distributed simulation frameworks have used P2P as the middleware to interconnect their existing single-processor simulators to form distributed environments for simulation execution. In terms of simulation time management, most existing tools use a middleware layer to implement and support time management in a...

chapter

Model Checking for Integrating Dynamic Load Distribution into Parallel Applications

J.L. Quiroz-Fabian, M. Aguilar-Cornejo, G. Roman-Alonso, M.A. Castro-Garcia

2008 Mexican International Conference on Computer Science > 221 - 231

2008 Mexican International Conference on Computer Science

Many parallel applications running on a distributed memory cluster generate data dynamically to process during their execution. In this case it is possible that some cluster nodes become overloaded. To improve performance we can integrate a dynamic data distribution algorithm. The integration of a dynamic load distribution policy into an application must consider the correct programming of several...

chapter

Maotai: View-Oriented Parallel Programming on CMT Processors

Jiaqi Zhang, Zhiyi Huang, Wenguang Chen, Qihang Huang, more

2008 37th International Conference on Parallel Processing > 636 - 643

2008 37th International Conference on Parallel Processing (ICPP)

View-oriented parallel programming (VOPP) is a novel parallel programming model which uses views for communication between multiple processes. With the introduction of views, mutual exclusion and shared data access are bundled together, which offers both convenience and high performance to parallel programming. This paper presents the implementation of VOPP on chip-multi threading processors, e.g...

chapter

The FETI-DPEM method for the parallel solution of 3D EM problems

Yu-Jia Li, Jian-Ming Jin

2008 IEEE Antennas and Propagation Society International Symposium > 1 - 4

2008 IEEE Antennas and Propagation Society International Symposium and USNC/URSI National Radio Science Meeting

This paper presents the parallel solution of general 3D EM problems using the FETI-DPEM (dual-primal finite element tearing and interconnecting) method. Excellent parallel efficiency has been achieved on a cluster system with an automatic decomposition of the computational domain into hundreds of subdomains. In this work, the parallel implementation of the FETI-DPEM method on a distributed-memory...

Filter options

Keywords:
COMPUTATIONAL MODELING
PROGRAM PROCESSORS
APPLICATION PROGRAM INTERFACES

INFONA - science communication portal
