In this paper, we provide a comparison of language features and runtime systems of commonly used threading parallel programming models for high performance computing, including OpenMP, Intel Cilk Plus, Intel TBB, OpenACC, Nvidia CUDA, OpenCL, C++11 and PThreads. We then report our performance comparison of OpenMP, Cilk Plus and C++11 for data and task parallelism on CPU using benchmarks. The results show...
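As a point of reference for the kind of data-parallel kernel such comparisons typically benchmark, here is a minimal reduction written with C++11 threads; the array size and chunking strategy are illustrative assumptions, not the paper's benchmark, and the same loop maps to "#pragma omp parallel for reduction(+:sum)" in OpenMP or a cilk_for in Cilk Plus.

#include <algorithm>
#include <cstddef>
#include <iostream>
#include <numeric>
#include <thread>
#include <vector>

int main() {
    const std::size_t n = 1 << 20;                 // illustrative problem size
    std::vector<double> data(n, 1.0);
    const unsigned nthreads = std::max(1u, std::thread::hardware_concurrency());
    std::vector<double> partial(nthreads, 0.0);
    std::vector<std::thread> workers;

    // Each thread reduces one contiguous chunk into its own slot (no sharing, no locks).
    for (unsigned t = 0; t < nthreads; ++t) {
        workers.emplace_back([&, t] {
            const std::size_t begin = t * n / nthreads;
            const std::size_t end   = (t + 1) * n / nthreads;
            partial[t] = std::accumulate(data.begin() + begin, data.begin() + end, 0.0);
        });
    }
    for (auto& w : workers) w.join();

    // Final serial combine of the per-thread partial sums.
    const double sum = std::accumulate(partial.begin(), partial.end(), 0.0);
    std::cout << sum << "\n";
}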
MPI (Message Passing Interface) and OpenMP are two tools broadly used to develop parallel programs. On the one hand, MPI has the advantage of high performance while being difficult to use. On the other hand, OpenMP is very easy to use but is restricted to shared-memory architectures. CAPE is an approach based on checkpoints to allow the execution of OpenMP programs on distributed-memory architectures...
We present a programming methodology and runtime performance case study comparing the declarative data flow coordination language S-Net with Intel's Concurrent Collections (CnC). As a coordination language S-Net achieves a near-complete separation of concerns between sequential software components implemented in a separate algorithmic language and their parallel orchestration in an asynchronous data...
Parallel programming and data-parallel algorithms have been the main techniques supporting high-performance computing for many decades. A major conceptual step was taken by L. Valiant who introduced the Bulk-Synchronous Parallel (BSP) model. Parallel algorithms on BSP can be designed and measured by taking into account not only the classical balance between time and parallel space but also communication...
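For orientation, the BSP cost of one superstep with local work w, at most h words sent or received per processor, per-word communication gap g and barrier latency \ell is (standard textbook notation, not necessarily that of the cited work):

\[ T_{\text{superstep}} = w + h\,g + \ell, \qquad T_{\text{total}} = \sum_{s=1}^{S} \bigl( w_s + h_s\,g + \ell \bigr) \]

so an algorithm is balanced by trading local computation w_s against the communication volume h_s and the number of supersteps S.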
Application portability between different multicore architecture-parallel programming paradigm/tool pairs is a big problem nowadays, often leading to a complete rewrite of an application when switching from one architecture-paradigm pair to another. This is caused by a wide variety of architectural properties requiring different optimization techniques for different architectures, typically hiding the...
Predicting how well applications may run on modern systems is becoming increasingly challenging. It is no longer sufficient to look at the number of floating point operations and communication costs; one also needs to model the underlying systems and how their topology, heterogeneity, system loads, etc., may impact performance. This work focuses on developing a practical model for heterogeneous computing...
The Coarse-Grained Monte Carlo (CGMC) method is a multi-scale stochastic mathematical and simulation framework for spatially distributed systems. CGMC simulations are important tools for studying phenomena such as catalysis, crystal growth, surface diffusion, phase transitions on single crystals, and cell membrane receptor dynamics. In parallel CGMC, the tau-leap method is used for parallel simulations...
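For orientation, the textbook tau-leap update (not necessarily the exact variant used in parallel CGMC) advances the state over a fixed leap \tau by sampling how often each event channel fires during the leap:

\[ k_j \sim \mathrm{Poisson}\bigl(a_j(\mathbf{x})\,\tau\bigr), \qquad \mathbf{x}(t+\tau) = \mathbf{x}(t) + \sum_j k_j\,\boldsymbol{\nu}_j \]

where a_j is the propensity of channel j and \boldsymbol{\nu}_j its state-change vector; leaping over many events per step is what makes the method amenable to parallel execution.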
Sparse and unstructured computations are widely involved in scientific and engineering applications. In such computations, data arrays may be indexed indirectly through the values of other arrays or through non-affine subscripts, so the data access pattern is not known until runtime. So far, all parallel computing strategies for this kind of irregular problem are oriented toward a single network topology, which cannot...
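A minimal illustration of the indirect, non-affine access pattern meant here (the array names are hypothetical): which elements of y are written depends on the runtime contents of the index arrays, so the pattern cannot be determined statically by a compiler.

#include <cstddef>
#include <vector>

// y is updated through index arrays: the targets y[idx[i]] and sources x[col[i]]
// are only known once idx and col have been filled at runtime.
void irregular_update(std::vector<double>& y,
                      const std::vector<double>& a,
                      const std::vector<double>& x,
                      const std::vector<int>& idx,
                      const std::vector<int>& col) {
    for (std::size_t i = 0; i < a.size(); ++i)
        y[idx[i]] += a[i] * x[col[i]];
}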
Owing to the fast development of network technologies, executing parallel programs on distributed systems that connect heterogeneous machines has become feasible, but we still face some challenges: workload imbalance in such an environment may arise not only from uneven load distribution among machines, as in parallel systems, but also from a distribution that does not match the characteristics of each...
Correctness and performance are the principal requirements of a parallel system. Due to its complexity and uncertainty, it is necessary to model it. The hierarchical TCPN model proposed in this paper supports investigation at various levels of abstraction and analysis of performance, functional validity and correctness. It describes the parallel program and the resources respectively to bring less effect...
We present a scalable spatial decomposition coloring approach to implement molecular dynamics simulations with the embedded atom method (EAM) on multi-core architectures. It effectively parallelizes reduction operations on irregular arrays in molecular dynamics simulations. In the OpenMP programming model, our methodology prevents the same memory location from being simultaneously modified by more than...
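A minimal sketch, not the paper's EAM kernel, of how coloring removes the write conflict in an irregular OpenMP reduction: interactions are pre-grouped so that within one color no two pairs share an atom, letting each color's loop run in parallel without atomics or locks.

#include <vector>

struct Pair { int i, j; double w; };   // an interacting atom pair and its contribution

// colors[c] holds pairs such that no atom index appears twice within one color,
// so the updates to 'force' inside a single color cannot collide.
void colored_reduction(std::vector<double>& force,
                       const std::vector<std::vector<Pair>>& colors) {
    for (const auto& group : colors) {              // colors are processed one after another
        #pragma omp parallel for
        for (long k = 0; k < (long)group.size(); ++k) {
            const Pair& p = group[k];
            force[p.i] += p.w;                      // safe: i and j are unique within the color
            force[p.j] -= p.w;
        }
    }
}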
This paper introduces a new parallel programming model motivated by: 1) the concept that computation should move to, and execute near, the global data which it accesses, 2) a set of extended memory semantics to provide fine-grained global synchronization, 3) architectural support for fast lightweight thread creation/destruction/migration, and 4) the need for a high performance language to provide...
As high-end computing systems continue to grow in scale, the performance that applications can achieve on such large scale systems depends heavily on their ability to avoid explicitly synchronized communication with other processes in the system. Accordingly, several modern and legacy parallel programming models (such as MPI, UPC, and Global Arrays) have provided many programming constructs that enable...
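A minimal sketch of the style of construct alluded to here, using MPI one-sided communication (the window layout and target rank are illustrative): rank 0 deposits data directly into rank 1's memory, and the only synchronization is the collective fence closing the access epoch, with no matching receive posted by the target.

#include <cstdio>
#include <mpi.h>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    // Each process exposes one integer through an RMA window.
    int local = rank;
    MPI_Win win;
    MPI_Win_create(&local, sizeof(int), sizeof(int), MPI_INFO_NULL, MPI_COMM_WORLD, &win);

    MPI_Win_fence(0, win);
    if (rank == 0 && size > 1) {
        int value = 42;
        // Write into rank 1's window; rank 1 does not post a matching receive.
        MPI_Put(&value, 1, MPI_INT, /*target rank*/ 1, /*displacement*/ 0, 1, MPI_INT, win);
    }
    MPI_Win_fence(0, win);          // fence completes the epoch; no point-to-point handshake

    if (rank == 1) std::printf("rank 1 now holds %d\n", local);
    MPI_Win_free(&win);
    MPI_Finalize();
}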
The behavioral correctness of parallel programs has a pivotal role in computational sciences and engineering applications as researchers draw scientific conclusions from the results generated by parallel applications. Moreover, with the advent of multicore processors, the development of parallel programs should be facilitated for the mainstream developers. While numerous programming models and APIs...
In this work we present optimizations of a grid-based projector-augmented wave method software, GPAW, for the Blue Gene/P architecture. The improvements are achieved by exploiting the advantages of combined shared- and distributed-memory programming, also known as hybrid programming. The work focuses on optimizing a very time-consuming operation in GPAW, the finite-difference stencil operation, and different hybrid...
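A minimal sketch of the shared-memory half of such a hybrid scheme, an OpenMP-threaded 7-point finite-difference stencil; grid size, indexing and coefficients are illustrative, and in the actual hybrid setup an MPI domain decomposition would wrap around this loop.

#include <vector>

// 7-point Laplacian stencil on an n*n*n grid, interior points only.
// collapse(2) spreads the two outer grid loops over the OpenMP threads.
void stencil(const std::vector<double>& in, std::vector<double>& out, int n, double h2inv) {
    auto at = [n](int i, int j, int k) { return (i * n + j) * n + k; };
    #pragma omp parallel for collapse(2)
    for (int i = 1; i < n - 1; ++i)
        for (int j = 1; j < n - 1; ++j)
            for (int k = 1; k < n - 1; ++k)
                out[at(i, j, k)] = h2inv *
                    (in[at(i - 1, j, k)] + in[at(i + 1, j, k)] +
                     in[at(i, j - 1, k)] + in[at(i, j + 1, k)] +
                     in[at(i, j, k - 1)] + in[at(i, j, k + 1)] -
                     6.0 * in[at(i, j, k)]);
}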
Computer processors have stepped into the multi-core era, providing a good opportunity to spread parallel discrete event simulation. The parallel programming model and the synchronization problems arising during the parallelization of discrete event simulation on multi-core platforms are discussed. A parallel discrete event simulator based on a multi-core platform was designed and implemented...
Location consistency (LC) is a weak memory consistency model which is defined entirely on partial order execution semantics of parallel programs. Compared with sequential consistency (SC), LC is scalable and provides ample theoretical parallelism. This makes LC an interesting memory model in the upcoming many-core parallel processing era. Previous work has pointed out that LC does not guarantee SC...
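Not LC itself, but a standard store-buffering litmus test in C++11 relaxed atomics gives a concrete feel for what "weaker than SC" means: the outcome r1 == 0 and r2 == 0 is forbidden under sequential consistency yet permitted by the relaxed model (and observable on some hardware over repeated runs).

#include <atomic>
#include <iostream>
#include <thread>

int main() {
    std::atomic<int> x{0}, y{0};
    int r1 = 0, r2 = 0;

    // Store-buffering litmus test: each thread writes one flag, then reads the other.
    std::thread t1([&] {
        x.store(1, std::memory_order_relaxed);
        r1 = y.load(std::memory_order_relaxed);
    });
    std::thread t2([&] {
        y.store(1, std::memory_order_relaxed);
        r2 = x.load(std::memory_order_relaxed);
    });
    t1.join();
    t2.join();

    // Under SC at least one of r1, r2 must be 1; the relaxed model also allows 0/0.
    std::cout << "r1=" << r1 << " r2=" << r2 << "\n";
}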
The emergence of multi-core and many-core processors has introduced new opportunities and challenges to EDA research and development. While the availability of increasing parallel computing power holds new promise to address many computing challenges in CAD, the leverage of hardware parallelism can only be possible with a new generation of parallel CAD applications. In this paper, we propose a novel...
Computer clusters are a very cost-effective approach for high performance computing, but simulating a complete cluster is still an open research problem. The obvious approach - to parallelize individual node simulators - is complex and slow. Combining individual parallel simulators implies synchronizing their progress of time. This can be accomplished with a variety of parallel discrete event simulation...
Traditional implementations of conditional critical regions and monitors can lead to unproductive "busy waiting" if processes are allowed to wait on arbitrary boolean expressions. Techniques from global flow analysis may be employed at compile time to obtain information about which critical regions (monitor calls) are enabled by the execution of a given critical region (monitor call). We...
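A minimal C++ sketch of the situation described (not the paper's flow-analysis technique): a monitor-style consumer waits on a boolean condition, and the condition-variable wait re-evaluates that condition only when another thread signals it, rather than busy-waiting on the expression.

#include <condition_variable>
#include <iostream>
#include <mutex>
#include <queue>
#include <thread>

std::mutex m;
std::condition_variable cv;
std::queue<int> buffer;

void producer() {
    for (int i = 0; i < 5; ++i) {
        { std::lock_guard<std::mutex> lk(m); buffer.push(i); }
        cv.notify_one();                                // signal only when the condition may now hold
    }
}

void consumer() {
    for (int taken = 0; taken < 5; ++taken) {
        std::unique_lock<std::mutex> lk(m);
        cv.wait(lk, [] { return !buffer.empty(); });    // blocks instead of spinning on the predicate
        std::cout << buffer.front() << "\n";
        buffer.pop();
    }
}

int main() {
    std::thread c(consumer), p(producer);
    p.join();
    c.join();
}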