Search results

Items from 1 to 20 out of 22 results

chapter

The PARSEC benchmark suite: Characterization and architectural implications

Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, Kai Li

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 72 - 81

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

This paper presents and characterizes the Princeton Application Repository for Shared-Memory Computers (PARSEC), a benchmark suite for studies of Chip-Multiprocessors (CMPs). Previously available benchmarks for multiprocessors have focused on high-performance computing applications and used a limited number of synchronization methods. PARSEC includes emerging applications in recognition, mining and...

chapter

Experimental Validation and Exploration of a New Kind of Synchronization in Linux

Fangfang Zhu, Yucong Chen, Jianqiang Wang, Gaofeng Zhang, et al.

2016 International Symposium on System and Software Reliability (ISSSR) > 91 - 96

2016 International Symposium on System and Software Reliability (ISSSR)

PWCS (Probabilistic Write / Copy-Select) is a new kind of lock-free synchronization mechanism with wait-free characteristics, proposed by Nicholas Mc Guire at the 13th Real-Time Linux Workshop, which utilizes the inherent randomness of modern computer systems. It aims at addressing the multi-reader/single-writer problem in Linux. Based on the original label-based PWCS, we propose a hash-based...

chapter

LSPP Introduction and Committees

Kevin J. Barker, Chris D. Carothers, Eric van Hensbergen

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 1339

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

chapter

μStreams: a tool for automated streaming pipeline generation on soft-core processors

Kris Heid, Jan Weber, Christian Hochberger

2016 International Conference on FPGA Reconfiguration for General-Purpose Computing (FPGA4GPC) > 25 - 30

2016 International Conference on FPGA Reconfiguration for General-Purpose Computing (FPGA4GPC)

FPGAs have grown considerably in the past years. It is now possible to implement several soft-core processors in one FPGA, which gives the developer considerable parallelism. Unfortunately, most application code is still available only in sequential form. Thus, in this contribution we present a tool that enables the automated transformation of an application into a streaming pipeline using...

chapter

An algorithm to improve MPI-PageRank performance by reducing synchronization time

Sumalee Sangamuang, Pruet Boonma, Lai Lai Win Kyii

2015 International Computer Science and Engineering Conference (ICSEC) > 1 - 4

2015 International Computer Science and Engineering Conference (ICSEC)

Ranking is an important operation in web searching. Among many ranking algorithms, PageRank is one of the most notable. However, sequential PageRank computing on a large web-link graph is not efficient. To address this limitation, parallel PageRank implemented on the Message Passing Interface (MPI) is a viable choice. Generally speaking, MPI-PageRank will be implemented using a root node and many computing,...

chapter

Test and Repair Flow for Shared BISR in Asynchronous Multi-processors

Gang Wang, Xu Wang, Xinke Chen, Shuangbai Xue

2014 20th IEEE International Symposium on Asynchronous Circuits and Systems > 105 - 107

2014 20th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC)

We present a hierarchical test and repair flow for shared BISR (Built-In Self-Repair) in asynchronous multi-processors. The flow partitions the memories local to a processor in groups and treats the groups as a whole when doing the repair. The flow runs automatically with few interventions except at the beginning stage. It can be used effectively for practical industrial test and repair. Its test...

chapter

MPI-based Parallelization for ILP-based Multi-relational Concept Discovery

Alev Mutlu, Pinar Senkul, Yusuf Kavurucu

2011 10th International Conference on Machine Learning and Applications and Workshops > 1 > 59 - 62

2011 Tenth International Conference on Machine Learning and Applications (ICMLA 2011)

Multi-relational concept discovery is a predictive learning task that aims to discover descriptions of a target concept in the light of past experiences. Parallelization has emerged as a solution to deal with efficiency and scalability issues relating to large search spaces in concept discovery systems. In this work, we describe a parallelization method for the ILP-based concept discovery system called...

chapter

Emerging applications for multi/many-core processors

Victor W. Lee, Yen-Kuang Chen, Pradeep Dubey

2011 IEEE International Symposium on Circuits and Systems (ISCAS) > 1524 - 1527

2011 IEEE International Symposium on Circuits and Systems (ISCAS)

There has always been a strong relationship between computer applications and computer architectures. Advances in computer architecture enable new usage models; new usage models challenge new architectures. For many decades, the interplays between applications and architectures have resulted in significant progress in the computer technologies. Recently, the computer industry adopts the multi/many-core...

chapter

Estimating overheads of OpenMP directives

Sareh Doroodian, Nima Ghaemian, Mohsen Sharifi

2011 19th Iranian Conference on Electrical Engineering > 1 - 5

2011 19th Iranian Conference on Electrical Engineering (ICEE)

Estimating the execution time of programs has always been a concern in computer science. With the emergence of multi-core processors, this concern has taken on new perspectives, and new parameters affect the runtime performance of parallel applications. To estimate the execution time of parallel applications, we investigate the overheads caused by parallelizing an application by identifying the overheads...

chapter

A Parallel Simulator for Large-Scale Parallel Computers

Yuzhe Zhi, Yi Liu, Lin Jiao, Peng Zhang

2010 Ninth International Conference on Grid and Cloud Computing > 196 - 200

2010 9th International Conference on Grid and Cloud Computing (GCC 2010)

This paper describes the design and application of an execution-driven parallel simulator for predicting performance of Large-Scale Parallel Computers. The simulator can be used in hardware validation and software development for large-scale parallel computers. It simulates processors of each node, network components and disk I/O components. To illustrate the capabilities of our simulator, we describe...

chapter

Building a Personal High Performance Computer with Heterogeneous Processors

Qiang Li, Zhigang Huo, Ninghui Sun

2010 Ninth International Conference on Grid and Cloud Computing > 223 - 228

2010 9th International Conference on Grid and Cloud Computing (GCC 2010)

A personal high performance computer (PHPC) requires low cost and high performance. Teraflops PHPC systems with special accelerator units such as GPGPUs have been presented, but they have difficulties in programming, compatibility and applicability. In this paper, we present HPP-PHPC, a hybrid architecture of heterogeneous processors connected by a non-coherent off-chip system bus. The performance of...

chapter

A Method of Computation Decomposition on Tightly-Nested Loop Automatic Parallelization

Zhao Yan, Lei Liu

2009 Third International Symposium on Intelligent Information Technology Application > 3 > 431 - 434

2009 Third International Symposium on Intelligent Information Technology Application

An automatic parallelization method for tightly-nested loops running on multi-core systems is proposed. First, according to the physical characteristics of multi-core processors, a way is presented to solve the data locality problem during data decomposition. Second, to increase the parallel granularity of tightly-nested loops, the method discussed in this article studies computation decomposition...

chapter

Efficiency research of batch and single pattern MLP parallel training algorithms

V. Turchenko, L. Grandinetti

2009 IEEE International Workshop on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications > 218 - 224

2009 IEEE International Workshop on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS 2009)

The development of parallel algorithms for batch and single pattern back propagation training of a multilayer perceptron and the research of their efficiency on a general-purpose parallel computer are presented in this paper. The multilayer perceptron model and the sequential batch and single pattern training algorithms are theoretically described. An algorithmic description of the parallel versions...

chapter

Partitioning Algorithm of 3-D Prestack Parallel Kirchhoff Depth Migration for Imaging Spaces

Jianjiang Li, Dan Hei, Lin Yan

2009 Eighth International Conference on Grid and Cooperative Computing > 276 - 280

2009 Eighth International Conference on Grid and Cooperative Computing (GCC)

In large-scale 3-D prestack Kirchhoff depth migration, the memory required for the imaging spaces is large and can exceed the total memory of a single node. This paper therefore proposes a partitioning algorithm based on process groups in a distributed-memory environment. The imaging spaces are partitioned within a group, and the processes of the group achieve...

chapter

Improving the speed and scalability of distributed simulations of sensor networks

Zhong-Yi Jin, R. Gupta

2009 International Conference on Information Processing in Sensor Networks > 169 - 180

2009 8th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)

Distributed simulation techniques are commonly used to improve the speed and scalability of wireless sensor network simulators. However, accurate simulations of dynamic interactions of sensor network applications incur large synchronization overheads and severely limit the performance of existing distributed simulators. In this paper, we present two novel techniques that significantly reduce such...

chapter

Dynamic measurement of critical-path timing

A.J. Drake, R.M. Senger, H. Singh, G.D. Carpenter, et al.

2008 IEEE International Conference on Integrated Circuit Design and Technology and Tutorial > 249 - 252

2008 IEEE International Conference on IC Design and Technology & Tutorial (ICICDT)

A high-bandwidth critical path monitor (1 sample/cycle at 4-5 GHz) capable of providing real-time timing margin information to a variable voltage/frequency scaling control loop is described. The critical path monitor tracks the critical path delay to within 1 FO2 inverter delay with a standard deviation less than 3 FO2 delays over process, voltage, temperature, and workload. The CPM is sensitive...

chapter

BFT-WS: A Byzantine Fault Tolerance Framework for Web Services

Wenbing Zhao

2007 Eleventh International IEEE EDOC Conference Workshop > 89 - 96

2007 11th IEEE International Enterprise Distributed Object Computing Conference Workshops (EDOC Workshops)

Many Web services are expected to run with a high degree of security and dependability. To achieve this goal, it is essential to use a Web-services compatible framework that tolerates not only crash faults, but Byzantine faults as well, due to the untrusted communication environment in which the Web services operate. In this paper, we describe the design and implementation of such a framework, called...

chapter

Policy-based resource management mechanism for dynamic Grid environments

Youngjoo Han, Hyewon Song, Chan-Hyun Youn, EunBo Shim, et al.

2007 IEEE Sarnoff Symposium > 1 - 5

2007 IEEE Sarnoff Symposium

As Grid networks grow, the complexity of resource management in them increases dramatically. A policy-based resource management system is suitable for managing Grid resources efficiently. However, a policy made at one moment may not be appropriate at another time, because the condition of the Grid resources changes over time. Thus, in this paper, we propose asynchronous policy-based...

chapter

Applying static network protocols to dynamic networks

Yehuda Afek, Baruch Awerbuch, Eli Gafni

28th Annual Symposium on Foundations of Computer Science (sfcs 1987) > 358 - 370

28th Annual Symposium on Foundations of Computer Science

This paper addresses the problem of how to adapt an algorithm designed for fixed topology networks to produce the intended results, when run in a network whose topology changes dynamically, in spite of encountering topological changes during its execution. We present a simple and unified procedure, called a reset procedure, which, when combined with the static algorithm, achieves this adaptation....

article

On the Unavoidability of Metastable Behavior in Digital Systems

Lindsay Kleeman, Antonio Cantoni

IEEE Transactions on Computers > 1987 > C-36 > 1 > 109 - 112

Fault-free digital systems can fail as a result of metastable behavior when asynchronous inputs have critical timing combinations. The problem of metastable behavior is generally considered to be unavoidable in digital systems that synchronize asynchronous inputs. This correspondence extends previous results on the unavoidability of metastable behavior. The set of inputs to the digital system is generalized...

Data set: ieee
Keywords: SYNCHRONIZATION, COMPUTERS, PROGRAM PROCESSORS

