Search results

article

Dynamic Checkpointing Policy in Heterogeneous Real-Time Standby Systems

Gregory Levitin, Liudong Xing, Yuanshun Dai, Vinod M. Vokkarane

IEEE Transactions on Computers > 2017 > 66 > 8 > 1449 - 1456

This paper models 1-out-of-N standby computing systems with a dynamic checkpointing policy. The system performs a real-time mission task that has to be accomplished within an allowed mission time. During the mission, to facilitate an effective failure recovery the system undergoes checkpointing procedures according to a policy that dynamically determines a checkpointing frequency based on the activated...

chapter

Optimal Algorithms for a Mesh-Connected Computer with Limited Additional Global Bandwidth

Yujie An, Quentin F. Stout

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 937 - 946

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

We give efficient algorithms to solve fundamental data movement problems on mesh-connected computers augmented with limited global bandwidth. Adding a small amount of global bandwidth makes a practical design that combines aspects of mesh and fully connected models to achieve the benefits of each. We give algorithms for sorting, finding the median, finding a spanning tree, and determining various...

chapter

The PARSEC benchmark suite: Characterization and architectural implications

Christian Bienia, Sanjeev Kumar, Jaswinder Pal Singh, Kai Li

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 72 - 81

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

This paper presents and characterizes the Princeton Application Repository for Shared-Memory Computers (PARSEC), a benchmark suite for studies of Chip-Multiprocessors (CMPs). Previous available benchmarks for multiprocessors have focused on high-performance computing applications and used a limited number of synchronization methods. PARSEC includes emerging applications in recognition, mining and...

article

The $t/s$ -Diagnosability of Hypercube Networks Under the PMC and Comparison Models

Jiarong Liang, Qian Zhang

IEEE Access > 2017 > 5 > 5340 - 5346

A

$t/s$

-diagnosable system, a generalization of

$t/t$

-diagnosable system, refers to such a system that all the faulty nodes of the system can be isolated within a set of size at most

$s$

in the presence of at most

$t$

faulty nodes. In this paper, the

$t/s$

-diagnosability of the hypercubes under the PMC model (the comparison model) is evaluated. First, several novel properties of hypercube...

chapter

A Parallel K-Medoids Algorithm for Clustering based on MapReduce

M. Omair Shafiq, Eric Torunski

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 502 - 507

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

One of the most important machine learning techniques include clustering of data into different clusters or categories. There are several decent algorithms and techniques that exist to perform clustering on small to medium scale data. In the era of Big Data and with applications being large-scale and data-intensive in nature, there is a significant increment in volume, variety and velocity of data...

chapter

Left-Preconditioned Communication-Avoiding Conjugate Gradient Methods for Multiphase CFD Simulations on the K Computer

Akie Mayumi, Yasuhiro Idomura, Takuya Ina, Susumu Yamada, more

2016 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA) > 17 - 24

2016 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA)

The left-preconditioned communication avoiding conjugate gradient (LP-CA-CG) method is applied to the pressure Poisson equation in the multiphase CFD code JUPITER. The arithmetic intensity of the LP-CA-CG method is analyzed, and is dramatically improved by loop splitting for inner product operations and for three term recurrence operations. Two LPCA-CG solvers with block Jacobi preconditioning and...

chapter

Experimental Validation and Exploration of a New Kind of Synchronization in Linux

Fangfang Zhu, Yucong Chen, Jianqiang Wang, Gaofeng Zhang, more

2016 International Symposium on System and Software Reliability (ISSSR) > 91 - 96

2016 International Symposium on System and Software Reliability (ISSSR)

PWCS (Probabilistic Write / Copy-Select) is a new kind of lock-free synchronization mechanism with wait-free characteristics proposed by Nicholas Mc Guire at the 13th real-time Linux workshop, which utilizes the inherent randomness of the modern computer systems. It aims at addressing the multi-reader - single-writer problem in Linux. Based on the original label-based PWCS, we propose a hash-based...

chapter

Applying parameterized model checking to real-life cache coherence protocols

Vladimir Burenkov, Alexander Kamkin

2016 IEEE East-West Design & Test Symposium (EWDTS) > 1 - 4

2016 IEEE East-West Design & Test Symposium (EWDTS)

This paper overviews a technique for verifying cache coherence protocols described in the Promela language. The approach is comprised of the following steps. First, a model written for a certain configuration of the memory system is generalized to the model being parameterized with the number of processors. Second, the parameterized model is abstracted from the exact number of processors. Finally,...

chapter

A Code Selection Mechanism Using Deep Learning

Hang Cui, Shoichi Hirasawa, Hiroyuki Takizawa, Hiroaki Kobayashi

2016 IEEE 10th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSOC) > 385 - 392

2016 IEEE 10th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)

Sparse Matrix-Vector multiplication (SpMV) is a computational kernel widely used in many applications. There are many different implementations using different processors and algorithms for SpMV. The performances of different SpMV implementations are quite different, and it is basically difficult to choose the implementation that has the best performance for a given sparse matrix and a given platform...

chapter

How Parallelization Helps Crowd Simulation: Study of an OpenMP-Based System

Edwin Lobo-Hernandez, Xun Luo, Gustavo Alomia-Penafiel, Nan Liu, more

2016 International Conference on Virtual Reality and Visualization (ICVRV) > 354 - 357

2016 International Conference on Virtual Reality and Visualization (ICVRV)

This paper analyzes the parallelization efficiency of Menge [1], an open source virtual crowd simulation system widely used for algorithm benchmarking, with focuses on three aspects: performance of the existing parallel processing scheme, bottleneck of parallel processing, and improvement opportunities for parallel efficiency of the system. First, we calculate the speedup ratio of each Menge module...

chapter

Optimal allocation of power — Graphical interpretation of some scheduling problem

Rafal Rozycki, Grzegorz Waligora

2016 21st International Conference on Methods and Models in Automation and Robotics (MMAR) > 975 - 980

2016 21st International Conference on Methods and Models in Automation and Robotics (MMAR)

We consider a practical makespan minimization problem that arises in a multiprocessor computer system where some processors may be shut down during computation to save an amount of shared power. The system consists of m processors driven by a common power source. The processors are modeled as a set of identical parallel machines. Moreover, we consider a set of n independent, nonpreemptive jobs which...

chapter

Significance of parallel computation over serial computation

Shubhangi Rastogi, Hira Zaheer

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 2307 - 2310

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)

In today's scenario there is a need of fast computers to perform huge tasks in less time. In serial computation one task will be done after another but it takes more time. On the other hand, time taken by a computation problem can be reduced by performing several operations simultaneously. Parallel computing [4,8,9] is the concurrent use of multiple resources to solve a single problem. A computational...

chapter

A new task scheduling method for 2 level load balancing in homogeneous distributed system

Lipika Datta

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) > 4320 - 4325

2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT)

A distributed system consists of several autonomous nodes. In a distributed system some of the nodes may be overloaded due to a large number of job arrivals while other nodes may remain idle without any processing. The performance of a distributed system depends crucially on dividing up work effectively among the computing nodes. So a way is needed to share load across all the computing nodes. In...

chapter

Novel source-to-source compiler approach for the automatic parallelization of codes based on the method of moments

Hipolito Gomez-Sousa, Manuel Arenaz, Oscar Rubinos-Lopez, Jose Angel Martinez-Lorenzo

2015 9th European Conference on Antennas and Propagation (EuCAP) > 1 - 6

2015 9th European Conference on Antennas and Propagation (EuCAP)

In computational electromagnetics, surface integral equation (SIE) formulations are widely used to predict the electromagnetic scattering from arbitrary structures. These SIE formulations are discretized into a matrix form by the well-known method of moments (MoM). Up to now, the lack of proper compilers made it necessary for the MoM codes to be parallelized by hand in order to obtain reasonable performance...

chapter

Distributed parallel computing technique for EM modeling

Jianan Zhang, Kai Ma, Feng Feng, Zhihao Zhao, more

2015 IEEE MTT-S International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization (NEMO) > 1 - 3

2015 IEEE MTT-S International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization (NEMO)

This paper proposes a novel distributed parallel EM modeling technique to speed up the process of neural network modeling for EM structures. Existing techniques for EM modeling usually need to repeatedly change the parameters of microwave devices and drive the EM simulator to obtain sufficient training and testing samples. As the complexity in EM modeling problem increases, traditional techniques...

chapter

Towards self-adaptive MPSoC systems with adaptivity throttling

Wei Quan, Andy D. Pimentel

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) > 157 - 164

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

Today's multi-processor system-on-chip (MPSoC) systems increasingly have to deal with dynamically changing application workload scenarios. To cope with such dynamic application behavior, these systems could dynamically adapt the mapping of application tasks onto the underlying system resources to improve the system's performance. However, such performance improvement comes at the cost of a system...

chapter

Software fault tolerance for FPUs via vectorization

Zhi Chen, Ryoichi Inagaki, Alexandru Nicolau, Alexander V. Veidenbaum

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) > 203 - 210

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

Future generation processors are expected to have high soft error rates and will require increased fault detection and fault tolerance. This work focuses on errors in execution units. Hardware or software duplication or triplication, parity, or residue codes could be used to detect errors in execution units. However, hardware duplication/triplication have significant area overhead and, in applications...

chapter

Complex network analysis on distributed systems — An empirical comparison

Jannis Koch, Christian L. Staudt, Maximilian Vogel, Henning Meyerhenke

2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) > 1169 - 1176

2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)

Complex networks are relational data sets commonly represented as graphs. The analysis of their intricate structure is relevant to many areas of science and commerce, and data sets may reach sizes that require distributed storage and processing. We describe and compare programming models for distributed computing with a focus on graph algorithms for large-scale complex network analysis. Four frameworks...

chapter

A fast algorithm of compressed sensing

Hao Liu, Yuehaoyan, Junhai Wang

2014 11th International Computer Conference on Wavelet Actiev Media Technology and Information Processing(ICCWAMTIP) > 463 - 467

2014 11th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)

After studying the compressed sensing theory and its main reconstruction algorithm-Matching Pursuit (MP) algorithm, this paper proposes a new approach to improve the speed of MP algorithm, and it describes how to build a Beowulf parallel computing system with 8 PCs. Its parallel computations is implemented by Message-Passing-Interface(MPI), and a 100Mb/s high speed Ethernet network interconnects all...

chapter

Introducing A-Cell for Scalable and Portable SIMD Programming

Hamed Khandan

2014 IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs > 275 - 280

2014 IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs (MCSoC)

A-Cell is a high-level abstraction of fine-grained parallelism specifically designed to be applicable to all range of parallel devices from super computers based on CPUs or GPUs, to network of embedded devices. To achieve this, A-Cell adopts a programming model called "connectionist computing" and with that takes a leap step away from Turing programming model. Also, in contrast with most...

INFONA - science communication portal

Search results

Dynamic Checkpointing Policy in Heterogeneous Real-Time Standby Systems

Optimal Algorithms for a Mesh-Connected Computer with Limited Additional Global Bandwidth

The PARSEC benchmark suite: Characterization and architectural implications

The $t/s$ -Diagnosability of Hypercube Networks Under the PMC and Comparison Models

A Parallel K-Medoids Algorithm for Clustering based on MapReduce

Left-Preconditioned Communication-Avoiding Conjugate Gradient Methods for Multiphase CFD Simulations on the K Computer

Experimental Validation and Exploration of a New Kind of Synchronization in Linux

Applying parameterized model checking to real-life cache coherence protocols

A Code Selection Mechanism Using Deep Learning

How Parallelization Helps Crowd Simulation: Study of an OpenMP-Based System

Optimal allocation of power — Graphical interpretation of some scheduling problem

Significance of parallel computation over serial computation

A new task scheduling method for 2 level load balancing in homogeneous distributed system

Novel source-to-source compiler approach for the automatic parallelization of codes based on the method of moments

Distributed parallel computing technique for EM modeling

Towards self-adaptive MPSoC systems with adaptivity throttling

Software fault tolerance for FPUs via vectorization

Complex network analysis on distributed systems — An empirical comparison

A fast algorithm of compressed sensing

Introducing A-Cell for Scalable and Portable SIMD Programming

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options