2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

chapter

GPU Peer-to-Peer Techniques Applied to a Cluster Interconnect

Roberto Ammendola, Massimo Bernaschi, Andrea Biagioni, Mauro Bisson, more

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 806 - 815

Modern GPUs support special protocols to exchange data directly across the PCI Express bus. While these protocols could be used to reduce GPU data transmission times, basically by avoiding staging to host memory, they require specific hardware features which are not available on current generation network adapters. In this paper we describe the architectural modifications required to implement peer-to-peer...

chapter

Direct MPI Library for Intel Xeon Phi Co-Processors

Min Si, Yutaka Ishikawa, Masamichi Tatagi

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 816 - 824

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

DCFA-MPI is an MPI library implementation for Intel Xeon Phi co-processor clusters, where a compute node consists of an Intel Xeon Phi co-processor card connected to the host via PCI Express with InfiniBand. DCFA-MPI enables direct data transfer between Intel Xeon Phi co-processors without assistance from the host. Since DCFA, a direct communication facility for many-core based accelerators, provides...

chapter

HiCOMB Introduction

Jaroslaw Zola, David A. Bader, Srinivas Aluru

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 499 - 500

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

chapter

A Comparison of Ruleset Feature Independent Packet Classification Engines on FPGA

Andrea Sanny, Thilan Ganegedara, Viktor K. Prasanna

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 124 - 133

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Packet classification is used in network firewalls to identify and filter threats or unauthorized network access at the application level. This is realized by comparing incoming packet headers against a predefined rule set. Many solutions to packet classification are available, but most of these solutions exploit some features of the rule set in order to minimize the memory footprint of rule set storage...

chapter

RTL Simulation of High Performance Dynamic Reconfiguration: A Video Processing Case Study

Lingkan Gong, Oliver Diessel, Johny Paul, Walter Stechele

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 106 - 113

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Dynamically Reconfigurable Systems (DRS) allow hardware logic to be partially reconfigured while the rest of the design continues to operate. For example, the Auto Vision driver assistance system swaps video processing engines when the driving conditions change. However, the architectural flexibility of DRS also introduces challenges for verifying system functionality. Using Auto Vision as a case...

chapter

Hardware Supported Adaptive Data Collection for Networks on Chip

Jan Heisswolf, Andreas Weichslgartner, Aurang Zaib, Ralf Konig, more

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 153 - 162

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Managing future many-core architectures with hundreds of cores, running multiple applications in parallel, is very challenging. One of the major reasons is the communication overhead required to handle such a large system. Distributed management is proposed to reduce this overhead. The architecture is divided into regions which are managed separately. The instance managing the region and the applications...

chapter

A Flexible Memory Controller Supporting Deep Belief Networks with Fixed-Point Arithmetic

Jingfei Jiang, Rongdong Hu, Mikel Lujan

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 144 - 152

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Deep Belief Networks (DBNs) are state-of-art Machine Learning techniques and one of the most important unsupervised learning algorithms. Training DBNs is computationally intensive which naturally leads to investigate FPGA acceleration. Fixed-point arithmetic can have an important influence on the execution time and prediction accuracy of a DBN. Previous studies have focused only on customized DBN...

chapter

Architecture Exploration of High-Performance Floating-Point Fused Multiply-Add Units and their Automatic Use in High-Level Synthesis

Bjorn Liebig, Jens Huthmann, Andreas Koch

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 134 - 143

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Multiply-add operations form a crucial part of many digital signal processing and control engineering applications. Since their performance is crucial for the application-level speed-up, it is worthwhile to explore a wide spectrum of implementations alternatives, trading increased area/energy usage to speed-up units on the critical path of the computation. This paper examines existing solutions and...

chapter

Wire Speed IPv6 Forwarding on Multi-core Platforms

Thilan Ganegedara, Viktor K. Prasanna

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 2246 - 2249

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

With the exhaustion of IPv4 (32 bit) address space, IPv6 (128 bit) addressing is emerging to facilitate the immense growth of the Internet. However, this poses two main challenges to high-speed routers that perform packet forwarding: 1) increased IP lookup complexity and 2) increased routing table storage requirements. In this work, we present a high-performance IPv6 lookup engine based on routing...

chapter

Discrete Min-Energy Scheduling on Restricted Parallel Processors

Xibo Jin, Fa Zhang, Zhiyong Liu

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 2226 - 2229

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Different from the previous work on energy-efficient algorithms, which focused on assumption that a task can be assigned to any processor, we study the problem of task Scheduling with the objective of Energy Minimization on Restricted Parallel Processors (SEMRPP). Restriction accounts for affinities between tasks and processors, that is, a task has its own eligible processing set of processors. It...

chapter

A Compression Framework for Multidimensional Scientific Datasets

Tekin Bicer, Gagan Agrawal

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 2250 - 2253

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Scientific simulations and instruments can generate tremendous amount of data in short time periods. Since the generated data is used for inferring new knowledge, it is important to efficiently store and provide it to the scientific endeavors. Although parallel and distributed systems can help to ease the management of such data, the transmission and storage are still challenging problems. Compression...

chapter

Efficient Parallel and Distributed Algorithms for GIS Polygonal Overlay Processing

Satish Puri, Sushil K. Prasad

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 2238 - 2241

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Polygon overlay is one of the complex operations in Geographic Information Systems (GIS). In GIS, a typical polygon tends to be large in size often consisting of thousands of vertices. Sequential algorithms for this problem are in abundance in literature and most of the parallel algorithms concentrate on parallelizing edge intersection phase only. Our research aims to develop parallel algorithms to...

chapter

Exploiting Content Similarity to Improve Memory Performance in Large-Scale High-Performance Computing Systems

Scott Levy

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 2258 - 2261

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

As we consider building the next generation of extreme-scale systems, many of the biggest challenges are related to memory characteristics. In particular, overcoming challenges related to resilience and memory bandwidth will require innovative strategies for improving the performance of main memory. In this paper, we propose to exploit memory content similarity to improve memory performance. We begin...

chapter

Identifying High betweenness Centrality Vertices in Large Noisy Networks

Vladimir Ufimtsev, Sanjukta Bhowmick

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 2234 - 2237

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Most real-world network models inherently include some degree of noise due to the approximations involved in measuring real-world data. My thesis focuses on studying how these approximations affect the stability of the networks. In this paper, we focus on the stability of betweenness centrality (BC), a metric used to measure the importance of the vertices in the network. We present our results on...

chapter

MapReducing GEPETO or Towards Conducting a Privacy Analysis on Millions of Mobility Traces

Sebastien Gambs, Marc-Olivier Killijian, Izabela Moise, Miguel Nunez del Prado Cortez

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 1937 - 1946

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

GEPETO (for GEoPrivacy-Enhancing Toolkit) is a flexible software that can be used to visualize, sanitize, perform inference attacks and measure the utility of a particular geolocated dataset. The main objective of GEPETO is to enable a data curator (e.g., a company, a governmental agency or a data protection authority) to design, tune, experiment and evaluate various sanitization algorithms and inference...

chapter

BPS: A Performance Metric of I/O System

Shuibing He, Xian-He Sun, Yanlong Yin

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 1954 - 1962

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

It is known that I/O system rather than CPU and memory is the performance killer of many of the newly emerged data intensive applications. Evaluating and understanding I/O system performance has become a timely issue facing the high performance computing community. Conventional I/O performance metrics, such as Input/Output Operations Per Second (IOPS), bandwidth, response time, etc., are effective...

chapter

InfoStor: Highly Available Distributed Block Store

YongJian Ren, YouQing Lin, JiLin Zhang, Jian Wan, more

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 1981 - 1988

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

In order to adapt to the requirements of the massive scale storage environments, and improve storage space utilization of the data center host, we designed and implemented InfoStor, a heterogeneous environment, distributed block storage system. Through in-band storage virtualization technology that provides the reliability of traditional enterprise arrays with low cost and better scalability; provide...

chapter

Toward a Scalable Heterogeneous Runtime System for the Convey MX Architecture

John D. Leidel, Joe Bolding, Geoffrey Rogers

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 1597 - 1606

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Given the recent advent of the multicore era [1], research efforts in the area of high performance, low latency runtime systems have increased significantly. This research has given birth to new techniques in low-overhead scheduling techniques, small-memory footprint parallel execution units and kernel-free contextual environments. This paper presents a framework and runtime system for a truly heterogeneous...

chapter

Inferring Large-Scale Computation Behavior via Trace Extrapolation

Laura Carrington, Michael A. Laurenzano, Ananta Tiwari

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 1667 - 1674

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Understanding large-scale application behavior is critical for effectively utilizing existing HPC resources and making design decisions for upcoming systems. In this work we present a methodology for characterizing an MPI application's large-scale computation behavior and system requirements using information about the behavior of that application at a series of smaller core counts. The methodology...

chapter

Systematic Reduction of Data Movement in Algebraic Multigrid Solvers

Hormozd Gahvari, William Gropp, Kirk E. Jordan, Martin Schulz, more

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum > 1675 - 1682

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

Algebraic Multigrid (AMG) solvers find wide use in scientific simulation codes. Their ideal computational complexity makes them especially attractive for solving large problems on parallel machines. However, they also involve a substantial amount of data movement, posing challenges to performance and scalability. In this paper, we present an algorithm that provides a systematic means of reducing data...

INFONA - science communication portal

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)

GPU Peer-to-Peer Techniques Applied to a Cluster Interconnect

Direct MPI Library for Intel Xeon Phi Co-Processors

HiCOMB Introduction

A Comparison of Ruleset Feature Independent Packet Classification Engines on FPGA

RTL Simulation of High Performance Dynamic Reconfiguration: A Video Processing Case Study

Hardware Supported Adaptive Data Collection for Networks on Chip

A Flexible Memory Controller Supporting Deep Belief Networks with Fixed-Point Arithmetic

Architecture Exploration of High-Performance Floating-Point Fused Multiply-Add Units and their Automatic Use in High-Level Synthesis

Wire Speed IPv6 Forwarding on Multi-core Platforms

Discrete Min-Energy Scheduling on Restricted Parallel Processors

A Compression Framework for Multidimensional Scientific Datasets

Efficient Parallel and Distributed Algorithms for GIS Polygonal Overlay Processing

Exploiting Content Similarity to Improve Memory Performance in Large-Scale High-Performance Computing Systems

Identifying High betweenness Centrality Vertices in Large Noisy Networks

MapReducing GEPETO or Towards Conducting a Privacy Analysis on Millions of Mobility Traces

BPS: A Performance Metric of I/O System

InfoStor: Highly Available Distributed Block Store

Toward a Scalable Heterogeneous Runtime System for the Convey MX Architecture

Inferring Large-Scale Computation Behavior via Trace Extrapolation

Systematic Reduction of Data Movement in Algebraic Multigrid Solvers

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW)