Search results

Items from 1 to 20 out of 30 results

chapter

MPI Process and Network Device Affinitization for Optimal HPC Application Performance

Ravindra Babu Ganapathi, Aravind Gopalakrishnan, Russell W. McGuire

2017 IEEE 25th Annual Symposium on High-Performance Interconnects (HOTI) > 80 - 86

2017 IEEE 25th Annual Symposium on High-Performance Interconnects (HOTI)

High Performance Computing(HPC) applications are highly optimized to maximize allocated resources for the job such as compute resources, memory and storage. Optimal performance for MPI applications requires the best possible affinity across all the allocated resources. Typically, setting process affinity to compute resources is well defined, i.e MPI processes on a compute node have processor affinity...

chapter

Evaluating HPC Networks via Simulation of Parallel Workloads

Nikhil Jain, Abhinav Bhatele, Sam White, Todd Gamblin, more

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis > 154 - 165

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

This paper presents an evaluation and comparison of three topologies that are popular for building interconnection networks in large-scale supercomputers: torus, fat-tree, and dragonfly. To perform this evaluation, we propose a comprehensive methodology and present a scalable packet-level network simulator, TraceR. Our methodology includes design of prototype systems that are being evaluated, use...

chapter

LOREN: A Scalable Routing Method for Layout-Conscious Random Topologies

Ryuta Kawano, Hiroshi Nakahara, Ikki Fujiwara, Hiroki Matsutani, more

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 9 - 18

2016 Fourth International Symposium on Computing and Networking (CANDAR)

End-to-end network latency has become an important issue for parallel application on large-scale high performance computing (HPC) systems. It has been reported that randomly-connected inter-switch networks can lower the end-to-end network latency. The trade-off is a large amount of routing information. For irregular networks, minimal routing is achieved by using routing tables for all destinations...

chapter

Decentralised, dynamic network path selection in high performance computing

John Anderson, Matt Piazza, Aspen Olmsted

2016 International Conference on Information Society (i-Society) > 88 - 90

2016 International Conference on Information Society (i-Society)

In this paper, we investigate the problem of providing highly available, decentralized, dynamic path selection in high performance computing networking. We look at a use case for dynamic path selection that better utilizes bandwidth available in the network. The network architecture we propose is a partial mesh grid whereby each host is directly connected to four forwarding devices. We propose an...

chapter

Torus network labeling in High Performance computing

Mayuresh Dhanak, Parikshit D. Godbole, R. A. Patil

2016 International Conference on Computing Communication Control and automation (ICCUBEA) > 1 - 4

2016 International Conference on Computing Communication Control and automation (ICCUBEA)

Two prime network interconnection topology used today in High Performance computing (HPC) are the fat tree and the torus topology. But due to the various advantages of torus network over fat tree, currently many HPC networks using fat tree are turning to the torus topology. In fat tree topology the switches have high end functionalities. Suppose a packet is traversing from the source to destination...

chapter

Hypercube based clusters in Cloud Computing

Amin Sahba, John J. Prevost

2016 World Automation Congress (WAC) > 1 - 6

2016 World Automation Congress (WAC)

High performance computing (HPC) means the aggregation of computational power to increase the ability of processing large problems in science, engineering, and business. HPC on the cloud allows performing on demand HPC tasks by high performance clusters in a cloud environment. The connection structure of the nodes in HPC clusters should provide fast internode communication. It is important that scalability...

chapter

ACRO: Assignment of channels in reverse order to make arbitrary routing deadlock-free

Ryuta Kawano, Hiroshi Nakahara, Seiichi Tade, Ikki Fujiwara, more

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS) > 1 - 6

2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS)

Distributed routing methods with small routing tables are scalable design on irregular networks for large-scale High Performance Computing (HPC) systems. Recently proposed compact routing methods, however, do not guarantee deadlock-freedom. Cyclic channel dependencies on arbitrary routing are typically removed with multiple Virtual Channels (VCs). However, challenges still remain to provide good trade-offs...

chapter

Transitively Deadlock-Free Routing Algorithms

Jean-Noel Quintin, Pierre Vigneras

2016 2nd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era (HiPINEB) > 16 - 24

2016 2nd IEEE International Workshop on High-Performance Interconnection Networks in the Exascale and Big-Data Era (HiPINEB)

In exascale platforms, faults are likely to occur more and more frequently due to the huge number of components. To handle them, the BXI fabric management uses a generic architecture that specifies two distinct modes of operations: offline mode computes, validates and uploads nominal routing tables, while online mode reacts at runtime to failures and recoveries by computing small patches and by uploading...

chapter

Suitability of the Random Topology for HPC Applications

Fabien Chaix, Ikki Fujiwara, Michihiro Koibuchi

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 301 - 304

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

With each technology improvement, parallel systems get larger, and the impact of interconnection networks becomes more prominent. Random topologies and their variants received more and more attention lately due to their low diameter, low average shortest path length and high scalability. However, existing supercomputers still prefer torus and fat-tree topologies, because a number of existing parallel...

chapter

Suitability of the Random Topology for HPC Applications

Fabien Chaix, Ikki Fujiwara, Michihiro Koibuchi

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 301 - 304

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

chapter

Links as a service (LaaS): Guaranteed tenant isolation in the shared cloud

Eitan Zahavi, Alexander Shpiner, Ori Rottenstreich, Avinoam Kolodny, more

2016 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS) > 87 - 98

2016 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS)

The most demanding tenants of shared clouds require complete isolation from their neighbors, in order to guarantee that their application performance is not affected by other tenants. Unfortunately, while shared clouds can offer an option whereby tenants obtain dedicated servers, they do not offer any network provisioning service, which would shield these tenants from network interference. In this...

chapter

Processing big trajectory and Twitter data streams using Apache STORM

Dragan Stojanovic, Natalija Stojanovic, Jovan Turanjanin

2015 12th International Conference on Telecommunication in Modern Satellite, Cable and Broadcasting Services (TELSIKS) > 301 - 304

2015 12th International Conference on Telecommunication in Modern Satellite, Cable and Broadcasting Services (TELSIKS)

In this paper, we present research work related to processing and analysis of big trajectory and Twitter data streams using Apache Storm framework. We present the TrafficStorm application implemented as Storm topology, and describe its implementation on a cluster of commodity computers. TrafficStorm performs processing of big trajectory data streams related to users moving over a street network, as...

chapter

Fault-Tolerant Routing for Exascale Supercomputer: The BXI Routing Architecture

Pierre Vigneras, Jean-Noel Quintin

2015 IEEE International Conference on Cluster Computing > 793 - 800

2015 IEEE International Conference on Cluster Computing (CLUSTER)

BXI, Bull eXascale Interconnect, is the new inter-connection network developed by Atos for High Performance Computing. It has been designed to meet the requirements of exascale supercomputers. At such scale, faults have to be expected and dealt with transparently so that applications remain unaffected by them. BXI features various mechanisms for this purpose, one of which is the BXI routing component...

chapter

Efficient Queuing Schemes for HoL-Blocking Reduction in Dragonfly Topologies with Minimal-Path Routing

Pedro Yebenes, Jesus Escudero-Sahuquillo, Pedro J. Garcia, Francisco J. Quiles

2015 IEEE International Conference on Cluster Computing > 817 - 824

2015 IEEE International Conference on Cluster Computing (CLUSTER)

HPC systems are growing in number of connected endnodes, making the network a main issue in their design. In order to interconnect large systems, dragonfly topologies have become very popular in the latest years as they achieve high scalability by exploiting high-radix switches. However, dragonfly high performance may drop severely due to the Head-of-Line (HoL) blocking effect derived from congestion...

chapter

Intel® Omni-path Architecture: Enabling Scalable, High Performance Fabrics

Mark S. Birrittella, Mark Debbage, Ram Huggahalli, James Kunz, more

2015 IEEE 23rd Annual Symposium on High-Performance Interconnects > 1 - 9

2015 IEEE 23rd Annual Symposium on High-Performance Interconnects (HOTI)

The Intel® Omni-Path Architecture (Intel® OPA) is designed to enable a broad class of computations requiring scalable, tightly coupled CPU, memory, and storage resources. Integration between devices in the Intel® OPA family and Intel® CPUs enable improvements in system level packaging and network efficiency. When coupled with the new user-focused open standard APIs developed by the OpenFabrics Alliance...

chapter

A contextual approach for effective recovery of inter-process communication patterns from HPC traces

Luay Alawneh, Abdelwahab Hamou-Lhadj, Syed Shariyar Murtaza, Yan Liu

2014 Software Evolution Week - IEEE Conference on Software Maintenance, Reengineering, and Reverse Engineering (CSMR-WCRE) > 274 - 282

2014 Software Evolution Week - IEEE Conference on Software Maintenance, Reengineering and Reverse Engineering (CSMR-WCRE)

Studies have shown that understanding of interprocess communication patterns is an enabler to effective analysis of high performance computing (HPC) applications. In previous work, we presented an algorithm for recovering communication patterns from traces of HPC systems. The algorithm worked well on small cases but it suffered from low accuracy when applied to large (and most interesting) traces...

chapter

SymSig: A low latency interconnection topology for HPC clusters

Dhananjay Brahme, Onkar Bhardwaj, Vipin Chaudhary

20th Annual International Conference on High Performance Computing > 462 - 471

2013 20th International Conference on High Performance Computing (HiPC)

This paper presents the underlying theory and the performance of a cluster using a new 2-hop network topology. This topology is constructed using a symmetric equation and Singer Difference Sets and is called SymSig. The degree of connections at each node with SymSig is about half compared to previous methods using Singer Difference Sets. A comparison with a cluster of Clos topology shows significant...

chapter

Data Decomposition for Code Parallelization in Practice: What Do the Experts Need?

Anne Meade, Deva Kumar Deeptimahanti, Michael Johnston, Jim Buckley, more

2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing > 754 - 761

2013 IEEE International Conference on High Performance Computing and Communications (HPCC) & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

Parallelizing serial software systems in order to run in a High Performance Computing (HPC) environment presents many challenges to developers. In particular, the extant literature suggests the task of decomposing large-scale data applications is particularly complex and time-consuming. In order to take stock of the state of practice of data decomposition in HPC, we conducted a two-phased study. Firstly,...

chapter

Cabinet Layout Optimization of Supercomputer Topologies for Shorter Cable Length

Ikki Fujiwara, Michihiro Koibuchi, Henri Casanova

2012 13th International Conference on Parallel and Distributed Computing, Applications and Technologies > 227 - 232

2012 13th International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT)

As the scales of supercomputers increase total cable length becomes enormous, e.g., up to thousands of kilometers. Recent high-radix switches with dozens of ports make switch layout and system packaging more complex. In this study, we study the optimization of the physical layout of topologies of switches on a machine room floor with the goal of reducing cable length. For a given topology, using graph...

chapter

Scalable Performance Predictions of Distributed Peer-to-Peer Applications

Bogdan Florin Cornea, Julien Bourgeois, The Tung Nguyen, Didier El-Baz

2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems > 193 - 201

2012 IEEE 14th Int'l Conf. on High Performance Computing and Communication (HPCC) & 2012 IEEE 9th Int'l Conf. on Embedded Software and Systems (ICESS)

Recently, a new environment for high performance peer-to-peer distributed computing was proposed. This environment, named P2PDC, addresses stable or volatile systems communicating in a decentralized manner using the self-adaptive protocol P2PSAP. P2PDC is devoted to task parallel applications like numerical simulation problems or optimization problems solved via parallel or distributed iterative algorithms...

Keywords:
TOPOLOGY

Publication date

Set your own date range

Publication type

book (29)
article (1)

Keywords

NETWORK TOPOLOGY (15)
ROUTING (11)
BANDWIDTH (8)
SWITCHES (7)
INTERCONNECTION NETWORKS (6)
BENCHMARK TESTING (5)
COMPUTER ARCHITECTURE (5)
ALGORITHM DESIGN AND ANALYSIS (4)
MPI (4)
PORTS (COMPUTERS) (4)
SCALABILITY (4)
SOFTWARE (4)
SUPERCOMPUTERS (4)
SYSTEM RECOVERY (4)
DYNAMIC ANALYSIS (3)
HARDWARE (3)
HYPERCUBES (3)
MESSAGE PASSING (3)
PEER-TO-PEER COMPUTING (3)
PROTOCOLS (3)
APPLICATION PROGRAM INTERFACES (2)
BXI (2)
CLOUD COMPUTING (2)
CLUSTERING METHODS (2)
COMMUNICATION CHANNELS (2)
COMPUTATIONAL MODELING (2)
CONTEXT (2)
DETECTION ALGORITHMS (2)
FABRIC (2)
FABRICS (2)
FAULT TOLERANCE (2)
FAULT TOLERANT SYSTEMS (2)
FAULT-TOLERANT ROUTING (2)
HIGH-RADIX SWITCHES (2)
INTERCONNECT MANAGEMENT (2)
LAYOUT (2)
LIBRARIES (2)
MEASUREMENT (2)
MESSAGE PASSING INTERFACE (2)
MULTIPROCESSOR INTERCONNECTION NETWORKS (2)
NUMERICAL SIMULATION (2)
OPTICAL SWITCHES (2)
PARALLEL PROCESSING (2)
PEER TO PEER COMPUTING (2)
PERFORMANCE ANALYSIS (2)
PERFORMANCE EVALUATION (2)
PROGRAM PROCESSORS (2)
SERVERS (2)
SOCKETS (2)
TASK PARALLEL MODEL (2)
2D DATA STREAMING (1)
ACADEMIC INSTITUTION (1)
ACCURACY (1)
ADAPTIVE MESH SIMPLIFICATION (1)
ADDRESS MECHANISM (1)
APPLICATION PROGRAMMER (1)
APPLICATION SOFTWARE (1)
ARRAYS (1)
BENCHMARK (1)
BIG DATA (1)
BINARY TREE STRUCTURE (1)
BINARY TREES (1)
BIOMEMBRANES (1)
CABINET LAYOUT (1)
CELLULAR MESH OF TORI TOPOLOGY (1)
CELLULAR PROCESSOR (1)
CIRCUIT SWITCHING (1)
CIRCUIT-SWITCHED COMMUNICATION (1)
CLOCKS (1)
CLUSTER (1)
CLUSTER COMPUTING (1)
CLUSTERING ALGORITHMS (1)
COARSE GRAIN (1)
COMMUNICATION BUFFER REUSE (1)
COMMUNICATION LIBRARY FUNCTIONS (1)
COMMUNICATION MODE (1)
COMMUNICATION PATTERN DETECTION (1)
COMPUTER ERRORS (1)
COMPUTER GRAPHICS (1)
COMPUTER SIMULATION (1)
COMPUTERS (1)
COMPUTING ARCHITECTURE (1)
CONGESTION CONTROL (1)
CONTEXT LIKE TOPOLOGY (1)
DATA CENTERS (1)
DATA MANIPULATION (1)
DATA MINING (1)
DATA PROCESSING PERFORMANCE (1)
DATACENTER (1)
DEADLOCK-FREE ROUTING (1)
DECENTRALISED (1)
DELAYS (1)
DIAMETER (1)
DISTRIBUTED APPLICATION (1)
DISTRIBUTED COMPUTING (1)
DISTRIBUTED ITERATIVE METHOD (1)
DOCUMENTATION (1)
DRAGONFLY TOPOLOGY (1)
more

INFONA - science communication portal

Search results

MPI Process and Network Device Affinitization for Optimal HPC Application Performance

Evaluating HPC Networks via Simulation of Parallel Workloads

LOREN: A Scalable Routing Method for Layout-Conscious Random Topologies

Decentralised, dynamic network path selection in high performance computing

Torus network labeling in High Performance computing

Hypercube based clusters in Cloud Computing

ACRO: Assignment of channels in reverse order to make arbitrary routing deadlock-free

Transitively Deadlock-Free Routing Algorithms

Suitability of the Random Topology for HPC Applications

Suitability of the Random Topology for HPC Applications

Links as a service (LaaS): Guaranteed tenant isolation in the shared cloud

Processing big trajectory and Twitter data streams using Apache STORM

Fault-Tolerant Routing for Exascale Supercomputer: The BXI Routing Architecture

Efficient Queuing Schemes for HoL-Blocking Reduction in Dragonfly Topologies with Minimal-Path Routing

Intel® Omni-path Architecture: Enabling Scalable, High Performance Fabrics

A contextual approach for effective recovery of inter-process communication patterns from HPC traces

SymSig: A low latency interconnection topology for HPC clusters

Data Decomposition for Code Parallelization in Practice: What Do the Experts Need?

Cabinet Layout Optimization of Supercomputer Topologies for Shorter Cable Length

Scalable Performance Predictions of Distributed Peer-to-Peer Applications

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options