Wyniki wyszukiwania dla: K. Kandalla

Pozycje od 1 do 12 spośród 12 wyników

artykuł

M-CSF instructs both cell division and cell identity in HSC through independent transcription factor circuits

Sandrine Sarrazin, Prashanth K. Kandalla, Noushine Mossadegh-Keller, Leon Espinosa, więcej

Experimental Hematology > 2015 > 43 > 9(S) > S93

rozdział

A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-blocking Alltoallv Collective on Multi-core Systems

K. Kandalla, H. Subramoni, K. Tomko, D. Pekurovsky, więcej

2013 42nd International Conference on Parallel Processing > 611 - 620

2013 42nd International Conference on Parallel Processing (ICPP)

Non-blocking collectives have been recently standardized by the Message Passing Interface (MPI) Forum. However, intelligent designs offered by the MPIcommunication runtimes are likely to be the key factors that drive their adoption. While hardware based solutions for non-blocking collective operations have shown promise, they require specialized hardware support and currently have several performance...

rozdział

Design of network topology aware scheduling services for large InfiniBand clusters

H. Subramoni, D. Bureddy, K. Kandalla, K. Schulz, więcej

2013 IEEE International Conference on Cluster Computing (CLUSTER) > 1 - 8

2013 IEEE International Conference on Cluster Computing (CLUSTER)

The goal of any scheduler is to satisfy user's demands for computation and achieve a good performance in overall system utilization by efficiently assigning jobs to resources. However, the current state-of-the-art scheduling techniques do not intelligently balance node allocation based on the total bandwidth available between switches - that leads to over subscription. Additionally, poor placement...

rozdział

Designing Optimized MPI Broadcast and Allreduce for Many Integrated Core (MIC) InfiniBand Clusters

K. Kandalla, A. Venkatesh, K. Hamidouche, S. Potluri, więcej

2013 IEEE 21st Annual Symposium on High-Performance Interconnects > 63 - 70

2013 IEEE 21st Annual Symposium on High-Performance Interconnects (HOTI)

The emergence of co-processors such as Intel Many Integrated Cores (MICs) is changing the landscape of supercomputing. The MIC is a memory constrained environment and its processors also operate at slower clock rates. Furthermore, the communication characteristics between MIC processes are also different compared to communication between host processes. Communication libraries that do not consider...

rozdział

Optimized MPI Gather Collective for Many Integrated Core (MIC) InfiniBand Clusters

A. Venkatesh, K. Kandalla, Dhabaleswar K. Panda

2013 Extreme Scaling Workshop (xsw 2013) > 58 - 63

2013 Extreme Scaling Workshop (XSW)

Xeon Phi coprocessors are gaining popularity in the high performance computing community owing to its rendition of a highly parallel environment and X86 compatibility. The coprocessors, which conform to Intel's Many Integrated Core (MIC) architecture, are being deployed at large scale also because they yield a high performance per Watt. Each Xeon Phi coprocessor, despite offering 1 Teraflop performance,...

rozdział

Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes

H. Subramoni, S. Potluri, K. Kandalla, B. Barth, więcej

2012 International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 12

2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis

Over the last decade, InfiniBand has become an increasingly popular interconnect for deploying modern supercomputing systems. However, there exists no detection service that can discover the underlying network topology in a scalable manner and expose this information to runtime libraries and users of the high performance computing systems in a convenient way. In this paper, we design a novel and scalable...

rozdział

Can Network-Offload Based Non-blocking Neighborhood MPI Collectives Improve Communication Overheads of Irregular Graph Algorithms?

K. Kandalla, A. Buluc, H. Subramoni, K. Tomko, więcej

2012 IEEE International Conference on Cluster Computing Workshops > 222 - 230

2012 IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS)

Graph-based computations are commonly used across various data intensive computing domains ranging from social networks to biological systems. On distributed memory systems, graph algorithms involve explicit communication between processes and often exhibit sparse, irregular behavior. Minimizing these communication overheads is critical to cater to the graph-theoretic analyses demands of emerging...

rozdział

Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers

K. Kandalla, U. Yang, J. Keasler, T. Kolev, więcej

2012 IEEE 26th International Parallel and Distributed Processing Symposium > 1156 - 1167

2012 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Scientists across a wide range of domains increasingly rely on computer simulation for their investigations. Such simulations often spend a majority of their run-times solving large systems of linear equations that require vast amounts of computational power and memory. It is hence critical to design solvers in a highly efficient and scalable manner. Hypre is a high performance, scalable software...

rozdział

Designing Network Failover and Recovery in MPI for Multi-Rail InfiniBand Clusters

S. Pai Raikar, H. Subramoni, K. Kandalla, J. Vienne, więcej

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 1160 - 1167

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

The emerging trends of designing commodity based supercomputing systems have a severe detrimental impact on the Mean-Time-Between-Failures (MTBF). The MTBF for typical HEC installations is currently estimated to be between eight hours and fifteen days. Failures in the interconnect fabric account for a fair share of the total failures occurring in such systems. This will continue to degrade as system...

rozdział

Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters

H. Subramoni, K. Kandalla, J. Vienne, S. Sur, więcej

2011 IEEE International Conference on Cluster Computing > 317 - 325

2011 IEEE International Conference on Cluster Computing (CLUSTER)

It is an established fact that the network topology can have an impact on the performance of scientific parallel applications. However, little work has been done to design an easy to use solution inside a communication library supporting a parallel programming model where the complexities of making the application performance network topology agnostic is hidden from the end user. Similarly, the rapid...

rozdział

Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL

K. Kandalla, H. Subramoni, J. Vienne, S. Pai Raikar, więcej

2011 IEEE 19th Annual Symposium on High Performance Interconnects > 27 - 34

2011 IEEE 19th Annual Symposium on High-Performance Interconnects (HOTI)

The upcoming MPI-3.0 standard is expected to include non-blocking collective operations. Non-blocking collectives offer a new MPI interface, using which an application can decouple the initiation and completion of collective operations. However, to be effective, the MPI library should provide a high performance and scalable implementation. One of the major challenges in designing an effective non-blocking...

rozdział

Designing multi-leader-based Allgather algorithms for multi-core clusters

K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop, więcej

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 8

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

The increasing demand for computational cycles is being met by the use of multi-core processors. Having large number of cores per node necessitates multi-core aware designs to extract the best performance. The Message Passing Interface (MPI) is the dominant parallel programming model on modern high performance computing clusters. The MPI collective operations take a significant portion of the communication...

Opcje filtrowania

Data publikacji

Ustaw własny zakres dat

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: K. Kandalla

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zbiór danych

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu