Search results

Items from 1 to 15 out of 15 results

chapter

Architectures for cloud-based HPC in data centers

Dao Manh Phan Hung, Sunil Manyam Seshadri Naidu, Michael Opoku Agyeman

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)( > 138 - 143

2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA)

The growing demands in IT services for improving efficiency and quality at low cost to handle complex compute requirements has led to the integration of High performance computing (HPC) systems and cloud infrastructure in data centers. Earlier, HPC systems were limited to academic and research institutions and engineering laboratories. However, the emergence of cloud infrastructures and their successful...

chapter

Performance Evaluation of Parallelizing Algorithm Using Spanning Tree for Stream-Based Computing

Guyue Wang, Koichi Wada, Shinichi Yamagiwa

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 497 - 503

2016 Fourth International Symposium on Computing and Networking (CANDAR)

This paper proposes a detailed performance evaluation of an algorithm using spanning tree that automatically exploits the parallelism and determines an execution order of multiple kernel programs in distributed environment. In stream-based computing, efficient parallel execution requires careful scheduling of the invocation of the kernel programs. By mapping a kernel to a node and an I/O stream between...

chapter

A data analysis system processing framework for underwater target detection

Xiaoning Jin, Peiying Zhang

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 1687 - 1691

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

For the traditional processing tasks of underwater data analysis system, including finite impulse response filter banks, fast Fourier transform, conventional beamforming and target tracking, the algorithm flow is relatively steady with little alternative control, which is to say that it is highly parallel. Besides that, the processing tasks need to deal with enormous data, which means great computational...

chapter

On the Resilience of Parallel Sparse Hybrid Solvers

Emmanuel Agullo, Luc Giraud, Mawussi Zounon

2015 IEEE 22nd International Conference on High Performance Computing (HiPC) > 75 - 84

2015 IEEE 22nd International Conference on High Performance Computing (HiPC)

As the computational power of high performance computing (HPC) systems continues to increase by using a huge number of CPU cores or specialized processing units, extreme-scale applications are increasingly prone to faults. Consequently, the HPC community has proposed many contributions to design resilient HPC applications. These contributions may be system-oriented, theoretical or numerical. In this...

chapter

Mars: A 64-core ARMv8 processor

Charles Zhang

2015 IEEE Hot Chips 27 Symposium (HCS) > 1 - 23

2015 IEEE Hot Chips 27 Symposium (HCS)

This article consists of a collection of slides from the author's conference presentation. The following slides are presented to introduce the general features of one of our products, instead of any commitment about it. It is for information purposes only, and may not be incorporated into any contract. It is not suggested to make purchasing decisions accordingly. The development, release, and timing...

chapter

Evaluation of control loop statements power efficiency: An experimental study

Muhammad Al-Hashimi, Mostafa Saleh, Osama Abulnaja, Naif Aljabri

2014 9th International Conference on Informatics and Systems > PDC-45 - PDC-48

2014 9th International Conference on Informatics and Systems (INFOS)

Massively parallel computers have found significant interest from researchers in recent years. These machines have been used for complex and sophisticated simulations, such as, the brain functions simulation. Due to the increase in the power demand by massively parallel computers and it's move into the exascale computing in future, temperature and power consumption has become a major constrains, many...

chapter

A heterogeneous platform with GPU and FPGA for power efficient high performance computing

Qiang Wu, Yajun Ha, Akash Kumar, Shaobo Luo, more

2014 International Symposium on Integrated Circuits (ISIC) > 220 - 223

2014 International Symposium on Integrated Circuits (ISIC)

Heterogeneous computing is gaining attention from both industry and academia nowadays. One driving factor for heterogeneous computing is the power efficiency. GPU and FPGA have been reported to achieve much higher power efficiency over CPU on many applications. Comparisons between GPU and FPGA show different characteristics of GPU and FPGA in accelerated computing. Some tasks run better on GPU, some...

chapter

Towards on high performance computing of medical imaging based on graphical processing units

K. Suresh, M. Rajasekhara Babu

2013 15th International Conference on Advanced Computing Technologies (ICACT) > 1 - 6

2013 15th International Conference on Advanced Computing Technologies (ICACT)

The Design of GPU(Graphical Processing Unit) will well suitable for express the data parallel computations because GPU will specialized for parallel and today's digital images in medical are huge volume of collections in every day, however medical imaging produces demand to improve the medical diagnosis and procedures. This survey is provide graphical processing computations and hardware require to...

chapter

Optical interconnects for high performance computing

Marc A. Taubenblatt

IEEE Photonic Society 24th Annual Meeting > 668 - 669

2011 IEEE Photonics Conference (IPC 2011)

High performance computing is a new and rapidly growing optical interconnect market needed to address this segment's steadily increasing bandwidth needs. This tutorial will discuss the trends, requirements, trade-offs and technology for this market.

chapter

Experience Applying Fortran GPU Compilers to Numerical Weather Prediction

T. Henderson, J. Middlecoff, J. Rosinski, M. Govett, more

2011 Symposium on Application Accelerators in High-Performance Computing > 34 - 41

2011 Symposium on Application Accelerators in High-Performance Computing (SAAHPC)

Graphics Processing Units (GPUs) have enabled significant improvements in computational performance compared to traditional CPUs in several application domains. Until recently, GPUs have been programmed using C/C++ based methods such as CUDA (NVIDIA) and OpenCL (NVIDIA and AMD). Using these approaches, Fortran Numerical Weather Prediction (NWP) codes would have to be completely re-written to take...

chapter

A Case Study of SWIM: Optimization of Memory Intensive Application on GPGPU

Wei Yi, Yuhua Tang, Guibin Wang, Xudong Fang

2010 3rd International Symposium on Parallel Architectures, Algorithms and Programming > 123 - 129

Third International Symposium on Parallel Architectures, Algorithms and Programming (PAAP 2010)

Recently, GPGPU has been adopted well in the High Performance Computing (HPC) field. The limited global memory bandwidth poses a great challenge to many GPGPU programmers trying to exploit parallelism within the CPU-GPU heterogeneous platform. In this paper, we choose SWIM, a typical memory intensive application from the SPEC OMP 2001 benchmark suite, for case study. We attempt to optimize the performance...

chapter

Multiple-GPUs Algorithm for Lattice Boltzmann Method

Jifu Zhou, Chengwen Zhong, Jianfei Xie, Shiqun Yin

2008 International Symposium on Information Science and Engineering > 2 > 793 - 796

2008 International Symposium on Information Science and Engineering (ISISE)

It is studied about parallel algorithm of lattice Boltzmann method. The data's arrangement, commutation and computational progress are redesigned in a marriage of message passing interface and general purpose graphic processing Units. On the single-GPU, novel techniques appearing in shader model 3.0 such as frame buffer object (FBO), multiple-channels-rendering and, rendering-to-textures are used...

chapter

Micro-architecture of Godson-3 multi-core processor

Weiwu Hu, Jian Wang, Xiang Gao, Yunji Chen

2008 IEEE Hot Chips 20 Symposium (HCS) > 1 - 31

2008 IEEE Hot Chips 20 Symposium (HCS)

This article consists of a collection of slides from the authors' conference presentation. Some of the topics discussed include: a brief introduction to Godson processors; the architecture of the Godson-3 multicore processor; physical implementation; and PetaFLOPS and TeraFLOPs.

chapter

A PLD Architecture for High Performance Computing

Naoki Hirakawa, Masanori Yoshihara, Kazuya Tanigawa, Tetsuo Hironaka, more

2008 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems > 35 - 42

2008 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA 2008)

In recent years, Field Programmable Gate Arrays (FPGAs) have been used for High Performance Computing (HPC). Because there is a significantly difference between configuration speed of FPGA and execution speed of Central Processing Unit (CPU), the difference causes performance degradation. To resolve of this problem, we proposed MPLD as a new Programmable Logic Device (PLD) architecture with high speed...

chapter

Communication performance of a modular high-bandwidth multiprocessor system

Fong Pong, Nian-Feng Tzeng, K. Oner, Chun Ning, more

2007 International Conference on Parallel and Distributed Systems > 2 > 1 - 8

2007 International Conference on Parallel and Distributed Systems

This article deals with communication performance of a multiprocessor system implemented using award-wining BCM 1480 multi-core chips. Our system uses high-performance HyperTransport links to interconnect constituent chips, realizing cache-coherent non-uniform memory access. It takes advantage of hardware support from the BCM 1480 chip to attain very impressive communication performance among constituent...

Filter options

Keywords:
HIGH PERFORMANCE COMPUTING

Publication date

Set your own date range

Keywords

GRAPHICS PROCESSING UNITS (5)
COMPUTATIONAL MODELING (4)
COMPUTER ARCHITECTURE (3)
HARDWARE (3)
KERNEL (3)
BANDWIDTH (2)
COPROCESSORS (2)
FIELD PROGRAMMABLE GATE ARRAYS (2)
FPGA (2)
GPU (2)
GRAPHICS PROCESSING UNIT (2)
HPC (2)
MEMORY MANAGEMENT (2)
MULTICORE PROCESSING (2)
OPTIMIZATION (2)
PERFORMANCE EVALUATION (2)
POWER DEMAND (2)
PROTOTYPES (2)
SCALABILITY (2)
ALGORITHMS (1)
ARRAY SIGNAL PROCESSING (1)
CLOUD (1)
CLOUD COMPUTING (1)
CLUSTERING ALGORITHMS (1)
CMOS INTEGRATED CIRCUITS (1)
CMOS TECHNOLOGY (1)
COMMUNICATION PERFORMANCE (1)
COMPUTATIONAL EFFICIENCY (1)
COMPUTE (1)
COMPUTE UNIFIED DEVICE ARCHITECTURE (1)
COMPUTED TOMOGRAPHY (1)
COMPUTER GRAPHIC EQUIPMENT (1)
COMPUTER INTERFACES (1)
COMPUTERS (1)
COMPUTING (1)
CONCURRENT COMPUTING (1)
CONFIGURATION METHOD (1)
CONTEXT (1)
CPU (1)
CPU CORE (1)
CPU-GPU HETEROGENEOUS PLATFORM (1)
DATA ANALYSIS (1)
DATA MODELS (1)
DATA TRANSFER (1)
DEBUGGING (1)
DELAY (1)
DIGITAL BEAMFORMING (1)
EASY PARTIAL CONFIGURATION (1)
EDUCATIONAL INSTITUTIONS (1)
ENERGY CONSUMPTION (1)
ESTIMATION (1)
EXASCALE COMPUTING (1)
FAULT TOLERANCE (1)
FIBER OPTICS (1)
FIELD PROGRAMMABLE GATE ARRAY (1)
FRAME BUFFER OBJECT (1)
GENERAL PURPOSE GRAPHIC PROCESSING UNITS (1)
GENERAL-PURPOSE COMPUTATION ON GPU (1)
GMRE (1)
GPGPU (1)
GRAPHIC PROCESSOR UNIT (1)
HETEROGENEOUS COMPUTING (1)
HIGH SPEED CONFIGURATION (1)
HYPERTRANSPORT LINK (1)
INFORMATION SCIENCE (1)
INSTANCE-VELOCITY VECTOR (1)
INSTRUCTION SETS (1)
JOINING PROCESSES (1)
KERNEL FUSION (1)
KERNEL PROCESSING (1)
KRYLOV (1)
LATTICE BOLTZMANN METHOD (1)
LATTICE BOLTZMANN METHODS (1)
LINEAR ALGEBRA (1)
LINEAR SYSTEMS (1)
LOGIC CIRCUITS (1)
LOGIC DESIGN (1)
MATHEMATICAL MODEL (1)
MATRIX TRANSPOSITION (1)
MEDICAL DIAGNOSTIC IMAGING (1)
MEDICAL IMAGE COMPUTING (1)
MEDICAL SERVICES (1)
MEMORY ACCESS MECHANISMS (1)
MEMORY INTENSIVE APPLICATION (1)
MESSAGE PASSING (1)
MESSAGE PASSING INFERFACE (1)
MESSAGE PASSING INTERFACE (1)
MICROARCHITECTURE (1)
MODULAR HIGH-BANDWIDTH MULTIPROCESSOR SYSTEM (1)
MPLD (1)
MULTI-CORE CHIPS (1)
MULTIPLE GPUS (1)
MULTIPLE-CHANNELS-RENDERING (1)
MULTIPLE-GPU ALGORITHM (1)
MULTIPLE-GPUS (1)
MULTIPROCESSING SYSTEMS (1)
NETWORK-ON-CHIP (1)
NON-UNIFORM MEMORY ACCESS (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options