Wyniki wyszukiwania

Pozycje od 1 do 5 spośród 5 wyników

rozdział

Variability: A Tuning Headache

Allan Porterfield, Sridutt Bhalachandra, Wei Wang, Rob Fowler

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 1069 - 1072

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Performance tuning is an ongoing activity at most HPC sites. Small performance improvements can save thousands of dollars. Run-to-run performance variations significantly impact performance tuning. Not being able to tell which code version is faster (or more energy efficient) in a single run greatly increases the computational expense and uncertainty for theprogrammer. We will show examples where...

rozdział

Serialization Management for Best-Effort Hardware Transactional Memory

Matthew Gaudet, Guido Araujo, Jose Nelson Amaral

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) > 138 - 145

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Most studies of Best-Effort HTM (BE-HTM) performance use a single serialization manager and a single parameter value across all benchmarks, inputs and thread counts. The experimental study in this paper indicates that the values chosen for serialization-manager parameters have a significant effect on performance in the Blue Gene/Q's (BG/Q) BE-HTM system. Moreover, for a given serialization manager,...

rozdział

One OpenCL to rule them all?

Romain Dolbeau, Francois Bodin, Guillaume Colin de Verdiere

2013 IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS) > 1 - 6

2013 IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS)

OpenCL is now available on a very large set of processors. This makes this language an attractive layer to address multiple targets with a single code base. The question on how sensitive to the underlying hardware is the OpenCL code in practice remains to be better understood. ¹

rozdział

Understanding the impact of CUDA tuning techniques for Fermi

Yuri Torres, Arturo Gonzalez-Escribano, Diego R. Llanos

2011 International Conference on High Performance Computing & Simulation > 631 - 639

2011 International Conference on High Performance Computing & Simulation (HPCS)

While the correctness of an NVIDIA CUDA program is easy to achieve, exploiting the GPU capabilities to obtain the best performance possible is a task for CUDA experienced programmers. Typical code tuning strategies, like choosing an appropriate size and shape for the thread-blocks, programming a good coalescing, or maximize occupancy, are inter-dependent. Moreover, the choices are also dependent on...

rozdział

PIR: PMaC's Idiom Recognizer

C Olschanowsky, A Snavely, M R Meswani, L Carrington

2010 39th International Conference on Parallel Processing Workshops > 189 - 196

2010 39th International Conference on Parallel Processing Workshops (ICPPW)

The speed of the memory subsystem often constrains the performance of large-scale parallel applications. Experts tune such applications to use hierarchical memory subsystems efficiently. Hardware accelerators, such as GPUs, can potentially improve memory performance beyond the capabilities of traditional hierarchical systems. However, the addition of such specialized hardware complicates code porting...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Variability: A Tuning Headache

Serialization Management for Best-Effort Hardware Transactional Memory

One OpenCL to rule them all?

Understanding the impact of CUDA tuning techniques for Fermi

PIR: PMaC's Idiom Recognizer

Opcje filtrowania

Data publikacji

Słowa kluczowe

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Variability: A Tuning Headache

Serialization Management for Best-Effort Hardware Transactional Memory

One OpenCL to rule them all?

Understanding the impact of CUDA tuning techniques for Fermi

PIR: PMaC's Idiom Recognizer

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu