Search results

Items 1 to 20 of 59 results

chapter

Analysis of K-bit pipelined processor cores using perl benchmarking

Eze Victor Chisom, K. C. Okafor, A. A. Obayi, Okoro Nkem Jennifer, et al.

2017 International Conference on Computing Networking and Informatics (ICCNI) > 1 - 7

2017 International Conference on Computing Networking and Informatics (ICCNI)

In today's high-performance computing (HPC) environments, analyzing and predicting the performance of multiple-processor systems (cluster cores) on critical workloads remains a challenge. This is a result of the key metrics that influence a system's behavior. Bursty arrivals in HPCs demand either a shared-memory parallel architecture or a pipelined dataflow architecture. At present, a processor model...

chapter

Software patterns for asymmetric multiprocessing devices on embedded systems: a performance assessment

Pedro Ignacio Martos, Alejandra Garrido

2017 Eight Argentine Symposium and Conference on Embedded Systems (CASE) > 1 - 6

2017 Eight Argentine Symposium and Conference on Embedded Systems (CASE)

In embedded systems there is a variant of Multicore System on Chip devices (MSoC devices) where not all the computing elements (processor cores) are equal. The differences in the cores of these devices range from different hardware architectures using the same instruction set to completely different processors working together inside the same device. These SoCs are called “Asymmetric Multi Processing...

chapter

Thermal-Aware Job Scheduling of MapReduce Applications on High Performance Clusters

Shubbhi Taneja, Yi Zhou, Mohammed Ibrahim Alghamdi, Xiao Qin

2017 46th International Conference on Parallel Processing Workshops (ICPPW) > 261 - 270

2017 46th International Conference on Parallel Processing Workshops (ICPPW)

In this study, we develop a thermal-aware job scheduling strategy called tDispatch tailored for MapReduce applications running on Hadoop clusters. The scheduling idea of tDispatch is motivated by a profiling study of CPU-intensive and I/O-intensive jobs from the perspective of thermal efficiency. More specifically, we investigate the thermal behaviors of these two types of jobs running on a Hadoop...

chapter

Voltage margins identification on commercial x86-64 multicore microprocessors

George Papadimitriou, Manolis Kaliorakis, Athanasios Chatzidimitriou, Charalampos Magdalinos, et al.

2017 IEEE 23rd International Symposium on On-Line Testing and Robust System Design (IOLTS) > 51 - 56

2017 IEEE 23rd International Symposium on On-Line Testing and Robust System Design (IOLTS)

In this paper, we explore the pessimistic voltage guardbands of two multicore x86-64 microprocessor chips that belong to different microarchitectures (one ultra-low power and one high-performance microprocessor), when programs are executed on individual cores of the CPU chips. We also examine the energy and temperature gains as positive effects of lowering the voltage in both chips while preserving...

chapter

Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications

Ugljesa Milic, Alejandro Rico, Paul Carpenter, Alex Ramirez

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 3 - 12

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

High-performance computing (HPC) applications have parallel code sections that must scale to large numbers of cores, which makes them sensitive to serial regions. Current supercomputing systems with heterogeneous or asymmetric CMPs (ACMPs) combine a few high-performance big cores for serial regions with many low-power lean cores for throughput computing. The low requirements of HPC applications...

chapter

An Evaluation of the NVIDIA TX1 for Supporting Real-Time Computer-Vision Workloads

Nathan Otterness, Ming Yang, Sarah Rust, Eunbyung Park, et al.

2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS) > 353 - 364

2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)

Autonomous vehicles are an exemplar for forward-looking safety-critical real-time systems where significant computing capacity must be provided within strict size, weight, and power (SWaP) limits. A promising way forward in meeting these needs is to leverage multicore platforms augmented with graphics processing units (GPUs) as accelerators. Such an approach is being strongly advocated by NVIDIA,...

chapter

Reducing code management overhead in software-managed multicores

Jian Cai, Yooseong Kim, Youngbin Kim, Aviral Shrivastava, et al.

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 1241 - 1244

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Software-managed architectures, which use scratch-pad memories (SPMs), are a promising alternative to cache-based architectures for multicores. SPMs provide scalability but require explicit management. For example, to use an instruction SPM, explicit management code needs to be inserted around every call site to load functions to the SPM. Such management code would check the state of the SPM and...

chapter

Cache Utilization as a Locality Metric - A Case Study on the Mantevo Suite

Nafiul Alam Siddique, Patricia Grubel, Abdel-Hameed A. Badawy, Jeanine Cook

2016 International Conference on Computational Science and Computational Intelligence (CSCI) > 549 - 554

2016 International Conference on Computational Science and Computational Intelligence (CSCI)

Cache hierarchies have long been utilized to minimize the latency of main memory accesses by caching frequently used data closer to the processor. Significant research has been done to identify the most crucial metrics of cache performance. Though the majority of research focuses on measuring cache hit rates and data movement as the major cache performance metrics, cache utilization can be equally...

chapter

Fast register consolidation and migration for heterogeneous multi-core processors

Elliott Forbes, Eric Rotenberg

2016 IEEE 34th International Conference on Computer Design (ICCD) > 1 - 8

2016 IEEE 34th International Conference on Computer Design (ICCD)

Single-ISA heterogeneous multi-core processors have been demonstrated to improve the performance and efficiency of general-purpose workloads. However, these designs leave some performance on the table due to the common assumption that the cost of migrating a program from one core to another is high. This high cost is due to the reliance on the operating system for a migration via a context switch...

chapter

Efficient pointer management of stack data for software managed multicores

Jian Cai, Aviral Shrivastava

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 67 - 74

2016 IEEE 27th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Scratchpad-memory (SPM) based memory hierarchy is a promising alternative to cache-based memory hierarchies, due to the difficulty in scaling caches to processors with high core count. However, explicit data management in software is required on SPM-based memory hierarchies. This paper focuses on optimizing the stack data management on SPM-based multicore processors, as memory accesses to call stack...

chapter

Diagnosing Virtualization Overhead for Multi-threaded Computation on Multicore Platforms

Xiaoning Ding, Jianchen Shan

2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom) > 226 - 233

2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom)

Hardware-assisted virtualization, as an effective approach to low virtualization overhead, has been predominantly used. However, existing hardware assistance mainly focuses on single-thread performance. Much less attention has been paid to facilitating efficient interaction between threads, which is critical to the execution of multi-threaded computation on virtualized multicore platforms. This paper...

chapter

Applying Multi-core Model Checking to Hardware-Software Partitioning in Embedded Systems

Alessandro Trindade, Hussama Ismail, Lucas Cordeiro

2015 Brazilian Symposium on Computing Systems Engineering (SBESC) > 102 - 105

2015 Brazilian Symposium on Computing Systems Engineering (SBESC)

We present an alternative approach to solving the hardware and software partitioning problem, which uses Bounded Model Checking (BMC) based on Satisfiability Modulo Theories (SMT) in conjunction with multi-core support using Open Multi-Processing. The multi-core approach allows initializing many verification instances based on the number of processor cores available to the model checker. Each instance...

chapter

Heterogeneous work-stealing across CPU and DSP cores

Vivek Kumar, Alina Sbirlea, Ajay Jayaraj, Zoran Budimlic, et al.

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

2015 IEEE High Performance Extreme Computing Conference (HPEC)

Due to increasing power constraints and ever-higher performance demands, many vendors have shifted their focus from designing high-performance computer nodes using powerful multicore general-purpose CPUs to nodes containing a smaller number of general-purpose CPUs aided by a larger number of more power-efficient special-purpose processing units, such as GPUs, FPGAs or DSPs. While offering...

article

A Cross-Layer Multicore Architecture to Tradeoff Program Accuracy and Resilience Overheads

Qingchuan Shi, Henry Hoffmann, Omer Khan

IEEE Computer Architecture Letters > 2015 > 14 > 2 > 85 - 89

To protect multicores from soft-error perturbations, resiliency schemes have been developed with high coverage but high power/performance overheads ($\sim$2$\times$...

chapter

A survey of hardware signature implementations in multi-core systems

R. Sangeetha, N. Ramasubramanian

2015 3rd International Conference on Signal Processing, Communication and Networking (ICSCN) > 1 - 5

2015 3rd International Conference on Signal Processing, Communication and Networking (ICSCN)

A signature is used as a short, unique representation to identify a person. In a similar manner, a hardware signature is used to identify items such as memory locations that are stored in bounded hardware registers in hashed form. This paper considers Bloom-filter-based hardware signatures and reviews several hardware signature implementations in multi-core systems. Some of the hardware signature implementations...

chapter

On the Influence of Shared Memory Contention in Real-Time Multicore Applications

Giovani Gracioli, Antonio Augusto Frohlich

2014 Brazilian Symposium on Computing Systems Engineering > 25 - 30

2014 Brazilian Symposium on Computing Systems Engineering (SBESC)

The continuous evolution of processor technology has allowed the utilization of multicore architectures in the embedded system domain. A major part of embedded systems, however, are inherently real-time (soft and hard) and the use of multicores in this domain is not straightforward due to their unpredictability in bounding worst-case execution scenarios. One of the main factors for unpredictability...

chapter

Improving Signature Behavior by Irrevocability in Transactional Memory Systems

Ricardo Quislant, Eladio Gutierrez, Emilio L. Zapata, Oscar Plata

2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing > 120 - 127

2014 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Signatures have been proposed in Hardware Transactional Memory (HTM) to represent read and write sets of transactions and decouple transaction conflict detection from private caches. Generally, signatures are implemented as Bloom filters that allow unbounded read/write sets to be summarized in bounded hardware, at the cost of address aliasing that causes false conflict detection. Such conflicts rises...

chapter

Impact of Serial Scaling of Multi-threaded Programs in Many-Core Era

Surya Narayanan, Bharath N. Swamy, Andre Seznec

2014 International Symposium on Computer Architecture and High Performance Computing Workshop > 36 - 41

2014 International Symposium on Computer Architecture and High Performance Computing Workshop (SBAC-PADW)

Estimating the potential performance of parallel applications on yet-to-be-designed future many-cores is very speculative. The traditional laws used to predict the performance of an application do not reflect the various scaling behaviors of a multi-threaded (MT) application, leading to optimistic estimates of performance in the many-core era. In this paper, we study the scaling behavior of MT applications...

chapter

A Case for Resource Efficient Prefetching in Multicores

Muneeb Khan, Andreas Sandberg, Erik Hagersten

2014 43rd International Conference on Parallel Processing > 101 - 110

2014 43rd International Conference on Parallel Processing (ICPP)

Modern processors typically employ sophisticated prefetching techniques for hiding memory latency. Hardware prefetching has proven very effective and can speed up some SPEC CPU 2006 benchmarks by more than 40% when running in isolation. However, this speedup often comes at the cost of prefetching a significant volume of useless data (sometimes more than twice the data required) which wastes shared...

chapter

Spider: A Synchronous Parameterized and Interfaced Dataflow-based RTOS for multicore DSPS

Julien Heulot, Maxime Pelcat, Karol Desnos, Jean-Francois Nezan, et al.

2014 6th European Embedded Design in Education and Research Conference (EDERC) > 167 - 171

2014 6th European Embedded Design in Education and Research Conference (EDERC)

This paper introduces a novel Real-Time Operating System (RTOS) based on a parameterized dataflow Model of Computation (MoC). This RTOS, called Synchronous Parameterized and Interfaced Dataflow Embedded Runtime (SPiDER), aims at efficiently scheduling Parameterized and Interfaced Synchronous Dataflow (PiSDF) graphs on multicore architectures. It exploits features of PiSDF to locate locally static...

Filter options

Dataset:
ieee
Keywords:
HARDWARE
MULTICORE PROCESSING
BENCHMARK TESTING

Publication type

book (55)
article (4)

Keywords

INSTRUCTION SETS (12)
PROGRAM PROCESSORS (10)
MULTIPROCESSING SYSTEMS (9)
REGISTERS (7)
RUNTIME (7)
RADIATION DETECTORS (6)
SOFTWARE (6)
MEMORY MANAGEMENT (5)
OPTIMIZATION (5)
THROUGHPUT (5)
MESSAGE SYSTEMS (4)
MICROPROCESSOR CHIPS (4)
MULTI-THREADING (4)
MULTICORE (4)
OPERATING SYSTEMS (4)
PARALLEL PROGRAMMING (4)
PROGRAMMING (4)
RANDOM ACCESS MEMORY (4)
REAL-TIME SYSTEMS (4)
VIRTUAL MACHINING (4)
BANDWIDTH (3)
COMPUTATIONAL MODELING (3)
FAULT TOLERANCE (3)
KERNEL (3)
MEASUREMENT (3)
MONITORING (3)
MULTI-CORE (3)
MULTICORE ARCHITECTURE (3)
PERFORMANCE EVALUATION (3)
POWER DEMAND (3)
PROTOCOLS (3)
RELIABILITY (3)
RESOURCE MANAGEMENT (3)
ART (2)
ASYMMETRIC MULTICORE PROCESSOR (AMP) (2)
BENCHMARKING (2)
CACHE (2)
CACHE COHERENCE (2)
CACHE STORAGE (2)
COHERENCE (2)
COMPUTER ARCHITECTURE (2)
CONTEXT (2)
EMBEDDED SYSTEMS (2)
FIELD PROGRAMMABLE GATE ARRAYS (2)
HARDWARE SUPPORT (2)
HIGH PERFORMANCE COMPUTING (2)
INSTRUCTIONS PER CYCLE (IPC) (2)
JAVA (2)
LOAD BALANCING (2)
LOAD MANAGEMENT (2)
LOAD MODELING (2)
MULTICORE ARCHITECTURES (2)
MULTICORE PROCESSORS (2)
OPENMP (2)
PARALLEL PROCESSING (2)
PERFORMANCE (2)
PREFETCHING (2)
PROPOSALS (2)
SCALABILITY (2)
SCHEDULING (2)
STARSS (2)
SYNCHRONIZATION (2)
TASK MANAGEMENT (2)
TIMING (2)
VIRTUAL MACHINE (2)
VIRTUAL MACHINE MONITORS (2)
VIRTUALIZATION (2)
WORKLOAD CHARACTERIZATION (2)
ACCURACY (1)
ADAPTIVE EMBEDDED PLATFORM (1)
ANALYTICAL MODELS (1)
APPLICATION PROGRAM BEHAVIOR (1)
AREA-EQUIVALENT HOMOGENEOUS MULTICORE (HMG) (1)
ASSOCIATIVE-CACHES (1)
ASYMMETRIC MULTICORE PROCESSOR (1)
ASYMMETRIC MULTIPROCESSING PATTERNS (1)
AUTO-TUNED CODE (1)
AUTOTUNING (1)
BENCHMARK METRICS (1)
BERKELEYS DWARF TAXONOMY (1)
BIO-MEDICAL SIGNAL PROCESSING (1)
BLOCK LEVEL COMMUNICATION CRITICALITY INFORMATION (1)
BLOOM FILTER (1)
BYTECODE OPTIMIZATIONS (1)
CACHE EFFECTS (1)
CACHE HIERARCHY (1)
CACHE LINE UTILIZATION (1)
CACHE UTILIZATION (1)
CENTRALIZED MECHANISM (1)
CHARACTERIZATION (1)
CIRCUITS AND SYSTEMS (1)
CLASS LOADING (1)
CLOCKS (1)
CLUSTER OF MICROPROCESSORS (1)
CMP (1)
CMPS (1)
COARSE GRANULARITY (1)

INFONA - science communication portal
