Search results

Items 41 to 53 of 53 results

chapter

Cache-Efficient, Intranode, Large-Message MPI Communication with MPICH2-Nemesis

D. Buntinas, B. Goglin, D. Goodell, G. Mercier, et al.

2009 International Conference on Parallel Processing > 462 - 469

2009 International Conference on Parallel Processing (ICPP 2009)

The emergence of multicore processors raises the need to efficiently transfer large amounts of data between local processes. MPICH2 is a highly portable MPI implementation whose large-message communication schemes suffer from high CPU utilization and cache pollution because of the use of a double-buffering strategy, common to many MPI implementations. We introduce two strategies offering a kernel-assisted,...

chapter

Linux transplantation based on the processor S3C2440

Sun Yanpeng, Peng Peng, Zhang Yuan

2009 9th International Conference on Electronic Measurement & Instruments > 2-306 - 2-309

2009 9th International Conference on Electronic Measurement & Instruments (ICEMI 2009)

This paper describes the methods and progress of transplanting embedded Linux to a target board based on the S3C2440 processor, including the establishment of the cross-compiler environment, the reduction and compilation of the startup code (bootloader) and the Linux kernel, and the construction of the root file system, with a focus on the structure and function of the bootloader as well as the transplantation...

chapter

Resource manager with multi-core support for parallel desktop

J.R. Garcia, J.L. Lerida, P. Hernandez

2009 IEEE International Conference on Cluster Computing and Workshops > 1 - 4

2009 IEEE International Conference on Cluster Computing and Workshops (CLUSTER)

Due to the fall in the price of multicore processors, today's non-dedicated clusters tend to include this kind of hardware in their configurations. However, most current resource managers and job schedulers are optimized with the aim of maximizing the throughput in single-core environments. Additionally, job schedulers balance thread loads without considering such aspects as cache affinity, resource...

chapter

Design and implementation of multi-channel CAN communication interface based on embedded linux

Gang Dai, Guanghua Gong, Beibei Shao, Wei Su

2009 9th International Conference on Electronic Measurement & Instruments > 3-6 - 3-9

2009 9th International Conference on Electronic Measurement & Instruments (ICEMI 2009)

To meet the communication and control requirements that the HLA simulation platform places on a multi-channel Controller Area Network (CAN) bus, this paper describes a method, based on an embedded PowerPC system, for the design and implementation of a multi-channel CAN communication platform. The corresponding Linux multi-channel CAN device driver has also been developed. This flexible system can connect to different number...

chapter

Scalability challenges for massively parallel AMR applications

B. Van Straalen, J. Shalf, T. Ligocki, N. Keen, et al.

2009 IEEE International Symposium on Parallel & Distributed Processing > 1 - 12

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

PDE solvers using Adaptive Mesh Refinement on block structured grids are some of the most challenging applications to adapt to massively parallel computing environments. We describe optimizations to the Chombo AMR framework that enable it to scale efficiently to thousands of processors on the Cray XT4. The optimization process also uncovered OS-related performance variations that were not explained...

chapter

Minimizing startup costs for performance-critical threading

A.M. Castaldo, R.C. Whaley

2009 IEEE International Symposium on Parallel & Distributed Processing > 1 - 8

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Using the well-known ATLAS and LAPACK dense linear algebra libraries, we demonstrate that the parallel management overhead (PMO) can grow with problem size even on statically scheduled parallel programs with minimal task interaction. Therefore, the widely held view that these thread management issues can be ignored in such computationally intensive libraries is wrong, and leads to substantial slowdown...

chapter

Enabling high-performance memory migration for multithreaded applications on LINUX

B. Goglin, N. Furmento

2009 IEEE International Symposium on Parallel & Distributed Processing > 1 - 9

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

As the number of cores per machine increases, memory architectures are being redesigned to avoid bus contention and sustain higher throughput needs. The emergence of Non-Uniform Memory Access (NUMA) constraints has caused affinities between threads and buffers to become an important decision criterion for schedulers. Memory migration dynamically enables the joint distribution of work and data across...

chapter

An Automatic Compiler Optimizations Selection Framework for Embedded Applications

Shih-Hao Hung, Chia-Heng Tu, Huang-Sen Lin, Chi-Meng Chen

2009 International Conference on Embedded Software and Systems > 381 - 387

2009 International Conference on Embedded Software and Systems. ICESS 2009

Optimizing compilers provide users with compiler options to maximize program performance. The selection of compiler options is important, as the resulting performance can vary significantly. The best combination of compiler options depends not only on the program itself but also, to a large degree, on the configuration of the system and the architecture of the processor that the program runs...

chapter

Characterization of Direct Cache Access on multi-core systems and 10GbE

A. Kumar, R. Huggahalli, S. Makineni

2009 IEEE 15th International Symposium on High Performance Computer Architecture > 341 - 352

HPCA - 15 2009. IEEE 15th International Symposium on High Performance Computer Architecture

10 GbE connectivity is expected to be a standard feature of server platforms in the near future. Among the numerous methods and features proposed to improve network performance of such platforms is direct cache access (DCA) to route incoming I/O to CPU caches directly. While this feature has been shown to be promising, there can be significant challenges when dealing with high rates of traffic in...

chapter

Virtualization for Advanced Power Management of Consumer Electronic Devices

F. Altschuler, V. Palatin

2009 6th IEEE Consumer Communications and Networking Conference > 1 - 3

2009 6th IEEE Consumer Communications and Networking Conference. "Empowering the Connected Consumer"

We describe the use of a dedicated power management virtual machine in the context of portable consumer electronics devices. The high level architecture and inter-virtual machine dependencies are discussed as well as key power management strategies and issues.

chapter

Design and evaluation of a collective IO model for loosely coupled petascale programming

Zhao Zhang, A. Espinosa, K. Iskra, I. Raicu, et al.

2008 Workshop on Many-Task Computing on Grids and Supercomputers > 1 - 10

MTAGS 2008. Workshop on Many-Task Computing on Grids and Supercomputers

Loosely coupled programming is a powerful paradigm for rapidly creating higher-level applications from scientific programs on petascale systems, typically using scripting languages. This paradigm is a form of many-task computing (MTC) which focuses on the passing of data between programs as ordinary files rather than messages. While it has the significant benefits of decoupling producer and consumer...

chapter

Logical Partitioning without Architectural Supports

T. Shimosawa, H. Matsuba, Y. Ishikawa

2008 32nd Annual IEEE International Computer Software and Applications Conference > 355 - 364

2008 IEEE 32nd International Computer Software and Applications Conference (COMPSAC)

As a method for running multiple operating systems on one machine, we propose a new resource partitioning method that we have named "single hardware with independent multiple operating systems" (SHIMOS). In SHIMOS, CPU and memory resources are partitioned by multiple native kernels without any architectural virtualization support. There is almost no slowdown, unlike VMs, because the kernel and...

chapter

Framework for supporting multi-service edge packet processing on network processors

A. Raghunath, A. Kunze, E.J. Johnson, V. Balakrishnan

2005 Symposium on Architectures for Networking and Communications Systems (ANCS) > 163 - 171

2005 Symposium on Architectures for Networking and Communications Systems (ANCS)

Network edge packet-processing systems, as are commonly implemented on network processor platforms, are increasingly required to support a rich set of services. These multi-service systems are also subjected to widely varying and unpredictable traffic. Current network processor systems do not simultaneously deal well with a variety of services and fluctuating workloads. For example, current methods...

Filter options

Keywords:
PROGRAM PROCESSORS
KERNEL


INFONA - science communication portal
