Search results

Items 101 to 120 of 273 results

chapter

Exhaustive Key Search on Clusters of GPUs

Davide Barbieri, Valeria Cardellini, Salvatore Filippone

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 1160 - 1168

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

Exhaustive search is generally a last resort for solving a problem: each possible state of a system is generated and evaluated against a condition to find if the problem solution is attained. In some cases, for example in the reversal of cryptographic hash functions that make use of the salting technique, there are very few valid alternatives. However, the set of candidate solutions can be extremely...

chapter

QuickRelease: A throughput-oriented approach to release consistency on GPUs

Blake A. Hechtman, Shuai Che, Derek R. Hower, Yingying Tian, et al.

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA) > 189 - 200

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA)

Graphics processing units (GPUs) have specialized throughput-oriented memory systems optimized for streaming writes with scratchpad memories to capture locality explicitly. Expanding the utility of GPUs beyond graphics encourages designs that simplify programming (e.g., using caches instead of scratchpads) and better support irregular applications with finer-grain synchronization. Our hypothesis...

chapter

MRPB: Memory request prioritization for massively parallel processors

Wenhao Jia, Kelly A. Shaw, Margaret Martonosi

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA) > 272 - 283

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA)

Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high performance for a broad range of programs. They are, however, complex to program, especially because of their intricate memory hierarchies with multiple address spaces. In response, modern GPUs have widely adopted caches, hoping to provide smoother reductions in memory access traffic and latency....

chapter

Flexible and scalable implementation of H.264/AVC encoder for multiple resolutions using ASIPs

Hong Chinh Doan, Haris Javaid, Sri Parameswaran

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1 - 6

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Real-time encoding of video streams is computationally intensive and rarely carried out at high resolutions. In this paper, for the first time, we propose a platform for an H.264 encoder which is both flexible (allows software upgrades) and scalable (supports multiple resolutions), and supports high video quality (by using both intra-prediction and inter-prediction) and high throughput (by exploiting...

chapter

On the Fairness of Linux O(1) Scheduler

Jyothish Jose, Oravanpadath Sujisha, Malayamparambath Gilesh, Thayyil Bindima

2014 5th International Conference on Intelligent Systems, Modelling and Simulation > 668 - 674

2014 5th International Conference on Intelligent Systems, Modelling and Simulation (ISMS)

The scheduling algorithm of Linux operating systems has to fulfill several conflicting objectives: fast process response time, higher throughput for background jobs, avoidance of process starvation, reconciliation of the needs of low and high priority processes, etc. The set of rules used to determine when and how to select a new process to run is called the scheduling policy. The current Linux kernel uses...

chapter

Preemptive thread block scheduling with online structural runtime prediction for concurrent GPGPU kernels

Sreepathi Pai, R. Govindarajan, Matthew J. Thazhuthaveetil

2014 23rd International Conference on Parallel Architecture and Compilation (PACT) > 483 - 484

2014 23rd International Conference on Parallel Architecture and Compilation (PACT)

Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these GPUs, the thread block scheduler (TBS) currently uses the FIFO policy to schedule thread blocks of concurrent kernels. We show that the FIFO policy leaves performance to chance, resulting in significant loss of performance and fairness. To improve performance and fairness, we propose use of the preemptive...

chapter

Optimizing Event Polling for Network-Intensive Applications: A Case Study on Redis

Xingbo Wu, Xiang Long, Lei Wang

2013 International Conference on Parallel and Distributed Systems > 687 - 692

2013 International Conference on Parallel and Distributed Systems (ICPADS)

In today's data centers supporting Internet-scale computing and I/O services, increasingly more network-intensive applications are deployed on the network as a service. To this end, it is critical for the applications to quickly retrieve requests from the network and send their responses to the network. To facilitate this network function, the operating system usually provides an event notification mechanism...

chapter

Performance comparison of wireless networks over IPv6 and IPv4 under several operating systems

Hossam M. A. Fahmy, Salma A. Ghoneim

2013 IEEE 20th International Conference on Electronics, Circuits, and Systems (ICECS) > 670 - 673

2013 IEEE 20th International Conference on Electronics, Circuits, and Systems (ICECS)

IPv6 has been introduced, yet it is not widely used. Research has pointed in many directions: how to migrate from IPv4 to IPv6, how to adapt hardware devices to support a transitory period from coexistence of IPv4 and IPv6 to established use of IPv6, and how operating systems perform when using IPv6 as compared to IPv4. This work provides a comparative performance...

chapter

Online Performance Projection for Clusters with Heterogeneous GPUs

Lokendra S. Panwar, Ashwin M. Aji, Jiayuan Meng, Pavan Balaji, et al.

2013 International Conference on Parallel and Distributed Systems > 283 - 290

2013 International Conference on Parallel and Distributed Systems (ICPADS)

We present a fully automated approach to project the relative performance of an OpenCL program over different GPUs. Performance projections can be made within a small amount of time, and the projection overhead stays relatively constant with the input data size. As a result, the technique can help runtime tools make dynamic decisions about which GPU would run faster for a given kernel. Usage cases...

chapter

Dependency Design for Large Cloud Applications

Rui Liu, Xin Sheng Mao

2013 International Conference on Computer Sciences and Applications > 670 - 673

2013 International Conference on Computer Sciences and Applications (CSA)

Dependency design is one of the key factors in the overall performance of a large application system, in both the runtime phase and the development phase. Two features of cloud-enabled large applications make this dependency design more critical. The first is that service is always distributed and data application is always partitioned. To be distributed and partitioned means there is more delay and complexity...

chapter

On the performance of Linux Container with Netmap/VALE for networks virtualization

Maurizio Casoni, Carlo Augusto Grazia, Natale Patriciello

2013 19th IEEE International Conference on Networks (ICON) > 1 - 6

2013 19th IEEE International Conference on Networks (ICON)

In this paper we study the problem of how to simulate complex networks on general-purpose hardware in an efficient, feasible and scalable way. State-of-the-art solutions for network simulation are based on the virtualization of network simulators (so as to emulate the network's nodes) or on the usage of specialized software that models the network itself (so as to emulate the network's links). The former...

chapter

Open the Gates: Using High-level Synthesis towards programmable LDPC decoders on FPGAs

Frederico Pratas, Joao Andrade, Gabriel Falcao, Vitor Silva, et al.

2013 IEEE Global Conference on Signal and Information Processing > 1274 - 1277

2013 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

State-of-the-art decoders for LDPC codes adopted by several digital communication standards require a significant amount of hardware resources to achieve the desired high throughput performance. With technology scaling below 22 nm and with billions of transistors available per chip/device, the development cost and complexity of such designs represent an increasing challenge for hardware designers...

chapter

High throughput low latency LDPC decoding on GPU for SDR systems

Guohui Wang, Michael Wu, Bei Yin, Joseph R. Cavallaro

2013 IEEE Global Conference on Signal and Information Processing > 1258 - 1261

2013 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

In this paper, we present a high throughput and low latency LDPC (low-density parity-check) decoder implementation on GPUs (graphics processing units). The existing GPU-based LDPC decoder implementations suffer from low throughput and long latency, which prevent them from being used in practical SDR (software-defined radio) systems. To overcome this problem, we present optimization techniques for...

chapter

Selective Profiling for OS Scalability Study on Multicore Systems

Kuo-Yi Chen, Yuan-Hao Chang, Pei-Shu Liao, Pen-Chung Yew, et al.

2013 IEEE 6th International Conference on Service-Oriented Computing and Applications > 174 - 181

2013 IEEE 6th International Conference on Service-Oriented Computing and Applications (SOCA)

With more cores becoming available in each future generation of microprocessors (i.e. the well-known Moore's Law), scalability is becoming an increasingly important issue. Scalability of the operating system, in particular, is critical to such systems. To study OS scalability and many other issues related to OS performance on multicore systems, software and hardware profilers are indispensable tools...

chapter

An architecture of virtual NIC driver based on WinPcap and a method to test it

Dongxiang Fang, Peifeng Zeng

2013 15th IEEE International Conference on Communication Technology > 555 - 559

2013 15th IEEE International Conference on Communication Technology (ICCT)

An architecture for a WinPcap-based virtual NIC driver is proposed for developing embedded network applications. It makes it possible to develop applications with real network traffic using a simulation environment on a PC rather than the embedded hardware, reducing development cycles while keeping the cost down. A test method with the tool of Iperf is proposed to test the throughput, packet loss rate for...

chapter

TCP_SDF: Transport Control to Achieve Smooth Data Flow at a Stable Throughput

Xukang Lu, Chase Qishi Wu, Paul Sheldon, Alan Tackett, et al.

2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing > 1187 - 1194

2013 IEEE International Conference on High Performance Computing and Communications (HPCC) & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

Various large-scale network applications demand different levels of stable bandwidth to perform bulk data movement, which is not adequately supported by current transport methods. We propose a novel end-to-end transport control protocol (TCP_SDF), which achieves smooth data flow at a stable goodput level. TCP_SDF dynamically adjusts the congestion window based on the estimate of the current sending...

chapter

Cross-layer cooperation to boost multipath TCP performance in cloud networks

Matthieu Coudron, Stefano Secci, Guy Pujolle, Patrick Raad, et al.

2013 IEEE 2nd International Conference on Cloud Networking (CloudNet) > 58 - 66

2013 IEEE 2nd International Conference on Cloud Networking (CloudNet)

Cloud networking imposes new requirements in terms of connection resiliency and throughput among virtual machines, hypervisors and users. A promising direction is to exploit multipath communications, yet existing protocols have such a limited scope that performance improvements are often unreachable. Generally, multipathing adds signaling overhead and in certain conditions may in fact decrease throughput...

chapter

Fast and flexible: Parallel packet processing with GPUs and Click

Weibin Sun, Robert Ricci

Architectures for Networking and Communications Systems > 25 - 35

2013 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS)

We introduce Snap, a framework for packet processing that outperforms traditional software routers by exploiting the parallelism available on modern GPUs. While obtaining high performance, it remains extremely flexible, with packet processing tasks implemented as simple modular elements that are composed to build fully functional routers and switches. Snap is based on the Click modular router, which...

chapter

k-p0f: A high-throughput kernel passive OS fingerprinter

Jason Barnes, Patrick Crowley

Architectures for Networking and Communications Systems > 113 - 114

2013 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS)

Most critical security vulnerabilities depend on the OS. If a hacker finds a machine with a vulnerable OS, then he can attack the system. Network administrators can defend against OS-specific attacks if they can find vulnerable machines before hackers do, but physically checking or actively scanning a large network can take time and resources. This paper describes a modification of p0f implemented...

chapter

Taming TCP incast throughput collapse in data center networks

Jiao Zhang, Fengyuan Ren, Li Tang, Chuang Lin

2013 21st IEEE International Conference on Network Protocols (ICNP) > 1 - 10

2013 21st IEEE International Conference on Network Protocols (ICNP)

The TCP incast problem attracts a lot of attention due to its wide existence in cloud services and the catastrophic performance degradation it causes. Some effort has been made to solve it. However, industry players such as Facebook are still struggling with it. Based on the finding that the TCP incast problem is mainly caused by the TimeOuts (TOs) occurring at the boundary of the stripe units, this paper presents...

Filter options

Search keywords:
KERNEL
THROUGHPUT

Content availability

Available (269)
Unavailable (4)

Keywords

LINUX (71)
HARDWARE (45)
PROTOCOLS (41)
SERVERS (38)
GRAPHICS PROCESSING UNITS (37)
COMPUTER ARCHITECTURE (35)
INSTRUCTION SETS (34)
BANDWIDTH (31)
PERFORMANCE EVALUATION (29)
FIELD PROGRAMMABLE GATE ARRAYS (28)
IP NETWORKS (26)
TRANSPORT PROTOCOLS (24)
GPU (21)
OPTIMIZATION (20)
BENCHMARK TESTING (19)
RECEIVERS (19)
PARALLEL PROCESSING (18)
DELAY (17)
GRAPHICS PROCESSING UNIT (17)
SWITCHES (17)
RANDOM ACCESS MEMORY (16)
CUDA (15)
DATA MINING (14)
MEMORY MANAGEMENT (14)
PIPELINES (14)
SOCKETS (13)
ENCODING (12)
GPGPU (12)
PROGRAM PROCESSORS (12)
SCHEDULING (12)
TCP (12)
VIRTUAL MACHINING (12)
ENGINES (11)
LOCAL AREA NETWORKS (11)
PERFORMANCE (11)
ALGORITHM DESIGN AND ANALYSIS (10)
ARRAYS (10)
CONTEXT (10)
CRYPTOGRAPHY (10)
DECODING (10)
FPGA (10)
INTERNET (10)
MONITORING (10)
SYNCHRONIZATION (10)
VIRTUAL MACHINES (10)
CLOUD COMPUTING (9)
COPROCESSORS (9)
DELAYS (9)
MULTIPROCESSING SYSTEMS (9)
RESOURCE MANAGEMENT (9)
SCALABILITY (9)
SCHEDULES (9)
YARN (9)
DRIVER CIRCUITS (8)
TELECOMMUNICATION CONGESTION CONTROL (8)
OPERATING SYSTEM KERNELS (7)
PIPELINE PROCESSING (7)
REAL TIME SYSTEMS (7)
REGISTERS (7)
CLOCKS (6)
COMPUTATIONAL MODELING (6)
CONGESTION CONTROL (6)
CONVOLUTION (6)
DIGITAL SIGNAL PROCESSING (6)
LINUX KERNEL (6)
MEASUREMENT (6)
OPTIMISATION (6)
PROGRAMMING (6)
RESOURCE ALLOCATION (6)
STREAMING MEDIA (6)
TELECOMMUNICATION TRAFFIC (6)
WIRELESS LAN (6)
CACHE STORAGE (5)
COMPUTER GRAPHIC EQUIPMENT (5)
CONTAINERS (5)
DEGRADATION (5)
DETECTORS (5)
EMBEDDED SYSTEMS (5)
ETHERNET NETWORKS (5)
MESSAGE SYSTEMS (5)
MICROPROCESSOR CHIPS (5)
MULTI-THREADING (5)
NETWORK INTERFACES (5)
OPENCL (5)
PREFETCHING (5)
PROCESSOR SCHEDULING (5)
QUALITY OF SERVICE (5)
ROUTING (5)
SHARED MEMORY (5)
SYSTEM-ON-CHIP (5)
TIME FACTORS (5)
VIRTUALIZATION (5)
WRITING (5)
ACCELERATION (4)
BUFFER STORAGE (4)
COMPLEXITY THEORY (4)
DATABASES (4)
EMULATION (4)

INFONA - science communication portal
