Search results

Items from 1 to 12 out of 12 results

chapter

Online scalability characterization of data-parallel programs on many cores

Younghyun Cho, Surim Oh, Bernhard Egger

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) > 191 - 205

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)

We present an accurate online scalability prediction model for data-parallel programs on NUMA many-core systems. Memory contention is considered to be the major limiting factor of program scalability as data parallelism limits the amount of synchronization or data dependencies between parallel work units. Reflecting the architecture of NUMA systems, contention is modeled at the last-level caches of...

chapter

A scalable communication-aware compilation flow for programmable accelerators

Jason Cong, Hui Huang, Mohammad Ali Ghodrat

2016 21st Asia and South Pacific Design Automation Conference (ASP-DAC) > 503 - 510

2016 21st Asia and South Pacific Design Automation Conference (ASP-DAC)

Programmable accelerators (PA) are receiving increased attention in domain-specific architecture designs to provide more general support for customization. In a PA-rich system, computational kernels are compiled into predefined PA templates and dynamically mapped to real PAs at runtime. This imposes a demanding challenge on the compiler side - that is, how to generate high-quality PA mapping code...

chapter

A New Approach to Embedding Semantic Link Network with Word2Vec Binary Code

Yanhong Yuan, Yao Liu, Qiaoli Huang, Zhixing Huang

2015 11th International Conference on Semantics, Knowledge and Grids (SKG) > 9 - 16

2015 11th International Conference on Semantics, Knowledge and Grids (SKG)

Graph-structured data has come into wide use in various fields where graphs are the natural data structure to model networks. Therefore, the comparison between two graphs becomes a research focus. Traditional approaches for graph comparison face the common problem: either increasing the runtime for large graphs or simplifying the representation of graphs which ignores part of their topological information...

chapter

Leveraging Hierarchical Data Locality in Parallel Programming Models

Ahmad Anbar, Engin Kayraklioglu, Olivier Serres, Tarek El Ghazawi

2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS) > 363 - 366

2014 IEEE International Conference on High Performance Computing and Communications (HPCC), 2014 IEEE 6th International Symposium on Cyberspace Safety and Security (CSS) and 2014 IEEE 11th International Conference on Embedded Software and Systems (ICESS)

We are proposing a novel framework that ameliorates locality-aware parallel programming models, by defining hierarchical data locality model extension. We also propose a hierarchical thread partitioning algorithm. This algorithm synthesizes hierarchical thread placement layouts that targets minimizing the program's overall communication costs. We demonstrated the effectiveness of our approach using...

chapter

PEMOGEN: Automatic adaptive performance modeling during program runtime

Arnamoy Bhattacharyya, Torsten Hoefler

2014 23rd International Conference on Parallel Architecture and Compilation (PACT) > 393 - 404

2014 23rd International Conference on Parallel Architecture and Compilation (PACT)

Traditional means of gathering performance data are tracing, which is limited by the available storage, and profiling, which has limited accuracy. Performance modeling is often used to interpret the tracing data and generate performance predictions. We aim to complement the traditional data collection mechanisms with online performance modeling, a method that generates performance models while the...

chapter

Simulation driven design of the German toll system - profiling simulation performance

Tommy Baumann, Bernd Pfitzinger, Thomas Jestadt

2013 Federated Conference on Computer Science and Information Systems > 923 - 926

2013 Federated Conference on Computer Science and Information Systems (FedCSIS)

Taking an existing large-scale simulation model of the German toll system we identify the typical workload by profiling the runtime behavior. Crucial performance hot spots are identified and related to the real-world application to analyze and evaluate the observed efficiency. In a benchmark approach we compare the observed performance to different simulation frameworks.

chapter

Modeling for Synthesis with System#

C. Kollner, F. Mendoza, K.D. Muller-Glaser

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 470 - 476

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

While Electronic Design Automation made the shift towards system design and high-level design methods keep on emerging, there is hardly any open framework which allows researchers to quickly prototype novel synthesis algorithms. We present System#, an open source system level design framework based on C#. System# tries to bridge the productivity gap by covering modeling, simulation, code transformations...

chapter

Linking Formal Description and Simulation of Runtime Reconfigurable Systems

Thilo Pionteck, Christoph Osterloh, Carsten Albrecht

2011 International Conference on Reconfigurable Computing and FPGAs > 158 - 163

2011 International Conference on Reconfigurable Computing and FPGAs (ReConFig 2011)

This paper links a well-investigated formalism for describing dynamic structured discrete event systems and a modelling methodology for runtime reconfigurable systems. The theory behind dynamic structured discrete event systems is used to back a generic SystemC simulation model derived from and developed for runtime reconfigurable systems. The coupling of formalism and model effects in particular...

chapter

On the (f)utility of untrusted data sanitization

Ashish Gehani, David Hanz, John Rushby, Grit Denker, more

2011 - MILCOM 2011 Military Communications Conference > 1261 - 1266

MILCOM 2011 - 2011 IEEE Military Communications Conference

Data sanitization has been studied in the context of architectures for high assurance systems, language-based information flow controls, and privacy-preserving data publication. A range of sanitization strategies has been developed to address the wide variety of data content and contexts that arise in practice. It is therefore tempting to separate the complex downgrading operations into untrusted...

chapter

TIDeFlow: The Time Iterated Dependency Flow Execution Model

Daniel Orozco, Elkin Garcia, Robert Pavel, Rishi Khan, more

2011 First Workshop on Data-Flow Execution Models for Extreme Scale Computing > 1 - 9

2011 First Workshop on Data-Flow Execution Models for Extreme Scale Computing (DFM)

The many-core revolution brought forward by recent advances in computer architecture has created immense challenges in the writing of parallel programs for High Performance Computing (HPC). Development of parallel HPC programs remains an art, and a universaldoctrine for synchronization, scheduling and execution in general has not been found for many-core/multi-core architectures. These issues are...

chapter

Enabling active storage on parallel I/O software stacks

Seung Woo Son, S Lang, P Carns, R Ross, more

2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST) > 1 - 12

2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST 2010)

As data sizes continue to increase, the concept of active storage is well fitted for many data analysis kernels. Nevertheless, while this concept has been investigated and deployed in a number of forms, enabling it from the parallel I/O software stack has been largely unexplored. In this paper, we propose and evaluate an active storage system that allows data analysis, mining, and statistical operations...

chapter

Split Objects for Multiconsistent Shared Memory

Nico Kaemmer, Patrick Schmidt, Steffen Gerhold, Thilo Schmitt, more

2010 Second International Conference on Computer Engineering and Applications > 1 > 240 - 243

2010 Second International Conference on Computer Engineering and Applications (ICCEA 2010)

Distributed systems with shared memory and more than one consistency model for shared data are often restricted in use or inflexible for programmers. This paper describes details of our transactional distributed memory system, that provides several consistency models for shared memory. To this end Rainbow OS implements so-called split objects guaranteeing the integrity of heap structures and providing...

Filter options

Data set:
ieee
Keywords:
KERNEL
RUNTIME
DATA MODELS

Publication date

Set your own date range

Keywords

COMPUTATIONAL MODELING (5)
ADAPTATION MODELS (3)
ANALYTICAL MODELS (3)
BENCHMARK TESTING (2)
COMPUTER ARCHITECTURE (2)
PREDICTIVE MODELS (2)
SCALABILITY (2)
ACTIVE STORAGE SYSTEM (1)
ATOMIC LAYER DEPOSITION (1)
CODELETS (1)
COMPLEXITY THEORY (1)
CONTEXT (1)
COPROCESSORS (1)
CSHARP (1)
DATA ANALYSIS (1)
DATA ANALYSIS BENCHMARKS (1)
DATA ANALYSIS KERNELS (1)
DATA COLLECTION (1)
DATA LOCALITY (1)
DATA MINING (1)
DATA TRANSFER (1)
DATAFLOW (1)
DEPENDENCY GRAPH (1)
DESIGN AUTOMATION (1)
DISCRETE EVENT SYSTEMS (1)
DISTRIBUTED DATABASES (1)
DISTRIBUTED OPERATING SYSTEM (1)
DISTRIBUTED SHARED MEMORY SYSTEMS (1)
DOTNET (1)
EXECUTION MODEL (1)
FILE I/O BUFFER (1)
FORMAL DESCRIPTION (1)
GLOBAL POSITIONING SYSTEM (1)
GPU OFFLOADING (1)
GRAPH KERNEL (1)
GRAPH LANGUAGES (1)
HARDWARE (1)
HARDWARE DESIGN LANGUAGES (1)
HIERARCHICAL THREAD CLUSTERING (1)
INTERSERVER COMMUNICATION (1)
ITERATED DATAFLOW (1)
K-MEANS CLUSTERING KERNEL (1)
MANY-CORES (1)
MEASUREMENT (1)
MEMORY MANAGEMENT (1)
MESSAGE SYSTEMS (1)
MULTICONSISTENCY (1)
MULTICONSISTENT SHARED MEMORY (1)
NEURAL NETWORKS (1)
OBJECT ORIENTED MODELING (1)
OPERATING SYSTEMS (COMPUTERS) (1)
OPTIMIZATION (1)
PARALLEL FILE SYSTEM (1)
PARALLEL I/O SOFTWARE STACKS (1)
PARALLEL PROCESSING (1)
PARALLEL PROGRAMMING (1)
PARTITIONING ALGORITHMS (1)
PATTERN CLUSTERING (1)
PERFORMANCE GAIN (1)
PROGRAM PROCESSORS (1)
PROGRAMMING (1)
RAINBOW OS (1)
RECONFIGURABLE LOGIC (1)
RESOURCE MANAGEMENT (1)
RUNTIME RECONFIGURABLE SYSTEMS (1)
RUNTIME SYSTEM (1)
SECURITY (1)
SEMANTIC LINK NETWORK (1)
SEMANTICS (1)
SERVER-SIDE COLLECTIVE COMMUNICATION PRIMITIVES (1)
SERVER-SIDE OPERATION (1)
SERVERS (1)
SPLIT OBJECT (1)
SPLIT OBJECTS (1)
STATISTICAL ANALYSIS (1)
STATISTICAL OPERATION (1)
STORAGE MANAGEMENT (1)
SYNCHRONIZATION (1)
SYSTEM-LEVEL DESIGN (1)
SYSTEMC (1)
SYSTEMSHARP (1)
TIDEFLOW (1)
TILES (1)
TRANSACTIONAL DISTRIBUTED MEMORY (1)
UNIFIED MODELING LANGUAGE (1)
VERY HIGH SPEED INTEGRATED CIRCUITS (1)
WORD2VEC (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options