The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Detailed memory access traces are extremely helpful for system partitioning and optimization in the context of hardware/software codesign, especially in early design stages. The prevalent technique for the generation of such traces is interpretive instruction set simulation which, however, depends on detailed modeling and further results in poor performance. With compiled simulation techniques, performance...
This work presents a minimally-intrusive, high-performance, post-silicon validation framework for validating memory consistency in multi-core systems. Our framework generates constrained-random tests that are instrumented with observability-enhancing code for memory consistency verification. For each test, we generate a set of compact signatures reflecting the memory-ordering patterns observed over...
Memory access tracing is aprogram analysis technique with many different applications, ranging from architectural simulation to (on-line) data placement optimization and security enforcement. In this article we propose a memory access tracing approach based on static x86 binary instrumentation. Unlike non-selective schemes, whichinstrument all the memory access instructions, our proposal selectively...
As high performance computing (HPC) systems reach exascale proportions, the cost of simulation in time and resources increases. Tools for selecting representative parts of parallel applications to reduce simulation cost are widespread, e.g., BarrierPoint achieves this by analysing abstract characteristics such as basic blocks and reuse distances. However, architectures new to HPC will have a limited...
Existing network simulators do not account for the overhead of communication software execution, which can be significant when devices in the network are resource constrained, like in MANETs and sensor networks. We propose an approach to extend network simulators to model the execution of communication software and to map the resulting behaviour onto existing protocol models. The approach can be used...
We present a new approach to software performance analysis that aims to extend conventional profiling with some of the predictive capabilities of a performance model. The idea is to execute programs in virtual time, which allows hypothetical time scaling of the constituent parts of an application to be explored in advance of any speculative redevelopment aimed at improving performance. Virtual time...
Over the past few years, computer architecture research has moved towards execution-driven simulation, due to the inability of traces to capture timing-dependent thread execution interleaving. However, trace-driven simulation has many advantages over execution-driven that are being missed in multithreaded application simulations. We present a methodology to properly simulate multithreaded applications...
Predicting performance of parallel applications is becoming increasingly complex and the best performance predictor is the application itself, but the time required to run it thoroughly is a onerous requirement. We seek to characterize the behavior of message-passing applications on different systems by extracting a signature which will allow us to predict what system will allow the application to...
Irregular scientific applications are difficult to parallelize in an efficient and scalable fashion due to indirect memory references (i.e. A[B[i]]), irregular communication patterns, and load balancing issues. In this paper, we present our experience parallelizing an irregular scientific application written in Java. The application is an N-Body molecular dynamics simulation that is the main component...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.