The arch project is a suite of mini-apps that have been developed with consistent coding practices, under a common infrastructural layer. Great emphasis has been placed on making the applications concise and easy to manipulate, while capturing the key performance characteristics of their proxied algorithmic classes. The suite is intended for traditional exploration of performance, portability and...
To satisfy the growing computational demands of modern applications, significant enhancements have been introduced in contemporary processor architectures with the aim of increasing their attainable performance, such as an increased number of cores, improved capability of the memory subsystem and enhancements in the processor pipeline [1]. Therefore, the performance improvements are usually coupled with an...
Currently, researchers use simulators to experiment with their innovations on emerging non-volatile memory. Unfortunately, simulation is both time-consuming and hard to debug. In this paper, we present a non-volatile memory emulator which enables system-level research on emerging memory. Our emulator uses performance monitoring units on off-the-shelf processors to implement an accurate performance...
This paper presents a comparison of existing and novel behavioral models targeted at the outphasing power amplifier (PA) architecture. A comprehensive comparison of ten modeling strategies is presented in the results. Novel techniques for outphasing PAs, such as vector switched and dual path time series, are also presented for the first time. Investigation of such techniques was driven by the analysis...
CMOS logarithmic amplifiers based on piecewise linear approximation have usually been analyzed and designed by assuming ideal conditions. This paper discusses non-ideal factors in the components used, which play a key role in the performance of high-accuracy logarithmic amplifiers for sensing applications, and shows that the use of simplified mathematical models may lead to wrong conclusions. A gain-mismatch...
To address the challenge of unprecedented growth in mobile data traffic, ultra-dense network deployment is a cost-efficient solution to offload the traffic over some small cells. The overlapped coverage areas of small cells create more than one candidate access point for one mobile station. Signal strength based user association in IEEE 802.11 results in a significantly unbalanced load distribution...
In dynamic bandwidth management based on traffic prediction, the traffic flows can be modeled as time-series data. The state-of-the-art technique for modeling these traffic flows is a linear model. In contrast, the Recurrent Neural Network (RNN) has been the state-of-the-art technique in speech recognition, whose data is also time-series. Therefore, we conjecture that the use of RNN can improve...
Online compression of I/O-data streams in general purpose computing will enhance the effective I/O bandwidth of processors, the bandwidth of the computer network as well as the storage capacity and the read/write performance of the storage. In this paper, a self-adaptive dynamic partial reconfigurable architecture for the online compression of data streams is introduced. The proposed architecture...
Infrastructure as a service (IaaS) is a form of cloud computing which converts physical machines (PMs) to various types of resources by virtualization technology, and delivers these resources to customers on demand. However, different customer requirements complicate the usage of resources. In this paper, we employ a high-level formalism, Performance Evaluation Process Algebra (PEPA),...
The video grid with hybrid architecture provides video services by using client resources, so that the streaming path of video data is shortened and the service ability of the system is enhanced. However, the service ability of the system is hard to determine, because the service ability provided by clients is not easily calculated when these clients are highly heterogeneous and dynamic. Aiming at...
A critical issue in mobile picocellular network configurations at 60 GHz is the inefficiency of the classical handoff process, due to restrictions imposed by the small cells' topology, which leads to very frequent handoffs, as well as by the limited overlapping areas between adjacent cells. In this paper we focus on Radio-over-Fiber (RoF) network architectures at the 60 GHz band, which recently attracts high...
We analyze the impact of node architecture flexibility on the number of line interfaces required, for multi-period planning with and without traffic churn. The line interface savings from enforcing hitless traffic re-grooming are highlighted.
Performance modelling is a useful tool in the lifecycle of high performance scientific software, such as weather and climate models, especially as a means of ensuring efficient use of available computing resources. In particular, sufficiently accurate performance prediction could reduce the effort and experimental computer time required when porting and optimising a climate model to a new machine.
We consider the problem of how to enable computer architects and algorithm designers to reason directly and analytically about the relationship between high-level architectural features and algorithm characteristics. We propose a modeling framework designed to help understand the long-term and high-level impacts of algorithmic and technology trends. This model connects abstract communication complexity...
The lattice Boltzmann method is increasingly important in facilitating large-scale fluid dynamics simulations. To date, these simulations have been built on discretized velocity models of up to 27 neighbors. Recent work has shown that higher order approximations of the continuum Boltzmann equation enable not only recovery of the Navier-Stokes hydrodynamics, but also simulations for a wider range...
Sustaining memory locality is critical for obtaining high performance in NUMA systems. However, how to identify a locality leakage problem and how to measure the leakage is still an open issue. This paper provides an algorithm to quantitatively measure the locality leakage based on the memory trace produced by IBS (Instruction-Based Sampling). A "perfect matrix"...
Reliability, efficiency (in terms of time consumption) and effectiveness in resource utilization are the desired quality attributes of a Cloud Scheduling System, the main purpose of which is to execute jobs optimally, i.e. with minimum average waiting, turnaround and response time. Replication provides improved availability, decreased bandwidth use, increased fault tolerance, and improved scalability...
A traditional DBT system is hard to accelerate by introducing a customized processor core because the startup overhead is hard to eliminate. In this paper, we concentrate on how to choose a suitable layout of the DBT core in a dual-core system. We analyze the tradeoff between the frequency and the memory bandwidth of the DBT core through an analytical model, and simulate 4 different common layouts...
A key driver for the evolution of Future Media Networks (FMNs) is the emergence of beyond High Definition (HD) media formats. These formats impose far greater demands on networks for high-capacity, low latency and stringent Quality-of-Service (QoS) compared to other existing formats. In addition, their data-intensiveness will require real-time interconnection of multiple, possibly distributed, high...
Algorithms for biological sequence analysis, such as approximate string matching or algorithms for identification of sequence patterns supporting specific structural elements, present good opportunities for hardware acceleration. Implementation of these algorithms often results in architectures based on multidimensional arrays of computing elements. Effectively mapping these computational structures...