The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper we present MLTBiqCrunch, a hierarchically parallelized version of the open-source solver BiqCrunch [1]. More precisely, this version has two levels of parallelization: a coarse grain, assigning a thread to a node evaluation and a fine grain, parallelizing a node evaluation when some threads are not busy. We present experiments on some classical binary quadratic optimization problems...
FPGAs are becoming an attractive choice as a heterogeneous computing unit for scientific computing because FPGA vendors are adding floating-point-optimized architectures to their product lines. Additionally, high-level synthesis (HLS) tools such as Altera OpenCL SDK are emerging, which could potentially break the FPGA programming wall and provide a streamlined flow for domain experts in scientific...
Field-Programmable Gate Arrays (FPGAs) are gaining considerable momentum in mainstream high-performance systems in recent years due to their flexibility and low power consumption. Still, FPGAs remain largely unavailable to software programmers due to programming and debugging difficulties that are inherent to standard Hardware Description Languages. The performance that hardware-oblivious software...
A heterogeneous memory system (HMS) consists of multiple memory components with different properties. GPU is a representative architecture with HMS. It is challenging to decide optimal placement of data objects on HMS because of the large exploration space and complicated memory hierarchy on HMS. In this paper, we introduce performance modeling techniques to predict performance of various data placements...
It is non-trivial to optimise computations of chaotic systems since slightly perturbed simulations diverge exponentially over time due to the well-known butterfly effect if bit-reproducible results are not achieved. Therefore, two model setups that show the same quality in the representation of a chaotic system will show uncorrelated behaviour if integrated long enough, hence it is challenging to...
Due to energy efficiency, heterogeneous computing is gaining more and more attention. Since FPGA implementations are time consuming, high-level synthesis (HLS) is used to close the productivity gap. OpenCL has become accepted as a good programming model for HLS, due to its portability, good capability of design verification and rich instruction set. This work implements different optimization strategies...
Field-Programmable Gate Arrays (FPGAs) are gaining considerable momentum in mainstream high-performance systems in recent years due to their flexibility and low power consumption. Still, FPGAs remain largely unavailable to software programmers due to programming and debugging difficulties that are inherent to standard Hardware Description Languages. The performance that hardware-oblivious software...
The cost of maintaining an application code would significantly increase if the application code is branched into multiple versions, each of which is optimized for a different architecture. In this work, default and vector versions of a realworld application code are refactored to be a single version, and the differences between the versions are expressed as userdefined code transformations. As a...
Ultrasound imaging is a real-time and high frame rate modality suitable for the analysis of tendon dynamics, e.g. for diagnosis of carpal tunnel syndrome. Tendon displacement quantification algorithms based on speckle tracking are sensitive to underestimation due to stationary clutter present in the tendon region. In this study we propose an improved speckle tracking method based on Singular Value...
A novel projection twin support vector machine (PTSVM), termed as NPTSVM, is presented in this paper for binary classification. Although this method determines two projection vectors using the same way as PTSVM, it has more advantages than existing PTSVMs. First, NPTSVM does not have to calculate inverse matrices during the learning process, which makes the training speed of NPTSVM be much faster...
Multipath is known to be one of the dominant error sources in high accuracy positioning systems, and multipath estimation is crucial for multipath mitigation. Most existing multipath estimation algorithms usually consider the cases of single mutlipath with Gaussian noise. However, non-Gaussian noises and two-multipath are often encountered in many practical environments. In this paper, a new algorithm...
Experiment-based black-box optimization is now under active research, because of its usefulness in a wide range of fields. Among related studies, some proposed an iterative algorithm for optimizing environmentally adaptive control policies using response surface method, which is expected to be useful for the applications in, e.g. mobile robots. A metric named unbiased expected improvement is the key...
This paper deals with the evaluation of FPGAs resurgence for hardware acceleration applied to computed tomography on the back-projection operator used in iterative reconstruction algorithms. We focus our attention on the tools developed by FPGAs manufacturers, in particular the Intel FPGA SDK for OpenCL, that promises a new level of hardware abstraction from the developer's perspective, allowing a...
In the paper we consider a problem of Hammerstein-Wiener (N-L-N) system identification in the presence of random input and random noise. The proposed strategy combines both parametric (e.g. least squares) and nonparametric (kernel) estimates (cf. [6]). First, the impulse response of the linear block, and the composition of two nonlinear characteristics are identified independently. Next, the nonlinear...
Due to their flexibility and high performance, Coarse Grained Reconfigurable Array (CGRA) are a topic of increasing research interest. However, CGRAs also have the potential to achieve very high energy efficiency in comparison to other reconfigurable architectures when hardware optimizations are applied. Some of these optimizations are common for more traditional processors but can also lead to large...
One of the key aspects in the successful use of kernel methods such as Support Vector Machines is the proper choice of the kernel function. While there are several well known kernel functions which can produce satisfactory results for various applications (e.g. RBF), they do not take into account specific characteristics of the data sets. Moreover, they have a set of parameters to be tuned. In this...
Kernel Samepage Merging (KSM) is a Linux kernel module for improving memory utilization by searching and merging the redundant memory pages. When working with the hypervisors, such as Kernel-based Virtual Machine, KSM helps share identical memory pages of the hosted virtual servers so as to increase the server density. Nevertheless, while KSM improves the efficiency of the host system, it hurts the...
Nowadays, Peer-to-Peer computing technology (P2P) is widely used on Internet, which has brought great challenges to effective management of the network. As a result, it is very important to recognize P2P applications as to maintain network. In essence, to identify traffic of P2P is a problem belongs to pattern recognition. As one of the optimal classifiers, support vector machine (SVM) has special...
Fuzzy clustering has emerged as an important tool for discovering the structure of data. Kernel based clustering has emerged as an interesting and quite visible alternative in fuzzy clustering. Aimed at the problems of both a local optimum and depending on initialization strongly in the fuzzy c-means clustering algorithm (FCM), a method of kernel-based fuzzy c-means clustering based on fruit fly algorithms...
Low-rank sparse tensor factorization is a populartool for analyzing multi-way data and is used in domainssuch as recommender systems, precision healthcare, and cybersecurity.Imposing constraints on a factorization, such asnon-negativity or sparsity, is a natural way of encoding priorknowledge of the multi-way data. While constrained factorizationsare useful for practitioners, they can greatly increasefactorization...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.