The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Many fault-proneness prediction models have been proposed in literature to identify fault-prone code in software systems. Most of the approaches use fault data history and supervised learning algorithms to build these models. However, since fault data history is not always available, some approaches also suggest using semi-supervised or unsupervised fault-proneness prediction models. The HySOM model,...
Researchers often focus on the development process and the final product (source code) to investigate and predict software defects. Unfortunately, these models may not be applicable to software projects in which there is no access to the data sources regarding development process. For example, in cases when a company conducts tests on behalf of its business contractors, it is only possible to evaluate...
Because of the volatility of memory, nodes in in-memory storage system crashing down would lead to data lost. One solution to this problem is backing data up. However, if we backup data to a node which is about to fail down, the data should be recopied again. That would lead to a large amount of backup data, and in turn reduce the system reliability. We first establish a correlated failure model with...
The Arbitrary Lagrangian-Eulerian (ALE) method is used in a variety of engineering and scientific applications for enabling multi-physics simulations. Unfortunately, the ALE method can suffer from simulation failures that require users to adjust parameters iteratively in order to complete a simulation. In this paper, we present a supervised learning framework for predicting conditions leading to simulation...
Active Learning (AL) is a methodology from machine learning in which the learner interacts with the data source. In this paper, we investigate application of AL techniques to a new domain: regression problems in performance analysis. For computational systems with many factors, each of which can take on many levels, fixed experiment designs can require many experiments, and can explore the problem...
The pay-as-you-go pricing model and the illusion of unlimited resources in the Cloud initiate the idea to provision services elastically. Elastic provisioning of services allocates/de-allocates resources dynamically in response to the changes of the workload. It minimizes the service provisioning cost while maintaining the desired service level objectives (SLOs). Model-predictive control is often...
A considerable amount of energy efficient routing algorithms have been proposed to save energy and prolong network lifetime. Those algorithms mainly focus on forwarding packets along the minimum energy path to the sink to merely minimize energy consumption, which causes an unbalanced distribution of residual energy among sensor nodes, and eventually results in a network partition. In this paper, we...
Faulty modules of any software can be problematic in terms of accuracy, hence may encounter more costly redevelopment efforts in later phases. These problems could be addressed by incorporating the ability of accurate prediction of fault prone modules in the development process. Such ability of the software enables developers to reduce the faults in the whole life cycle of software development, at...
An algorithm for predicting the quality of video received by a client from a shared server is presented. A statistical model for this client-server system, in the presence of other clients, is proposed. Our contribution is that we explicitly account for the interfering clients, namely the load. Once the load on the system is understood, accurate client-server predictions are possible with an accuracy...
With an ever-increasing amount of information made available via the Internet, it is getting more and more difficult to find the relevant pieces of information. Recommender systems have thus become an essential part of information technology. Although a lot of research has been devoted to this area, the factors influencing the quality of recommendations are not completely understood. This paper examines...
Short-term traffic prediction provides tools for improved road management by allowing the reduction of delays, incidents and other unexpected events. Different real-time approaches provide traffic managers with varying but valuable information. This paper reviews the literature regarding model-driven and data-driven approaches focusing on short-term realtime traffic prediction. We start by analyzing...
Different ranking algorithms have been proposed to fulfil the need of ranking. The problem is that most of the existing algorithms and models are just applicable on a specific data. When the data is imbalanced and heterogeneous, finding the records belonging to the minority class is significant especially in failure cases. So considering ranking as a classification problem of predicting the specific...
When using electronic health record (EHR) data to build models for predicting adverse drug effects (ADEs), one is typically facing the problem of data sparsity, i.e., Drugs and diagnosis codes that could be used for predicting a certain ADE are absent for most observations. For such tasks, the ability to effectively handle sparsity by the employed machine learning technique is crucial. The state-of-the-art...
Managing change in the early stages of a software development life cycle is an effective strategy for developing a good quality software at low costs. In order to manage change, we use software quality models which can efficiently predict change prone classes and hence guide developers in appropriate distribution of limited resources. This study examines the effectiveness of ten machine learning algorithms...
In the era of big data, a large amount of noisy and incomplete data can be collected from multiple sources for prediction tasks. Combining multiple models or data sources helps to counteract the effects of low data quality and the bias of any single model or data source, and thus can improve the robustness and the performance of predictive models. Out of privacy, storage and bandwidth considerations,...
High dimensionality and class imbalance are the two main problems affecting many software defect prediction. In this paper, we propose a new technique, named SelectRUSBoost, which is a form of ensemble learning that in-corporates data sampling to alleviate class imbalance and feature selection to resolve high dimensionality. To evaluate the effectiveness of the new technique, we apply it to a group...
This paper proposes a selection scheme (S-scheme) between neural-network (NN) and multiple-regression (MR) outputs of a virtual metrology system (VMS). Both NN and MR are applicable algorithms for implementing VM conjecture models. But a MR algorithm may achieve better accuracy only with a stable process, whereas a NN algorithm may has superior accuracy when equipment property drift or shift occurs...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.