The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Nowadays, a hot challenge for supermarket chains is to offer personalized services to their customers. Market basket prediction, i.e., supplying the customer a shopping list for the next purchase according to her current needs, is one of these services. Current approaches are not capable of capturing at the same time the different factors influencing the customer's decision process: co-occurrence,...
Thanks to rapidly evolving sequencing techniques, the amount of genomic data at our disposal is growing increasingly large. Determining the gene structure is a fundamental requirement to effectively interpret gene function and regulation. An important part in that determination process is the identification of translation initiation sites. In this paper, we propose a novel approach for automatic prediction...
Given an heterogeneous social network, can we forecast its future? Can we predict who will start using a given hashtag on twitter? Can we leverage side information, such as who retweets or follows whom, to improve our membership forecasts? We present TensorCast, a novel method that forecasts time-evolving networks more accurately than current state of the art methods by incorporating multiple data...
Osteoarthritis (OA) is the most common type of arthritis. Analgesics are widely used in the process of the treatment of arthritis. Analgesics are particularly used by OA patients which may increase the risk of cardiovascular disease by 20% to 50% overall. In this study, we proposed an interpretable model to predict side effects of analgesics on cardiovascular disease for OA patients. One task of our...
Drug abuse and addiction is a growing epidemic at the forefront of public health. Within this remit, the illicit use of opioid analgesics alone has emerged as one of the fastest growing forms of drug abuse in the U.S. and the death rate from this epidemic are drawing comparison to the US AIDS epidemic. Traditional methods of epidemiology based on explicit reporting of indicator-based data from patient...
It is found that the GARCH (1,1) model has a good fitting effect on the time series by the statistical analysis of the logarithmic yield of the closing price of the Shanghai Composite Index. Therefore, this paper first uses the model GARCH (1,1) predicts the daily closing price of the Shanghai Composite Index, and then uses the Fourier series to correct the predicted residuals to obtain the final...
Networks are models representing relationships between entities. Often these relationships are explicitly given, or we must learn a representation which generalizes and predicts observed behavior in underlying individual data (e.g. attributes or labels). Whether given or inferred, choosing the best representation affects subsequent tasks and questions on the network. This work focuses on model selection...
Reliable uncertainty estimation for time series prediction is critical in many fields, including physics, biology, and manufacturing. At Uber, probabilistic time series forecasting is used for robust prediction of number of trips during special events, driver incentive allocation, as well as real-time anomaly detection across millions of metrics. Classical time series models are often used in conjunction...
Collecting labeling information of time-to-event analysis is naturally very time consuming, i.e., one has to wait for the occurrence of the event of interest, which may not always be observed for every instance. By taking advantage of censored instances, survival analysis methods internally consider more samples than standard regression methods, which partially alleviates this data insufficiency problem...
Ecological inference (EI) is a classical problem from political science to model voting behavior of individuals given only aggregate election results. Flaxman et al. recently formulated EI as machine learning problem using distribution regression, and applied it to analyze US presidential elections. However, distribution regression unnecessarily aggregates individual-level covariates available from...
Given a collection of basic customer demographics (e.g., age and gender) andtheir behavioral data (e.g., item purchase histories), how can we predictsensitive demographics (e.g., income and occupation) that not every customermakes available?This demographics prediction problem is modeled as a classification task inwhich a customer's sensitive demographic y is predicted from his featurevector x. So...
Precipitation prediction, such as short-term rainfall prediction, is a very important problem in the field of meteorological service. In practice, most of recent studies focus on leveraging radar data or satellite images to make predictions. However, there is another scenario where a set of weather features are collected by various sensors at multiple observation sites. The observations of a site...
Due to the recent vast availability of transportation traffic data, major research efforts have been devoted to traffic prediction, which is useful in many applications such as urban planning, traffic management and navigations systems. Current prediction methods that independently train a model per traffic sensor cannot accurately predict traffic in every situation (e.g., rush hours, constructions...
Hospital readmissions within 30 days after discharge are costly and it has been a prior for researchers to identify patients at risk of early readmission. Most of the reported hospital readmission prediction models have been built with historical data and thus can outdate over time. In this work, a self-adaptive 30-day diabetic hospital readmission prediction model has been developed. A diabetic inpatient...
Current Network Function Virtualization (NFV) with Virtualized Network Functions (VNFs) running as virtual machines on commodity servers enables flexibility to Service Function Chaining (SFC). Specific applications may require Quality of Service (QoS) on end-to-end latency. However, the processing delay and the queuing delay of VNFs varies with virtual resource configurations (vCPU and vMemory), as...
Modern drug discovery organizations generate large volumes of SAR data. A promising methodology that can be used to mine this chemical data to identify novel structure-activity relationships is the matched molecular pair (MMP) methodology. However, before the full potential of the MMP methodology can be utilized, a MMP identification method that is capable of identifying all MMPs in large chemical...
Solar irradiance prediction has a significant impact on various aspects of power system generation. The predictive models can be deployed to improve the planning and operation of renewable systems and can improve the power purchase process and bring several advantages to the power utilities. The irradiance is affected by several factors, such as clouds and dust, and it becomes challenging for physical...
A model control and a filter suppressing oscillation of the output voltage that can obtain the stable output voltage of a dc-dc converter have been proposed previous. Although there is a problem that the condition on the input side of dc-dc converter is changed when it connects the load equipment to renewable energy generators through the DC bus, since the renewable energy generators are sometimes...
Aerosol optical depth (AOD), one of the key factors affecting the atmosphere visibility, has great influence on the prediction of radiation intensity and photovoltaic power generation. Considering the problem that AOD is difficult to obtain real-timely and conveniently with high accuracy, in this paper, PM2.5 concentration, PM10 concentration and temperature, wind speed grade and relative humidity...
Model-based software estimation uses algorithms and past project data to make predictions for new projects. This paper presents a comparative assessment of four modeling approaches, including the original COCOMO, COCOMO calibration, k-Nearest Neighbors, and a combination of COCOMO calibration and k-Nearest Neighbors. Our results indicate that using kNN to select the nearest projects and calibrating...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.