The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
DDoS attacks bring huge threaten to network, how to effectively detect DDoS is a hot topic of information security. Currently, there are some methods designed to detect DDoS attacks, but the detection rate of them is low. Moreover, DDoS detection is easily misled by flash crowd traffic. In this paper, a new method to detect DDoS attacks based on RDF-SVM algorithm is proposed. By considering the importance...
With the development of the aviation industry and the improvement of people's living standard, more and more people choose aircraft as their way of travel, but the airline adjusts the price according to the revenue management in real time. The purpose of this paper is to design different decision-making tools from the customer's perspective, and to provide customers with the relevant information needed...
In semi administered bunching is one of the vital errands and goes for gathering the information objects into classes (groups) to such an extent that the similitude of items inside bunches is high and the comparability of articles between bunches is Less. The dataset once in a while might be in blended nature that is it might comprise of both numeric and unmitigated sort of information. So two types...
This paper presents IC realization of a random forest (RF) machine learning classifier. Algorithm-architecture-circuit is co-optimized to minimize the energy-delay product (EDP). Deterministic subsampling (DSS) and balanced decision trees result in reduced interconnect complexity and avoid irregular memory accesses. Low-swing analog in-memory computations embedded in a standard 6T SRAM enable massively...
The ubiquitous growth of Internet of Things (IoT) and its medical applications has improved the effectiveness in remote health monitoring systems of elderly people or patients who need long-term personal care. Nowadays, chronic illnesses, such as, stroke, heart disease, diabetes, cancer, chronic respiratory diseases are major causes of death, in many parts of the world. In this paper, we propose a...
Due to the imbalanced distribution of business data, missing of user features and many other reasons, directly using big data techniques on realistic business data tends to deviate from the business goals. It is difficult to model the insurance business data by classification algorithms like Logistic Regression and SVM etc. This paper exploits a heuristic bootstrap sampling approach combined with...
Thanks to the ubiquitous computing, the scale of data collection in various fields has been growing rapidly. Medicine is one of the fields that can benefit from the big data. However, it is faced with a big challenge because medical datasets are normally in high dimensions. Therefore, reducing dimensionality and finding the optimal set of features or attributes is of great importance. This paper presents...
The basic idea behind the classifier ensembles is to use more than one classifier by expecting to improve the overall accuracy. It is known that the classifier ensembles boost the overall classification performance by depending on two factors namely, individual success of the base learners and diversity. One way of providing diversity is to use the same or different type of base learners. When the...
In the world today, the security of the computer system is of great importance, And in the last few years, there have seen an affected growth in the amount of intrusions that intrusion detection has become the dominant of current information security. Firewalls cannot provide complete protection. Applying on a firewall system alone is not enough to prevent a corporate network from all types of network...
The second largest cause of death in Palestine is Cancer at a rate 12.4% of all deaths. Predicting the survivability of a disease is one of the most interesting purposes of developing a medical data mining applications. This paper applies two classification models (Rule Induction and Random Forest) on the Gaza Strip 2011 cancer patient's dataset, to predict the survivability of cancer patients. The...
An ensemble consists of a set of individually trained classifiers (e.g., as neural networks or decision trees etc.) whose predictions are combined in some manner (e.g., averaging or voting etc.) to form the final prediction. In literature many previous study has shown that an ensemble is often more accurate than any of the single classifiers. Ensemble learning is primarily used to improve the (classification,...
For the modeling problem of microbial fermentation process, taking glutamic acid fermentation process as the research object, the decision tree and the random forest model were established by using the data mining method, and the model was evaluated and predicted by using the R language. Good effect of the decision tree model, indicating that the decision tree package of R language has a certain flexibility,...
Feature extracting and screening get more important and necessary because of data analysis will become very slow and difficult with the increasing of data dimension. To reduce the dimension of features, we propose a new way of feature screening in this paper. The improved clustering algorithm is employed to screen the features preliminarily, and then the genetic algorithm synergistically combined...
In this study, Sentinel-1A SAR imagery for land use/cover classification and its impacts on classification algorithms were addressed. Sentinel-1A imagery has dual polarization (VV and VH) and freely available from ESA. Istanbul was selected as the study region. After the pre-processing steps including the applying the precise orbit file, calibration, multilooking, speckle filtering and terrain correction,...
Titanic disaster occurred 100 years ago on April 15, 1912, killing about 1500 passengers and crew members. The fateful incident still compel the researchers and analysts to understand what can have led to the survival of some passengers and demise of the others. With the use of machine learning methods and a dataset consisting of 891 rows in the train set and 418 rows in the test set, the research...
With the increase in man to machine interaction, speech analysis has become an integral part in reducing the gap between physical and digital world. An important subfield within this domain is the recognition of emotion in speech signals, which was traditionally studied in linguistics and psychology. Speech emotion recognition is a field having diverse applications. The prime objective of this paper...
In order to improve the efficiency and adaptability of classical random forest algorithm in large data environment, an improved random forest algorithm based on Spark is proposed. Firstly, an improved random forest algorithm (FRF) based on the Fayyad boundary point principle is proposed to deal with the shortcomings of classical random forest algorithm in the process of discretization of continuous...
In recent years, type II diabetes has become a serious disease that threaten the health and mind of human. Efficient predictive modeling is required for medical researchers and practitioners. This study proposes a type II diabetes prediction model based on random forest which aims at analyzing some readily available indicators (age, weight, waist, hip, etc.) effects on diabetes and discovering some...
Due to the imbalanced distribution of business data, missing user features, and many other reasons, directly using big data techniques on realistic business data tends to deviate from the business goals. It is difficult to model the insurance business data by classification algorithms, such as logistic regression and support vector machine (SVM). In this paper, we exploit a heuristic bootstrap sampling...
This paper presents the performance of classification algorithms that have been popular in general also include Neural Network, Naïve Bayes and Decision Tree is working with real data in real work situations for selection algorithm that is suitable to develop an algorithm to create an effective data classification having more precise and can be explained the model from work to understand. The application...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.