The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
To try to decrease the preference of the attribute values for information gain and information gain ratio, in the paper, the authors puts forward a improved algorithm of C4.5 decision tree on the selection classification attribute. The basic thought of the algorithm is as follows: Firstly, computing the information gain of selection classification attribute, and then get an attribute of the information...
A standard procedure for evaluating the performance of classification algorithms is k-fold cross validation. Since the training sets for any pair of iterations in k-fold cross validation are overlapping when the number of folds is larger than two, the resulting accuracy estimates are considered to be dependent. In this paper, the overlapping of training sets is shown to be irrelevant in determining...
This paper aims to develop a framework for vehicle type classification using convolutional neural network based on vehicle rear view images. Compared with the extraction of the appearance features from vehicle side view and frontal view images, there has been relatively little research on vehicle type classification by using vehicle rear view images' information. The vehicle rear view images are detected...
By analyzing the disadvantages of the traditional KNN using lazy learning that directly classify the data based on the K neighboring classes using the majority voting method, a new Sigmoid weighted classification algorithm WKS (Weighted KNN Based On Sigmoid) was proposed. WKS provides a new method for learning and training, since each training data di ∊ D contributes to the correct classification...
Many early stage lung cancer patients have resectable tumors, however, their cardiopulmonary function needs to be properly evaluated before they are deemed operative candidates. Such patients are typically asked to undergo standard pulmonary function tests, including cardiopulmonary exercise tests (CPET) or stair climbs. The standard tests are conducted only at selected healthcare provider locations,...
In semi administered bunching is one of the vital errands and goes for gathering the information objects into classes (groups) to such an extent that the similitude of items inside bunches is high and the comparability of articles between bunches is Less. The dataset once in a while might be in blended nature that is it might comprise of both numeric and unmitigated sort of information. So two types...
Link prediction plays an important role in complex network analysis. It is to predict the existence of an unknown link or a future link in a network. Classical methods for link prediction evaluate the similarity of vertices based on common neighbors, and denote that every common neighbor makes equal contribution to the connection likelihood. However, common neighbors may play different roles depending...
The interoperability testing of CTCS-3 Level Train Control System guarantees the safe operation of train running on different lines. It makes great sense to achieve automatic analysis of interoperability testing results, which could improve the efficiency and accuracy of testing. In this paper, a research was conducted on automatic analysis of testing results for on-board equipment of train control...
Roller element bearing fault diagnosis is crucial in industry to maintain that the machine is in good condition so that there is no delay of work due to machine breakdown. This paper discusses the use of Extreme Learning Machine (ELM) algorithm to classify bearing faults. The performance of ELM is compared with Back Propagation (BP) algorithm. It was found that the results show that the ELM has smaller...
In this work, we present a performance comparison of the Multi Layer Perceptron (MLP), Support Vector Machines (SVM) and Voted Perceptron (VP) when applied to a social signal processing task. The signal processing task is in the field of computational politics where the aim is to predict the political parties of American congress members based on their response to certain questions. Using this dataset...
The aim of this paper is to investigate the use of oculography signals for the recognition of experts in visual arts. We focused our attention on the number of sight transitions between characteristic image areas (ROIs). In the experiments we used oculographic data recorded at the Department of Experimental Psychology at the Catholic University of Lublin for 29 images and 34 users. The EM method was...
Recently, multi-label classification has gained prime importance among the classification problems. The applications of classification problems has increased so rapidly that the need for efficient and accurate classifiers has become a vital requirement in the area of data mining. Multi-label classification problem is distinguished from the single label classification because of the capability to handle...
Graduate employability is an increasingly major concern for academic institutions and assessing student employability provides a way of linking student skills and employer business requirements. Enhancing student assessment methods for employability can improve their understanding about companies in order to get suitable company for them. So, enhanced employability prediction of student can help them...
In the Linked Data context, identity link is one of the most important semantic links that can be established between the datasets. It specifies that different identifiers refer to the same real world object and therefore must be linked. The process of detecting these identical instances across different data repositories is referred as instance matching. This is used to connect existing data sources...
The success of machine learning (ML) algorithms depends on the quality of data given to them. If the input data contains insufficient or irrelevant features, the accuracy of machine learning algorithm decreases. Attribute selection has a key role in creation of classification models. Based on the ‘logic behind the inference’ principle in the Nyaya school of thought, this paper proposes a new method...
Hadith is the second source of Islamic law after Qur'an and an explanation of verses of the Qur'an. Today, there are many hadiths that appear and that are doubtful of its authenticity. The number of hadits that are doubtful of its authenticity or so-called dhaif and maudhu hadith can lead to errors in the determination of Islamic law for everyday life. The classification of hadith is required to know...
Case-Based Reasoning also known as CBR model has been widely used to solve the problem in various cases. This study aims to explain the implementation of K-Nearest Neighbor Algorithm in Case-Based Reasoning model. The research showed that KNN algorithm is suitable to be used in CBR model. The results of this study are to measure the accuracy level of automatic answer identity formation and search...
Credit scoring is explored to assess default risk of consumer behaviors for financial institutions, banks in particular. The advanced Bayesian algorithm is proposed for credit assessment. The new trial ensembles logistic regression analysis (LRA), cluster and MLP-NN in Bayesian approach as an advanced classifier. The investigation contain evidence that Bayesian ensemble technique optimizes LRA, cluster...
Image annotation is always an easy task for humans but a tough task for machines. Inspired by human's thinking mode, there is an assumption that the computer has double systems. Each of the systems can handle the task individually and in parallel. In this paper, we introduce a new hierarchical model for image annotation, based on constructing a novel, hierarchical tree, which consists of exploring...
Tuberculosis is one of the top ten causes of death worldwide. Although this disease is curable and preventable, yet many new tuberculosis cases still occur especially in developing countries. Many low-income families cannot afford the medical diagnosis for tuberculosis. Therefore, this paper proposes an initial screening for tuberculosis infection using a data mining approach. In this paper, the initial...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.