A significant part of our knowledge consists of relationships between two terms. However, most of this information is documented as unstructured text in various forms, such as books, online articles and webpages. Extracting this information and storing it in a structured database could help people use it more conveniently. In this study, we proposed a novel approach to extract the relationships...
The paper examines the behavior of Decision Tree (DT) algorithms on a large database with many cases and many attributes: Forest Covertype (FC) from the UCI Knowledge Discovery in Databases Archive. The classification experiments took into account 22 splitting criteria and two pruning methods, whose performance was presented in terms of classification error rate on test data, data...
Recruitment and selection of new employees rank among the important processes of human potential management and development. In particular, the process of employee selection prepares proper conditions for successful work performance and determines the future ability of organizations to progress. In the unique sector of private security, the precise realization of employee selection can solve one of the most...
Information systems are widespread in most official institutions and have become established in all areas of our life, such as education, health and entertainment. Usability is one of the most important factors that encourage users to engage with these systems or to reject them. Data mining is the process of finding correlations or patterns among dozens of fields in large relational databases. In this...
Oriented graphs belong to Graph Theory, a part of Combinatorics within Mathematics. One of the fundamental concepts here is the tree. Tree structures have widespread use, and not only in Mathematics. They can also be used in Decision Theory as data mining tools. In the present paper we point out the use of decision trees as models for financial services, namely for credit scoring, fraud and churn...
This paper focuses on the use of Matlab for data mining. There is a wide range of data mining software, among which free or cheaper solutions offer similar possibilities. We wanted to try Matlab for these purposes. Our data consist of parameters that describe cloud usage at an IT company that offers cloud services. We used phases from the CRISP-DM methodology in our work. We built clustering and classification...
The Consumer Financial Protection Bureau was established in the USA to enable US consumers to report customer support and complaint-related information regarding their financial issues to the US government. The complaint data is freely available for analysis and for tracking how efficiently and effectively financial institutions handle the complaints lodged against them. Each complaint consists...
Data mining is commonly used in the healthcare industry, and managing an Intensive Care Unit (ICU) is no exception. This study aims to examine how data mining techniques can be employed to predict mortality and length of stay in an ICU and to evaluate various classification techniques. Real-life healthcare datasets, like MIMIC 2, incorporate an unbalanced distribution of sample sizes, which means that...
Rapidly growing Internet technology may lead to cybercrimes committed by attackers. Different types of digital devices are being used to commit attacks. To detect such criminal activity, a forensic investigator has to use various data recovery methods and a practical framework. Various types of forensic tool kit (FTK), freeware software, techniques and tools are available for file forensic investigation...
Data mining is the procedure of analyzing data from different perspectives and summarizing it into useful information. It is very important in the field of object classification. It has been successfully applied in expert systems to acquire knowledge. We can determine the appropriate classification of unknown objects according to decision tree rules by applying inductive methods to the given values of...
This paper examined data on students' history of accessing the university Learning Management System (LMS). Classification techniques are used to build an educational model based on Knowledge Discovery in Databases (KDD) to predict learners' behavior. It identified the most valuable influencer of the learners' learning outcomes; it generated prediction models using the J48 decision tree algorithm...
In this paper, a brief survey of data mining classification using machine learning techniques is presented. Machine learning techniques such as the decision tree and the support vector machine (SVM) play an important role in all applications of artificial intelligence. The decision tree works efficiently with discrete data, and the SVM is capable of building nonlinear boundaries among the classes. Both...
Leptospirosis is a disease that affects mainly low-income populations, with an incidence of 500,000 cases per year worldwide[1]. The disease has symptoms often confused with other febrile syndromes, such as dengue, influenza and viral hepatitis. Improved diagnosis of patients with leptospirosis is very important for health professionals, epidemiological surveillance and primarily for rapid evaluation...
This study focuses on mosquito-borne diseases including dengue-1, dengue-4, yellow fever, West Nile virus infection, and filariasis. These diseases typically occur on the African continent and in Eastern Asia, which are the places that suffer from poverty the most. Vaccines for some of the diseases have already been made, but those who live in these areas do not have the ability...
Often, an evaluation of a classifier is performed without deeper analysis. In this paper we decided to perform a more rigorous evaluation. We present an evaluation of various classifier methods over biomedical data, with an orientation towards nature-inspired methods. We have performed an experimental assessment of various traditional and nature-inspired methods (41 distinct classifiers) over the total...
Computers and an internet connection are now common equipment in households. However, computer games have become an obstacle in education because pupils spend the time needed for education and relaxation playing games. One possible solution to this problem is the creation of attractive educational programs in the form of computer games. Such programs will be attractive for children and will introduce them...
Learning is an important activity for learners. Every learner must learn, but how to learn with the most effective outcome is still an open question. Many theories about learning styles have been created, for example Kolb's Learning Styles, VARK Learning Styles and the Index of Learning Styles (ILS). This paper has adapted ILS to the e-Learning method because e-Learning is an efficient technology that particularly...
The World Wide Web is a huge data and information repository that contains data in different formats and with different aspects. Users employ search engines to find data according to their own needs. But during a search, the search engines return a significant number of results. Thus, listing the results according to the user's needs is required. Therefore the search engines utilize the page...
Cardiovascular disease (CVD) is a major cause of morbidity and mortality in the current lifestyle. Identification of cardiovascular disease is an important but complex task that needs to be performed very carefully and efficiently, and correct automation would be very desirable. Not every human being can be equally skillful, and the same holds for doctors. All doctors cannot be equally skilled in every sub specialty...
Dropout rates for students in correspondence and open courses are on the increase. There is a need to analyze the factors causing the increase in the dropout rate. The discovery of hidden knowledge from the educational data system through data mining, applied to analyze the factors affecting student dropout, can lead to better academic planning and management to reduce student dropout from...