The National Collision Database (NCDB) provided by Transport Canada contains information about individuals who have been in traffic accidents. A thorough analysis of this data was performed in order to isolate and identify potentially dangerous traffic scenarios that could result in injuries or even death. The purpose of our study was to identify any recurring patterns of injuries or fatalities, and...
Clinical practice calls for reliable diagnosis and optimized treatment. However, human errors in health care remain a severe issue even in industrialized countries. The application of clinical decision support systems (CDSS) casts light on this problem. Despite the great improvement in CDSS over the past several years, challenges to their wide-scale application are still present, including:...
The aim of this article is to describe the design, implementation, and evaluation of an educational application that supports the learning of data mining algorithms. The role of the application is to help students better understand algorithms such as the Naive Bayes classifier, decision trees, and association rules. The application also includes a test area that allows students to generate and solve different...
On an average day in the United States, thousands of commercial flights make their way through more than 5,000,000 square miles of U.S. airspace. People rely on these flights for business and pleasure. Flying can be a very frustrating experience. Mining past flight data can help consumers make informed decisions about many aspects of flying, such as when the best days to fly are and out of...
Machine learning is a subdivision of Artificial Intelligence (AI) that is concerned with the design and development of intelligent algorithms that enable machines to learn from data without being explicitly programmed. Machine learning mainly focuses on how to automatically recognize complex patterns in data and make intelligent decisions. In this paper, intelligent machine learning algorithms are used to...
The article describes the possibilities for parallelizing decision tree algorithms and implementing them in programming languages that use features of the functional programming paradigm. These features simplify the transformation of a sequential algorithm into its parallel form. As an example, we consider the C4.5 algorithm and a comparison of its implementation in the Java 7 and Java 8 programming languages...
In this work a decision support system (DSS) for the conversion of Unified Parkinson's Disease Rating Scale (UPDRS) motor symptoms into a Hoehn & Yahr stage representation is proposed. Accurate estimation of a Parkinson's Disease patient's Hoehn & Yahr stage is of great importance since this single value is enough to represent condition, severity of symptoms and localization and disease progression...
The decision tree is commonly applied in data mining to produce a model that predicts the value of a target (dependent) variable based on various input (independent) variables. CART algorithms are mainly used in medicine, statistics, etc. For heart disease patients, predicting a heart attack is difficult for medical practitioners, as it is a complex task that requires...
Breast cancer is one of the leading causes of death for women today, and it is the most common cancer in developed countries. The cause and degree of breast cancer are closely associated with the malfunctions of its tissues and cells. It is a very hard and rigorous task for doctors to review the clinical records of many affected patients and regulate the therapy manually. Therefore, it is...
In software engineering, information retrieval, which is also referred to as data mining, has attracted the attention of many researchers. By virtue of its definition, data mining is responsible for extracting relevant data from a large database or dataset. In this context, several techniques have been proposed in the literature. Through this paper, an attempt at a comparative analysis of various classification...
The Consumer Financial Protection Bureau was established in the USA to enable US consumers to report customer support and complaint-related information regarding their financial issues with the US government. The complaint data is freely available for analysis and for tracking how efficiently and effectively financial institutions handle the complaints lodged against them. Each complaint consists...
A general approach is proposed to determine the occupancy of a room using sensor data and knowledge coming respectively from observation and questioning. Means to estimate occupancy include motion detection, power consumption, and acoustic pressure recorded by a microphone. The proposed approach is inspired by machine learning. It starts by determining the most useful measurements...
Earthquake prediction has long been considered impossible, but recent research studies show some progress in this field by treating it as a data mining problem. There are numerous challenges in earthquake prediction, including the highly non-linear behavior of seismic activity and the unavailability of reliable seismic precursors. This work focuses on earthquake prediction in Hindukush...
Breast cancer is a major threat to middle-aged women throughout the world and is currently the second most threatening cause of cancer death in women. However, early detection and prevention can significantly reduce the chances of death. An important fact regarding breast cancer prognosis is to optimize the probability of cancer recurrence. This paper aims at finding breast cancer recurrence probability...
The recent computing trend is producing tons of data every minute, and the amount of imbalanced data is quite high as far as real-life data sets are concerned. In practical data mining, an imbalanced data set is prone to mislead a data mining model. However, the data set needs pre-processing before mining. This work focuses on some practical data mining techniques and produces a valid evaluation...
Data preprocessing is an essential and primary step in the process of knowledge discovery, because the data obtained from the logs may be incomplete, noisy, or inconsistent. The quality of the training data plays a vital role in the success of data mining algorithms; thus, data preprocessing should not be an exception in the process of knowledge discovery. The most promising attributes of the quality...
The Islamic State of Iraq and Syria (ISIS) is an extremist militant group in the Middle East known to employ social media for propaganda and recruiting purposes. In particular, the social media website Twitter is well known to be exploited by ISIS supporters. To this end, we devise an effective and scalable classification scheme to filter out ISIS propaganda accounts from the rest of the Twitter accounts...
Data mining is an emerging field used for educational purposes to improve students' understanding and learning methods. It focuses on recognizing, extracting, and calculating data associated with the learning method and on improving students' performance. Mining in a learning field is known as educational data mining, which is concerned with exploring the latest techniques to discover knowledge from educational...
A significant number of new algorithms constantly emerge as computer science and other computation-related disciplines grow in advancement and complexity. A majority of these algorithms are developed by professional researchers who publish their algorithmic advancements in scholarly articles, especially in the form of pseudo-code. The ability to automatically collect, manage, and index...
Understanding how to make education effective is a critical step in educational data mining. We have considered various socioeconomic, psychological, and academic factors to fully understand what a person's life is like during adolescence and how those factors impact their academic performance. Using pre-processing techniques such as feature selection, data balancing, discretization and normalization, and...