Search results

Items from 1 to 20 out of 52 results

chapter

Predicting IT employability using data mining techniques

Keno C. Piad, Menchita Dumlao, Melvin A. Ballera, Shaneth C. Ambat

2016 Third International Conference on Digital Information Processing, Data Mining, and Wireless Communications (DIPDMWC) > 26 - 30

2016 Third International Conference on Digital Information Processing, Data Mining, and Wireless Communications (DIPDMWC)

Researchers in higher education are beginning to explore the potential of data mining in analyzing data in order to provide quality service and meet the needs of their graduates. Thus, educational data mining has emerged as one of the tools for studying academic data to identify patterns and support decision making that affects education. This paper predicts the employability of IT graduates using nine variables...

chapter

Investigation of effect of reducing dataset's size on classification algorithms

Neelam Singhal, Mohd. Ashraf

2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom) > 2036 - 2040

2015 2nd International Conference on "Computing for Sustainable Global Development" (INDIACom)

Data mining is now one of the most active fields of research. Extracting those nuggets of information is becoming crucial, and one of its important techniques is classification. It helps to group the data into some predefined classes. Various techniques for classification exist, which classify the data using different algorithms. Each algorithm has its own area of best and worst performance. This paper...

chapter

Feature subset selection based on Filter technique

K. Fathima Bibi, M. Nazreen Banu

2015 International Conference on Computing and Communications Technologies (ICCCT) > 1 - 6

2015 International Conference on Computing and Communications Technologies (ICCCT)

From a large amount of data, significant knowledge is discovered by applying techniques in the knowledge management process, and those techniques are known as data mining techniques. For a specific domain, a form of knowledge discovery called data mining is necessary for solving problems. The classes of unknown data are detected by the technique called classification. Neural networks, rule...

chapter

Comparison of classification techniques for predicting the performance of students academic environment

M. Mayilvaganan, D. Kalpanadevi

2014 International Conference on Communication and Network Technologies > 113 - 118

2014 International Conference on Communication and Network Technologies (ICCNT)

The aim of this study is to compare some classification techniques used to predict student performance. It helps to identify slow learners who are likely to perform poorly in the semester exams, so that their skills can be improved early enough to achieve the goal at the end of the semester. The task can be processed based on several attributes to predict the performance of the student activity respectively...

chapter

Data mining approaches to predict final grade by overcoming class imbalance problem

Raisul Islam Rashu, Naheena Haq, Rashedur M Rahman

2014 17th International Conference on Computer and Information Technology (ICCIT) > 14 - 19

2014 17th International Conference on Computer and Information Technology (ICCIT)

Data mining approaches have been used for business purposes since their inception; however, at present they are used successfully in new and emerging areas like education systems. The Government of Bangladesh emphasizes the need to improve the education system. In this research, we use data mining approaches to predict students' final outcome, i.e., final grade in a particular course by overcoming the problem...

chapter

Violations detection of listed companies based on decision tree and K-nearest neighbor

Zhang Yu, Yu Guang, Jin Zi-qi

2013 International Conference on Management Science and Engineering 20th Annual Conference Proceedings > 1671 - 1676

2013 International Conference on Management Science and Engineering (ICMSE)

Violations by listed companies in disclosing accounting information seriously mislead ordinary investors and bring them huge losses. Therefore, it is particularly necessary to analyze and identify the violations of listed companies using scientific and effective methods in order to avoid investment risks in advance. In this paper, we first use the t-statistic to select eight useful and characteristic...

chapter

Study on inferring interwell connectivity of injection-production system based on decision tree

Maojun Cao, Fuhua Shang

2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 1010 - 1014

2013 10th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Interwell connectivity of an injection-production system is important information for reservoir performance analysis. It is highly significant for researching the distribution of remaining oil and adjusting the oilfield development plan. In order to change the status quo of inferring interwell connectivity in the No. 1 oil production plant of Daqing Oilfield, an automatic identification method based...

chapter

Comparison of data mining classification algorithms for breast cancer prediction

Chintan Shah, Anjali G. Jivani

2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT) > 1 - 4

2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT)

Data mining is an area of computer science with huge prospects; it is the process of discovering or extracting information from large databases or datasets. There are many different areas under data mining, and one of them is classification, or supervised learning. Classification can also be implemented through a number of different approaches or algorithms. We have conducted the comparison...

chapter

A Post-Pruning Decision Tree Algorithm Based on Bayesian

Wenchao Zhang, Yafen Li

2013 International Conference on Computational and Information Sciences > 988 - 991

2013 Fifth International Conference on Computational and Information Sciences (ICCIS)

The C4.5 Algorithm can result in a thriving decision tree and will overfit the training data while training the model. In order to overcome those disadvantages, this paper proposed a post-pruning decision tree algorithm based on Bayesian theory, in which each branch of the decision tree generated by the C4.5 algorithm is validated by Bayesian theorem, and then those branches that do not meet the conditions...

chapter

Fast Time Series Classification Based on Infrequent Shapelets

Qing He, Zhidong, Fuzhen Zhuang, Tianfeng Shang, et al.

2012 11th International Conference on Machine Learning and Applications > 1 > 215 - 219

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

Time series shapelets are small and local time series subsequences which are in some sense maximally representative of a class. E. Keogh uses the distance of the shapelet to classify objects. Even though shapelet classification can be interpretable and more accurate than many state-of-the-art classifiers, there is one main limitation of shapelets, i.e., the shapelet classification training process is offline,...

chapter

Performance Analysis between Different Decision Trees for Uncertain Data

Xiaoming Peng, Haoran Guo, Jianmin Pang

2012 International Conference on Computer Science and Service System > 574 - 577

2012 International Conference on Computer Science and Service System (CSSS)

In order to compare the classification accuracies and performance differences between traditional and probability-based decision tree classifiers, and come to understand those algorithms, which aim to improve construction efficiency of probability-based decision trees, mentioned in "Decisions Trees for Uncertain Data", this paper tested several algorithms, named AVG, UDT, UDT-BP, UDT-LP,...

chapter

A decision tree generation algorithm based on maximum similarity

Xinmeng Zhang, Shengyi Jiang

2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC) > 1032 - 1035

2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC)

Whether a node split is good or bad depends on the measure of impurity. We propose a new decision tree feature selection strategy based on maximum similarity, called fsms. First, the dataset is split into subsets according to each attribute value and the sum of the average similarity of each subset is calculated; then the attribute with the maximum similarity is selected as the best splitting attribute...

chapter

Predictive models for dengue outbreak using multiple rulebase classifiers

Azuraliza Abu Bakar, Zuriyah Kefli, Salwani Abdullah, Mazrura Sahani

Proceedings of the 2011 International Conference on Electrical Engineering and Informatics > 1 - 6

2011 International Conference on Electrical Engineering and Informatics (ICEEI)

The paper aims to develop predictive models for dengue outbreak detection using Multiple Rule Based Classifiers. The rule based classifiers used are the Decision Tree, Rough Set Classifier, Naive Bayes, and Associative Classifier. Dengue fever (DF) and dengue hemorrhagic fever (DHF) have continuously been a public health issue in Malaysia and a growing pandemic, as reported by World...

article

Cycle-Time Key Factor Identification and Prediction in Semiconductor Manufacturing Using Machine Learning and Data Mining

Yair Meidan, Boaz Lerner, Gad Rabinowitz, Michael Hassoun

IEEE Transactions on Semiconductor Manufacturing > 2011 > 24 > 2 > 237 - 248

Within the complex and competitive semiconductor manufacturing industry, lot cycle time (CT) remains one of the key performance indicators. Its reduction is of strategic importance as it contributes to cost decreasing, time-to-market shortening, faster fault detection, achieving throughput targets, and improving production-resource scheduling. To reduce CT, we suggest and investigate a data-driven...

chapter

Diversity of feature selection approaches combined with distinct classifiers

Li Feng-Chia, Wang Peng-Kai, Yeh Li-Lon

2010 IEEE International Conference on Industrial Engineering and Engineering Management > 28 - 32

2010 IEEE International Conference on Industrial Engineering & Engineering Management (IE&EM 2010)

Credit scoring has been regarded as a critical topic, and its related departments make efforts to collect huge amounts of data to avoid wrong decisions. An effective classification model will help managers objectively, instead of relying on intuitive experience. This study proposes five approaches combined with the back-propagation neural network (BPN) classifier for feature selection that retains sufficient...

chapter

Text Classification Techniques Used to Faciliate Cyber Terrorism Investigation

D A Simanjuntak, H P Ipung, Charles Lim, A S Nugroho

2010 Second International Conference on Advances in Computing, Control, and Telecommunication Technologies > 198 - 200

2010 Second International Conference on Advances in Computing, Control and Telecommunication Technologies (ACT 2010)

Rising computer violence, such as Distributed Denial of Service (DDoS), web vandalism, and cyber bullying, becomes a more serious issue when it is politically motivated and intentionally conducted to generate fear in society. These kinds of activity are categorized as cyber terrorism. As the number of such cases increases, the availability of information regarding these actions is required...

chapter

Status detection and fault diagnosing of rotatory machinery by vibration analysis using data mining

A Pourebrahimi, S Mokhtar, S Sahami, M Mahmoodi

2010 2nd International Conference on Computer Technology and Development > 131 - 135

2nd International Conference on Computer Technology and Development (ICCTD 2010)

The data generated within the construction industry has become increasingly overwhelming. Data mining technology presents an opportunity to increase significantly the rate at which the volumes of data generated through the maintenance process can be turned into useful information. This can be done using classification algorithms to discover patterns and correlations within a large volume of data....

chapter

A comparative analysis of mining techniques for automatic detection of student's learning style

Nor Bahiah Hj Ahmad, Siti Mariyam Shamsuddin

2010 10th International Conference on Intelligent Systems Design and Applications > 877 - 882

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

This paper compares the performance of several classifiers provided in WEKA, such as Bayes, decision tree and classification rules, in classifying students' learning styles. The students' preferences and behavior while using an e-learning system have been observed and analyzed, and twenty attributes have been selected to map into the Felder-Silverman learning style model. There are four learning dimensions in Felder...

chapter

An objective method to find better RBF networks in classification

Hyontai Sug

5th International Conference on Computer Sciences and Convergence Information Technology > 373 - 376

2010 5th International Conference on Computer Sciences and Convergence Information Technology (ICCIT 2010)

RBF networks are good at the prediction tasks of data mining, and the k-means clustering algorithm is one of the most widely used clustering algorithms for the basis functions of RBF networks. The k-means clustering algorithm needs the number of clusters for initialization, and the accuracy of RBF networks changes depending on the number of clusters. But we cannot resort to increasing the number of clusters in the RBF...

chapter

A Comparative Study on Data Mining Algorithms for Individual Credit Risk Evaluation

Hong Yu, Xiaolei Huang, Xiaorong Hu, Hengwen Cai

2010 International Conference on Management of e-Commerce and e-Government > 35 - 38

2010 Fourth International Conference on Management of E-Commerce and E-Government (ICMeCG 2010)

Individual credit risk evaluation is an important and challenging data mining problem in the financial analysis domain. This paper compares the effectiveness of four data mining algorithms - logistic regression (LR), decision tree (C4.5), support vector machine (SVM) and neural networks (NN) - by applying them to two credit data sets. Experimental results show that the LR and SVM algorithms produced the best...

Data set:
ieee
Keywords:
DATA MINING
ACCURACY
DECISION TREE
CLASSIFICATION ALGORITHMS

Publication type

book (50)
article (2)

Keywords

DECISION TREES (50)
CLASSIFICATION (14)
PATTERN CLASSIFICATION (13)
DATA MODELS (12)
TRAINING (12)
CLASSIFICATION TREE ANALYSIS (11)
LEARNING (ARTIFICIAL INTELLIGENCE) (9)
MACHINE LEARNING (9)
SUPPORT VECTOR MACHINES (9)
PREDICTION ALGORITHMS (8)
ALGORITHM DESIGN AND ANALYSIS (7)
BAYES METHODS (7)
FEATURE SELECTION (7)
SUPPORT VECTOR MACHINE (7)
ARTIFICIAL NEURAL NETWORKS (6)
NEURAL NETS (6)
NAIVE BAYES (5)
NEURAL NETWORK (5)
FEATURE EXTRACTION (4)
GENETIC ALGORITHM (4)
PREDICTION (4)
REGRESSION ANALYSIS (4)
ROUGH SET (4)
ROUGH SET THEORY (4)
STATISTICAL ANALYSIS (4)
TESTING (4)
ARTIFICIAL NEURAL NETWORK (3)
CLUSTERING ALGORITHMS (3)
COMPUTATIONAL MODELING (3)
COMPUTER AIDED INSTRUCTION (3)
CONSTRUCTION INDUSTRY (3)
F-SCORE (3)
GENETIC ALGORITHMS (3)
GEOPHYSICS COMPUTING (3)
K-NEAREST NEIGHBOR (3)
LINEAR DISCRIMINATE ANALYSIS (3)
MEDICAL DIAGNOSTIC IMAGING (3)
PREDICTIVE MODELS (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
TRAINING DATA (3)
BAYESIAN METHODS (2)
BELIEF NETWORKS (2)
BREAST CANCER (2)
CLASSIFICATION ACCURACY (2)
CLASSIFICATION RULES (2)
CREDIT SCORING (2)
DATA ANALYSIS (2)
DATA CLASSIFICATION ALGORITHMS (2)
DATA HANDLING (2)
DATA MINING METHOD (2)
DECISION SUPPORT SYSTEMS (2)
EDUCATIONAL INSTITUTIONS (2)
ENTROPY (2)
EXPERT SYSTEMS (2)
FEATURE SELECTION APPROACHES (2)
FINANCIAL DATA PROCESSING (2)
FUZZY SET THEORY (2)
HYPERSPECTRAL IMAGING (2)
IMAGE CLASSIFICATION (2)
INTERNET (2)
K-MEANS CLUSTERING ALGORITHM (2)
KNOWLEDGE DISCOVERY (2)
LOGISTIC REGRESSION (2)
MINERALS (2)
NAÏVE BAYES (2)
NEURAL NETWORKS (2)
PATTERN CLUSTERING (2)
PERFORMANCE ANALYSIS (2)
QUEST (2)
REMOTE SENSING (2)
RESERVOIRS (2)
SIMULATED ANNEALING (2)
STATISTICAL LDA (2)
SVM (2)
TIME SERIES (2)
TIME SERIES ANALYSIS (2)
TRADITIONAL CHINESE MEDICINE (2)
TRAFFIC ENGINEERING COMPUTING (2)
UNCERTAINTY (2)
VEGETATION (2)
WEKA (2)
WEKA TOOL (2)
ABSTRACTING (1)
ABSTRACTION (1)
ACCURACY RATE (1)
ADAM-IVICS (1)
ADAPTATION MODEL (1)
ADAPTIVE ENGLISH LEARNING SYSTEM (1)
AEROLOGY EVENT PREDICTION (1)
AEROSOLS (1)
AGGREGATION (1)
AIRBORNE MINERAL DUST (1)
ANALYTICS (1)
ANN (1)
APPROXIMATION METHODS (1)
APRIORI (1)

INFONA - science communication portal
