Search results

Items from 1 to 20 out of 28 results

chapter

An Empirical Evaluation of Techniques for Feature Selection with Cost

Stephen Adams, Ryan Meekins, Peter A. Beling

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 834 - 841

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Feature selection is the process of selecting a subset of relevant features from the larger set of collected features. As the amount of available data grows with technology, feature selection becomes a more important part of the system-design process. In real-world applications, there are several costs associated with the collection, processing, and storage of data. Given that these costs can vary...

chapter

An approach for predicting employee churn by using data mining

Ibrahim Onuralp Yigit, Hamed Shourabizadeh

2017 International Artificial Intelligence and Data Processing Symposium (IDAP) > 1 - 4

2017 International Artificial Intelligence and Data Processing Symposium (IDAP)

Employee churn prediction which is closely related to customer churn prediction is a major issue of the companies. Despite the importance of the issue, there is few attention in the literature about. In this study, we applied well-known classification methods including, Decision Tree, Logistic Regression, SVM, KNN, Random Forest, and Naive Bayes methods on the HR data. Then, we analyze the results...

chapter

Stock price trend prediction using Artificial Neural Network techniques: Case study: Thailand stock exchange

Weerachart Lertyingyod, Nunnapus Benjamas

2016 International Computer Science and Engineering Conference (ICSEC) > 1 - 6

2016 International Computer Science and Engineering Conference (ICSEC)

This paper presents a predictive model which to predict the trends of stock prices using Data Mining techniques. This research will allow the investor to make a more informed decision to buy and sell stocks, and in the most appropriate period. The predictive concept in this work implies learning historical price patterns, indicators, and behavior; and then predicting the future trends in one, five,...

chapter

Use Educational Data Mining to Predict Undergraduate Retention

Steven Lehr, Hong Liu, Sean Kinglesmith, Alex Konyha, more

2016 IEEE 16th International Conference on Advanced Learning Technologies (ICALT) > 428 - 430

2016 IEEE 16th International Conference on Advanced Learning Technologies (ICALT)

This paper presents an application of educational data mining to predict undergraduate retention. The research provides valuable insight about data feature ranking, algorithm selections and validation methods based on unique types of data that come from educational settings. The data from a cohort of 972 students enrolled in 2008 at Embry-Riddle Aeronautical University (ERAU) were used to train and...

chapter

Dimensionality Reduction with a Composite-Selective Strategy in Documents with a Hybrid Content

Saeed Raheel

2015 3rd International Conference on Artificial Intelligence, Modelling and Simulation (AIMS) > 113 - 116

2015 3rd International Conference on Artificial Intelligence, Modelling & Simulation (AIMS)

Feature selection is the process of choosing a subset of the available features or attributes from a certain dataset in order to render the process of building a predictive model more efficient and accurate. The selection of attributes is, in most of the times, done sequentially. In this paper we propose a new filtering strategy that selects the attributes in a composite way rather than sequential...

chapter

Fasting Blood Glucose Change Prediction Model Based on Medical Examination Data and Data Mining Techniques

Wenxiang Xao, Fengjing Shao, Jun Ji, Rencheng Sun, more

2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity) > 742 - 747

2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity)

Fasting blood glucose (FBG) is an important indicator for human's health. Prediction for FBG is meaningful for finding and healing diseases, especially for diabetes mellitus. Based on four years' historical medical examination data, a prediction model of coming year's FBG is presented using traditional data mining techniques with a novel algorithm to estimate the FBG change probability and a proposed...

chapter

Feature Extraction Model to Identify At -- Risk Level of Students in Academia

Mamta Singh, Jyoti Singh, Arpana Rawal

2014 International Conference on Information Technology > 221 - 227

2014 International Conference on Information Technology (ICIT)

Since four decades, a sincere concern has aroused among managerial, professional, towards the satisfaction of teaching-learning objective in Academia. Huge span of time has already been spent revealing student's profile patterns using predictive modeling methods, however, very little effort is put up in identifying the causative features responsible for varied students' performances followed by decisive...

chapter

Exploration of robust features for multiclass emotion classification

Bincy Thomas, Dhanya K. A, Vinod P

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1704 - 1709

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Classification of emotion from sentences requires the classifier to be trained on relevant features. This paper focuses on different features (a) Bag-of-Words (b) Part-of-Speech tags (c) Sentence Length and (d) Lexical Emotion Features. Extensive evaluation on variable feature length for classifying textual emotions is carried out to understand their role in model performance. Experiments depict that...

chapter

Automated Configuration Bug Report Prediction Using Text Mining

Xin Xia, David Lo, Weiwei Qiu, Xingen Wang, more

2014 IEEE 38th Annual Computer Software and Applications Conference > 107 - 116

2014 IEEE 38th Annual Computer Software and Applications Conference (COMPSAC)

Configuration bugs are one of the dominant causes of software failures. Previous studies show that a configuration bug could cause huge financial losses in a software system. The importance of configuration bugs has attracted various research studies, e.g., To detect, diagnose, and fix configuration bugs. Given a bug report, an approach that can identify whether the bug is a configuration bug could...

chapter

Temporary Staffing Services: A Data Mining Perspective

Jeroen DHaen, Dirk Van Den Poel

2012 IEEE 12th International Conference on Data Mining Workshops > 287 - 292

2012 IEEE 12th International Conference on Data Mining Workshops

Research on the temporary staffing industry discusses different topics ranging from workplace safety to the internationalization of temporary labor. However, there is a lack of data mining studies concerning this topic. This paper meets this void and uses a financial dataset as input for the estimated models. Bagged decision trees were utilized to cope with the high dimensionality. Two bagged decision...

chapter

The effects of feature selection and model selection on the correctness of classification

Shu-chuan Lo

2010 IEEE International Conference on Industrial Engineering and Engineering Management > 989 - 993

2010 IEEE International Conference on Industrial Engineering & Engineering Management (IE&EM 2010)

In this research we took an experiment of two feature selection methods - eta square and stepwise methods on two classification models - back propagation neural network (BPNN) and general regression neural network (GRNN) to study the effects on the correctness of firm bankruptcy classification. The correctness includes the average classification correctness and the power of bankruptcy classification...

chapter

Anomaly Detection Using an Ensemble of Feature Models

K Noto, C Brodley, D Slonim

2010 IEEE International Conference on Data Mining > 953 - 958

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

We present a new approach to semi-supervised anomaly detection. Given a set of training examples believed to come from the same distribution or class, the task is to learn a model that will be able to distinguish examples in the future that do not belong to the same class. Traditional approaches typically compare the position of a new data point to the set of ``normal'' training data points in a chosen...

chapter

Attribute Selection and Imbalanced Data: Problems in Software Defect Prediction

T M Khoshgoftaar, Kehan Gao, N Seliya

2010 22nd IEEE International Conference on Tools with Artificial Intelligence > 1 > 137 - 144

2010 22nd International Conference on Tools with Artificial Intelligence (ICTAI 2010)

The data mining and machine learning community is often faced with two key problems: working with imbalanced data and selecting the best features for machine learning. This paper presents a process involving a feature selection technique for selecting the important attributes and a data sampling technique for addressing class imbalance. The application domain of this study is software engineering,...

chapter

Forecasting the change of intraday stock price by using text mining news of stock

Shou-Hsiung Cheng

2010 International Conference on Machine Learning and Cybernetics > 5 > 2605 - 2609

2010 International Conference on Machine Learning and Cybernetics (ICMLC 2010)

This paper presents a method for forecasting the change of intraday stock price by utilizing text mining news of stock. This method is based on text mining techniques coupled with rough sets theories and support vector machine classifier. The method can handle without difficulty unstructured news of Taiwan stock market through preprocessing, feature selection and mark. The method also extracts the...

chapter

Features and Bayesian Network Model of Conceptual Change for INQPRO

Choo-Yee Ting, Kok-Chin Khor, Somnuk Phon-Amnuaisuk

2010 Second International Conference on Computer Engineering and Applications > 2 > 305 - 309

2010 Second International Conference on Computer Engineering and Applications (ICCEA 2010)

Predicting conceptual change in scientific inquiry learning environment is not trivial due to the challenges that stemmed when eliciting a student's implicit properties. The challenges could be more complicated when such learning environment employs exploratory learning approach. One plausible approach to tackle the challenges is by employing data mining approach. In this study, 129 interaction logs...

chapter

Task-driven data mining in the formation evaluation field

Xiongyan Li, Hongqi Li, He Xu, Zhou Jinyu, more

2010 6th International Conference on Advanced Information Management and Service (IMS) > 42 - 47

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

In the traditional data-driven data mining process, there are huge gaps between the efficient algorithms and intelligent tools as well as the invalidity of knowledge, which is obtained by traditional data-driven data mining. Meanwhile, each data in the earth science field contains a solid physical meaning. If there is no corresponding domain knowledge involved in the mining process, the information...

chapter

Feature Selection with Imbalanced Data for Software Defect Prediction

T.M. Khoshgoftaar, Kehan Gao

2009 International Conference on Machine Learning and Applications > 235 - 240

Eighth International Conference on Machine Learning and Applications (ICMLA 2009)

In this paper, we study the learning impact of data sampling followed by attribute selection on the classification models built with binary class imbalanced data within the scenario of software quality engineering. We use a wrapper-based attribute ranking technique to select a subset of attributes, and the random undersampling technique (RUS) on the majority class to alleviate the negative effects...

chapter

Intrusion detection using k-Nearest Neighbor

M. Govindarajan, R.M. Chandrasekaran

2009 First International Conference on Advanced Computing > 13 - 20

2009 First International Conference on Advanced Computing (ICAC 2009)

Data mining is the use of algorithms to extract the information and patterns derived by the knowledge discovery in databases process. Classification maps data into predefined groups or classes. It is often referred to as supervised learning because the classes are determined before examining the data. In many data mining applications that address classification problems, feature and model selection...

chapter

High-Dimensional Software Engineering Data and Feature Selection

Huanjing Wang, T.M. Khoshgoftaar, Kehan Gao, N. Seliya

2009 21st IEEE International Conference on Tools with Artificial Intelligence > 83 - 90

2009 21st IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2009)

Software metrics collected during project development play a critical role in software quality assurance. A software practitioner is very keen on learning which software metrics to focus on for software quality prediction. While a concise set of software metrics is often desired, a typical project collects a very large number of metrics. Minimal attention has been devoted to finding the minimum set...

chapter

Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning

Minho Kim, Youngim Jung, Hyuk-Chul Kwon

2009 21st IEEE International Conference on Tools with Artificial Intelligence > 323 - 327

2009 21st IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2009)

Prediction of the prosodic phrase boundary is a potent influence on the performance of speech recognition and voice synthesis systems. We propose a statistical approach using efficient learning features for the natural prediction of the Korean prosodic phrase boundary. These new features reflect factors that affect the generation of the prosodic phrase boundary better than existing learning features...

Data set:
ieee
Keywords:
DATA MINING
PREDICTIVE MODELS
FEATURE SELECTION

Publication date

Set your own date range

INFONA - science communication portal

Search results

An Empirical Evaluation of Techniques for Feature Selection with Cost

An approach for predicting employee churn by using data mining

Stock price trend prediction using Artificial Neural Network techniques: Case study: Thailand stock exchange

Use Educational Data Mining to Predict Undergraduate Retention

Dimensionality Reduction with a Composite-Selective Strategy in Documents with a Hybrid Content

Fasting Blood Glucose Change Prediction Model Based on Medical Examination Data and Data Mining Techniques

Feature Extraction Model to Identify At -- Risk Level of Students in Academia

Exploration of robust features for multiclass emotion classification

Automated Configuration Bug Report Prediction Using Text Mining

Temporary Staffing Services: A Data Mining Perspective

The effects of feature selection and model selection on the correctness of classification

Anomaly Detection Using an Ensemble of Feature Models

Attribute Selection and Imbalanced Data: Problems in Software Defect Prediction

Forecasting the change of intraday stock price by using text mining news of stock

Features and Bayesian Network Model of Conceptual Change for INQPRO

Task-driven data mining in the formation evaluation field

Feature Selection with Imbalanced Data for Software Defect Prediction

Intrusion detection using k-Nearest Neighbor

High-Dimensional Software Engineering Data and Feature Selection

Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options