Search results for: Taghi M. Khoshgoftaar

Items from 1 to 9 out of 9 results

chapter

Investigating Transfer Learners for Robustness to Domain Class Imbalance

Karl R. Weiss, Taghi M. Khoshgoftaar

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 207 - 213

A transfer learning environment is characterized by a machine learning algorithm being trained with data from one domain (the source domain) and being tested on data from a different domain (the target domain). In a transfer learning scenario, the class probability of the source domain may be different from the class probability of the target domain, which is referred to as "domain class imbalance"...

chapter

A Novel Noise-Resistant Boosting Algorithm for Class-Skewed Data

Jason Van Hulse, Taghi M. Khoshgoftaar, Amri Napolitano

2012 11th International Conference on Machine Learning and Applications > 2 > 551 - 557

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

Boosting methods have been successfully applied in a wide variety of machine learning applications. In the context of data quality issues, a number of variants of the standard boosting method have been proposed and evaluated. To address the problem of mislabeled examples, ORBoost was developed to prevent overfitting to noisy examples. Our research group has recently proposed RUSBoost as an enhancement...

chapter

A Hybrid Approach to Coping with High Dimensionality and Class Imbalance for Software Defect Prediction

Kehan Gao, Taghi M. Khoshgoftaar, Amri Napolitano

2012 11th International Conference on Machine Learning and Applications > 2 > 281 - 288

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

High dimensionality and class imbalance are the two main problems affecting software defect prediction. In this paper, we propose a new technique, named SelectRUSBoost, which is a form of ensemble learning that incorporates data sampling to alleviate class imbalance and feature selection to resolve high dimensionality. To evaluate the effectiveness of the new technique, we apply it to a group...

chapter

A novel dataset-similarity-aware approach for evaluating stability of software metric selection techniques

Huanjing Wang, Taghi M. Khoshgoftaar, Randall Wald, Amri Napolitano

2012 IEEE 13th International Conference on Information Reuse & Integration (IRI) > 1 - 8

2012 IEEE 13th International Conference on Information Reuse & Integration (IRI)

Software metric (feature) selection is an important pre-processing step before building software defect prediction models. Although much research has been done analyzing the classification performance of feature selection methods, fewer works have focused on their stability (robustness). Stability is important because feature selection methods which reliably produce the same results despite changes...

chapter

Measuring Stability of Threshold-Based Feature Selection Techniques

Huanjing Wang, Taghi M. Khoshgoftaar

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence > 986 - 993

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence (ICTAI)

Feature selection has been applied in many domains, such as text mining and software engineering. Ideally a feature selection technique should produce consistent outputs regardless of minor variations in the input data. Researchers have recently begun to examine the stability (robustness) of feature selection techniques. The stability of a feature selection method is defined as the degree of agreement...

chapter

Impact of Data Sampling on Stability of Feature Selection for Software Measurement Data

Kehan Gao, Taghi M. Khoshgoftaar, Amri Napolitano

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence > 1004 - 1011

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence (ICTAI)

Software defect prediction can be considered a binary classification problem. Generally, practitioners utilize historical software data, including metric and fault data collected during the software development process, to build a classification model and then employ this model to predict new program modules as either fault-prone (fp) or not-fault-prone (nfp). Limited project resources can then be...

chapter

Measuring robustness of Feature Selection techniques on software engineering datasets

Huanjing Wang, Taghi M. Khoshgoftaar, Randall Wald

2011 IEEE International Conference on Information Reuse & Integration > 309 - 314

2011 IEEE International Conference on Information Reuse & Integration (IRI)

Feature Selection is a process which identifies irrelevant and redundant features from a high-dimensional dataset (that is, a dataset with many features), and removes these before further analysis is performed. Recently, the robustness (e.g., stability) of feature selection techniques has been studied, to examine the sensitivity of these techniques to changes in their input data. In this study, we...

chapter

Predicting Faults in High Assurance Software

Naeem Seliya, Taghi M. Khoshgoftaar, Jason Van Hulse

2010 IEEE 12th International Symposium on High Assurance Systems Engineering > 26 - 34

2010 IEEE 12th International Symposium on High-Assurance Systems Engineering (HASE)

Reducing the number of latent software defects is a development goal that is particularly applicable to high assurance software systems. For such systems, the software measurement and defect data is highly skewed toward the not-fault-prone program modules, i.e., the number of fault-prone modules is relatively very small. The skewed data problem, also known as class imbalance, poses a unique challenge...

chapter

A Comparative Study of Threshold-Based Feature Selection Techniques

Huanjing Wang, Taghi M. Khoshgoftaar, Jason Van Hulse

2010 IEEE International Conference on Granular Computing > 499 - 504

2010 IEEE International Conference on Granular Computing (GrC-2010)

Given high-dimensional software measurement data, researchers and practitioners often use feature (metric) selection techniques to improve the performance of software quality classification models. This paper presents our newly proposed threshold-based feature selection techniques, comparing the performance of these techniques by building classification models using five commonly used classifiers...

