Taghi M. Khoshgoftaar

rozdział

Predicting sentinel node status in melanoma from a real-world EHR dataset

Aaron N. Richter, Taghi M. Khoshgoftaar

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 1872 - 1878

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Melanoma is the fastest growing cancer worldwide, and 1 in 50 Americans will develop it in their lifetime. Sentinel lymph node (SLN) metastasis is one of the most important prognostic indicators for melanoma survival. We present several machine learning models for predicting SLN metastasis using data from a real-world dermatology electronic health record (EHR) system. The class label is the result...

rozdział

Detection of Phishing Webpages Using Heterogeneous Transfer Learning

Karl R. Weiss, Taghi M. Khoshgoftaar

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC) > 190 - 197

2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)

The detection of phishing websites using traditional machine learning methods has been demonstrated in previous studies. Traditional machine learning methods assume that the input feature space is the same between the training and testing data. There are scenarios in machine learning, where the available labeled training data has a different input feature space than the testing data. In cases where...

rozdział

Estimating Outlier Score Probabilities

Richard A. Bauder, Taghi M. Khoshgoftaar

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 559 - 568

2017 IEEE International Conference on Information Reuse and Integration (IRI)

Outlier detection is a critical function across a diverse range of tasks and domains. There are numerous outlier detection methods, the majority of which produce scores to indicate an outlier versus inlier. An issue with these scores is that they can be difficult to interpret and do not allow for comparisons between different methods. One solution is to convert the outlier score to probabilities....

rozdział

Using Weather and Playing Surface to Predict the Occurrence of Injury in Major League Soccer Games: A Case Study

Sara Landset, Michael F. Bergeron, Taghi M. Khoshgoftaar

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 366 - 371

2017 IEEE International Conference on Information Reuse and Integration (IRI)

Injuries in professional soccer games are very common and can greatly impact players, teams, and leagues. The ability to predict conditions under which injuries are likely to occur would help to mitigate competitive and financial losses. This paper presents a case study in which we look at injuries during 713 Major League Soccer games spanning the 2015 and 2016 seasons. Our dataset consists of 713...

rozdział

User Behavior Anomaly Detection for Application Layer DDoS Attacks

Maryam M. Najafabadi, Taghi M. Khoshgoftaar, Chad Calvert, Clifford Kemp

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 154 - 161

2017 IEEE International Conference on Information Reuse and Integration (IRI)

Distributed Denial of Service (DDoS) attacks are a popular and inexpensive form of cyber attacks. Application layer DDoS attacks utilize legitimate application layer requests to overwhelm a web server. These attacks are a major threat to Internet applications and web services. The main goal of these attacks is to make the services unavailable to legitimate users by overwhelming the resources on a...

rozdział

Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims

Matthew Herland, Richard A. Bauder, Taghi M. Khoshgoftaar

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 579 - 588

2017 IEEE International Conference on Information Reuse and Integration (IRI)

Fraud, waste, and abuse in medical insurance contributes to significant increases in costs for providers and patients. One way to reduce costs is through the detection of abnormal medical practices that could indicate possible fraud. In this paper, we expand upon our previous research into medical specialty anomaly detection by validating the efficacy of our model using real-world fraud cases, and...

rozdział

Modernizing Analytics for Melanoma with a Large-Scale Research Dataset

Aaron N. Richter, Taghi M. Khoshgoftaar

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 551 - 558

2017 IEEE International Conference on Information Reuse and Integration (IRI)

We present the Modernizing Analytics for MELanoma (MAMEL) dataset: a real-world, dermatologyspecific research dataset specifically crafted to advance data mining and machine learning research in the field of melanoma diagnosis, analysis, and treatment. This dataset was collected and curated from Modernizing Medicine’s EMA DermatologyTM application, a cloud-based Electronic Health Record (EHR) platform...

rozdział

Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures

Joseph D. Prusa, Ryan Sagul, Taghi M. Khoshgoftaar, Michael Sterling

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 42 - 48

2017 IEEE International Conference on Information Reuse and Integration (IRI)

We propose and demonstrate an approach for the often attempted problem of market prediction. We restrict our study to a widely purchased and well recognized commodity, crude oil, which experiences significant volatility. Robust debate exists over the applicability of the weak and semi-strong versions of the Efficient Market Hypothesis (EMH) to financial markets. In this paper we train nine learners...

rozdział

Analysis of Transfer Learning Performance Measures

Karl R. Weiss, Taghi M. Khoshgoftaar

2017 IEEE International Conference on Information Reuse and Integration (IRI) > 338 - 345

2017 IEEE International Conference on Information Reuse and Integration (IRI)

In machine learning applications, there are scenarios of having no labeled training data, due to the data being rare or too expensive to obtain. In these cases, it is desirable to use readily available labeled data, that is similar to, but not the same as, the domain application of interest. Transfer learning algorithms are used to build high-performance classifiers, when the training data has different...

rozdział

A Review of Performance Evaluation on 2D Face Databases

Gabriel Castaneda, Taghi M. Khoshgoftaar

2017 IEEE Third International Conference on Big Data Computing Service and Applications (BigDataService) > 218 - 223

2017 IEEE Third International Conference on Big Data Computing Service and Applications (BigDataService)

Face recognition methods are evaluated against face image databases. Recent face image databases provide an evaluation protocol for an impartial comparison and assessment of where a facial recognition algorithm stands compared to other methods. Unfortunately, many authors test their facial recognition methods using either restricted face databases, random subsets from public databases, or do not follow...

rozdział

A Probabilistic Programming Approach for Outlier Detection in Healthcare Claims

Richard A. Bauder, Taghi M. Khoshgoftaar

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 347 - 354

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Healthcare is an integral component in people's lives, especially for the rising elderly population. Medicare is one such healthcare program that provides for the needs of the elderly. It is imperative that these healthcare programs are affordable, but this is not always the case. Out of the many possible factors for the rising cost of healthcare, claims fraud is a major contributor, but its impact...

rozdział

An Investigation of Ensemble Techniques for Detection of Spam Reviews

Brian Heredia, Taghi M. Khoshgoftaar, Joseph Prusa, Michael Crawford

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 127 - 133

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Whether purchasing a product or searching for a new doctor, consumers often turn to online reviews for recommendations. Determining whether reviews are truthful is imperative to the consumer, as to not get misled by false recommendations. Unfortunately, it is often difficult, or impossible, for humans to ascertain the validity of a review through reading the text, however, studies have shown machine...

rozdział

Investigating Transfer Learners for Robustness to Domain Class Imbalance

Karl R. Weiss, Taghi M. Khoshgoftaar

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 207 - 213

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

A transfer learning environment is characterized by a machine learning algorithm being trained with data from one domain (the source domain) and being tested on data from a different domain (the target domain). In a transfer learning scenario, the class probability of the source domain may be different from the class probability of the target domain, which is referred to as "domain class imbalance"...

rozdział

An Investigation of Transfer Learning and Traditional Machine Learning Algorithms

Karl R. Weiss, Taghi M. Khoshgoftaar

2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI) > 283 - 290

2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI)

Previous research focusing on the evaluation of transfer learning algorithms has predominantly used real-world datasets to measure an algorithm's performance. A test with a real-world dataset exposes an algorithm to a single instance of distribution difference between the training (source) and test (target) datasets. These previous works have not measured performance over a wide-range of source and...

rozdział

Integrating Multiple Data Sources to Enhance Sentiment Prediction

Brian Heredia, Taghi M. Khoshgoftaar, Joseph Prusa, Michael Crawford

2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC) > 285 - 291

2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC)

Understanding the sentiment conveyed by a person is an important part of any social interaction, and sentiment in text can provide valuable insight into an author's opinion. Sentiment analysis for text is a large field of research within machine learning, as it allows the sentiment of large numbers of text instances to be determined and used to answer various questions, such as election prediction...

rozdział

Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims

Richard A. Bauder, Taghi M. Khoshgoftaar, Aaron Richter, Matthew Herland

2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI) > 784 - 790

2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI)

The healthcare industry is a complex system with many moving parts. One issue in this field is the misuse of medical insurance systems, such as Medicare. In this paper, we build a machine learning model to detect when physicians exhibit anomalous behavior in their medical insurance claims. This new research has the potential to give some insight in determining if, and when, physicians are acting outside...

rozdział

A Novel Method for Fraudulent Medicare Claims Detection from Expected Payment Deviations (Application Paper)

Richard A. Bauder, Taghi M. Khoshgoftaar

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI) > 11 - 19

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI)

Healthcare has and continues to be an integral component in people's lives, especially for the rising elderly population. One such healthcare program that provides for the needs of the elderly is Medicare. It is important that any such program be affordable but, unfortunately, this is not always the case. Out of the many possible factors for the rising cost of healthcare, fraud is a major contributor,...

rozdział

Designing a Better Data Representation for Deep Neural Networks and Text Classification

Joseph D. Prusa, Taghi M. Khoshgoftaar

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI) > 411 - 416

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI)

Traditional machine learning requires data to be described by attributes prior to applying a learning algorithm. In text classification tasks, many feature engineering methodologies have been proposed to extract meaningful features, however, no best practice approach has emerged. Traditional methods of feature engineering have inherent limitations due to loss of information and the limits of human...

rozdział

Predicting Cancer Relapse with Clinical Data: A Survey of Current Techniques

Aaron N. Richter, Taghi M. Khoshgoftaar

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI) > 369 - 376

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI)

While cancer treatments are constantly advancing, there is still a real risk of relapse after potentially curative treatments. At the risk of adverse side effects, certain adjuvant treatments can be given to patients that are at high risk of recurrence. The challenge, however, is in finding the best tradeoff between these two extremes. Patients that are given more potent treatments, such as chemotherapy,...

rozdział

Designing a Testing Framework for Transfer Learning Algorithms (Application Paper)

Karl R. Weiss, Taghi M. Khoshgoftaar, Oneeb Rehman

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI) > 152 - 159

2016 IEEE 17th International Conference on Information Reuse and Integration (IRI)

Most works covering the topic of transfer learning propose an algorithm to solve a given domain adaptation problem, then test the algorithm using real-world datasets. A test with a real-world dataset represents a single transfer learning test condition, which partially measures an algorithm's performance. Previous research has placed little emphasis on developing a comprehensive and uniform test for...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Taghi M. Khoshgoftaar

Predicting sentinel node status in melanoma from a real-world EHR dataset

Detection of Phishing Webpages Using Heterogeneous Transfer Learning

Estimating Outlier Score Probabilities

Using Weather and Playing Surface to Predict the Occurrence of Injury in Major League Soccer Games: A Case Study

User Behavior Anomaly Detection for Application Layer DDoS Attacks

Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims

Modernizing Analytics for Melanoma with a Large-Scale Research Dataset

Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures

Analysis of Transfer Learning Performance Measures

A Review of Performance Evaluation on 2D Face Databases

A Probabilistic Programming Approach for Outlier Detection in Healthcare Claims

An Investigation of Ensemble Techniques for Detection of Spam Reviews

Investigating Transfer Learners for Robustness to Domain Class Imbalance

An Investigation of Transfer Learning and Traditional Machine Learning Algorithms

Integrating Multiple Data Sources to Enhance Sentiment Prediction

Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims

A Novel Method for Fraudulent Medicare Claims Detection from Expected Payment Deviations (Application Paper)

Designing a Better Data Representation for Deep Neural Networks and Text Classification

Predicting Cancer Relapse with Clinical Data: A Survey of Current Techniques

Designing a Testing Framework for Transfer Learning Algorithms (Application Paper)

Opcje filtrowania

Data publikacji

Dostępność treści

Słowa kluczowe

Zbiór danych

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Taghi M. Khoshgoftaar

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Słowa kluczowe

Zbiór danych

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu