Search results

chapter

Predicting Faults in High Assurance Software

Naeem Seliya, Taghi M Khoshgoftaar, Jason Van Hulse

2010 IEEE 12th International Symposium on High Assurance Systems Engineering > 26 - 34

2010 IEEE 12th International Symposium on High-Assurance Systems Engineering (HASE)

Reducing the number of latent software defects is a development goal that is particularly applicable to high assurance software systems. For such systems, the software measurement and defect data is highly skewed toward the not-fault-prone program modules, i.e., the number of fault-prone modules is relatively very small. The skewed data problem, also known as class imbalance, poses a unique challenge...

chapter

The problem of classification in imbalanced data sets in knowledge discovery

Haifeng Sui, Bingru Yang, Yun Zhai, Wu Qu, more

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) > 9 > V9-658 - V9-661

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)

It has been observed that classification in imbalanced data sets have drawn more attention to researchers in knowledge discovery and data mining fields. In such problems, almost all the samples are labeled as one class, while far fewer samples are labeled as the other class, which are usually more important. But traditional classifiers that try to pursue whole accurate performance over a full range...

chapter

Cascade generalization: Is SVMs' inductive bias useful?

Nahla Barakat

2010 IEEE International Conference on Systems, Man and Cybernetics > 1393 - 1399

2010 IEEE International Conference on Systems, Man and Cybernetics (SMC 2010)

The problem of choosing the best classification algorithm for a specific problem domain has been extensively researched. This issue was also the main motivation behind the ever increasing interest in ensemble methods since 1992. In this paper, we propose a new method for classifiers' fusion, which integrates cascade generalization and voting techniques. The proposed method utilizes two learning algorithms...

chapter

Instance-based ensemble learning algorithm with stacking framework

Haleh Homayouni, Sattar Hashemi, Ali Hamzeh

2010 2nd International Conference on Software Technology and Engineering > 2 > V2-164 - V2-169

2010 2nd International Conference on Software Technology and Engineering (ICSTE 2010)

Nowadays the most active research in supervised learning includes an integration of several base classifiers into the combined classification system. Such systems are known under the names multiple classifiers, ensembles methods. This topic attracts an interest of machine learning researchers as multiple classifiers are often much more accurate than the component classifiers that make them up. In...

chapter

Predicting player behavior in Tomb Raider: Underworld

Tobias Mahlmann, Anders Drachen, Julian Togelius, Alessandro Canossa, more

Proceedings of the 2010 IEEE Conference on Computational Intelligence and Games > 178 - 185

2010 IEEE Information Theory Workshop (ITW 2010)

This paper presents the results of an explorative study on predicting aspects of playing behavior for the major commercial title Tomb Raider: Underworld (TRU). Various supervised learning algorithms are trained on a large-scale set of in-game player behavior data, to predict when a player will stop playing the TRU game and, if the player completes the game, how long will it take to do so. Results...

chapter

Optimized fuzzy information granulation based machine learning classification

Yang Li, Fusheng Yu

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 1 > 259 - 263

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

In machine learning classification, the classifier can be described by some rules, and the rules can be expressed by fuzzy granules corresponding to fuzzy concepts. In this paper we will introduce fuzzy information granulation to the process of building fuzzy classifier. Furthermore, we will present an optimized information granulation based machine learning classification algorithm. Experiments carried...

chapter

A classification algorithm for noisy data streams

Yan Li, Yuhong Zhang, Hu Xuegang, Li Peipei

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 2239 - 2244

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

Classification on noisy data streams has recently become one of the most important topics in streaming data mining. In this paper, a Classification algorithm for mining Data Streams based on Mixture Models of C4.5 and NB is proposed called CDSMM. In this algorithm, C4.5 is used as the base classifiers, the hypothesis testing method is introduced for the detection of concept drifts, and a Naïve Bayes...

chapter

A new clustering classification approach based on FCR

De-gan Zhang, Weiwei Liu, Xuejing Kang, Ying Chen, more

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 2 > 967 - 971

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

A new clustering classification approach based on fuzzy closeness relationship (FCR) is studied in this paper. As we know, fuzzy clustering classification is one of important and valid methods to knowledge discovery. One of problems in fuzzy clustering classification is to determine a certain fuzzy sample classification in given limited sample space. Another is its validity, that is to say, if the...

chapter

A Practical Heterogeneous Classifier for Relational Databases

Geetha Manjunath, M Narasimha Murty, Dinkar Sitaram

2010 20th International Conference on Pattern Recognition > 3316 - 3319

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Most enterprise data is distributed in multiple relational databases with expert-designed schema. Using traditional single-table machine learning techniques over such data not only incur a computational penalty for converting to a "flat" form (mega-join), even the human-specified semantic information present in the relations is lost. In this paper, we present a two-phase hierarchical meta-classification...

chapter

Comparison of support vector machine and support vector regression: An application to predict financial distress and bankruptcy

Mu-Yen Chen, Chia-Chen Chen, Ya-Fen Chang

2010 7th International Conference on Service Systems and Service Management > 1 - 6

2010 7th International Conference on Service Systems and Service Management (ICSSSM 2010)

Lately, many notorious financial distress and bankruptcy events occurred in the world economic. As we known, bankruptcy of Lehman Brothers Holdings Inc. (LEH) is the largest bankruptcy filing in U.S. history in 2008. These events have serious impacted on the socio-economic and investment in public wealth. Due to solve this dilemma, this research collected 68 listed companies as the raw data from Taiwan...

chapter

A New Supervised Dimensionality Reduction Method for Image Data Using Evolutionary Strategy

Mudasser Naseer, Shi-Yin Qin

2010 Second International Conference on Computer Research and Development > 116 - 120

Second International Conference on Computer Research and Development (ICCRD 2010)

Most of the classifiers suffer from curse of dimensionality during classification of high dimensional image data. In this paper, we introduce a new supervised nonlinear dimensionality reduction (S-NLDR) algorithm called evolutionary strategy based supervised dimensionality reduction (ESSDR). The ESSDR method uses population based evolutionary strategy (ES) algorithm to find low dimensional embedded...

chapter

Predicting Phishing Websites Using Classification Mining Techniques with Experimental Case Studies

Maher Aburrous, M A Hossain, Keshav Dahal, Fadi Thabtah

2010 Seventh International Conference on Information Technology: New Generations > 176 - 181

Seventh International Conference on Information Technology: New Generations (ITNG 2010)

Classification Data Mining (DM) Techniques can be a very useful tool in detecting and identifying e-banking phishing websites. In this paper, we present a novel approach to overcome the difficulty and complexity in detecting and predicting e-banking phishing website. We proposed an intelligent resilient and effective model that is based on using association and classification Data Mining algorithms...

chapter

Part-of-Speech Approach to Evaluation of Textbook Reviews

Patrawadee Tanawongsuwan

2010 Second International Conference on Computer and Network Technology > 352 - 356

2010 Second International Conference on Computer and Network Technology (ICCNT 2010)

Book reviews are comments written by readers regarding their experiences about a particular book. Some reviews contain useful information and may help prospective buyers in making a purchase decision, while some are viewed as less helpful, such as, complaints about shipping delay. The review's content is the key to differentiating them. Presenting a methodology for evaluating the helpfulness of a...

chapter

Intelligent crawling based on rough set for web resource discovery

LingXia Hu

2010 2nd IEEE International Conference on Information Management and Engineering > 624 - 627

2010 2nd IEEE International Conference on Information Management and Engineering (ICIME 2010)

The rapid development of the Internet brings a new problem, which is how to rapidly and effectively retrieve needed web resource from vast number of web pages. The progress of machine learning techniques shows a new direction of solving this problem. In this paper, intelligent crawling algorithm based on rough set is proposed. The algorithm use the hypertext features behavior in order to perform topic...

chapter

Seizure prediction with spectral power of time/space-differential EEG signals using cost-sensitive support vector machine

Yun Park, T Netoff, K Parhi

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5450 - 5453

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

A patient-specific seizure prediction algorithm is proposed using a classifier to differentiate preictal from interictal ECoG signals. Spectral power of ECoG processed in four different fashions are used as features: raw, time-differential, space-differential, and time/space-differential ECoG. The features are classified using cost-sensitive support vector machines by the double cross-validation methodology...

chapter

Associative Classification techniques for predicting e-banking phishing websites

Maher Aburrous, M A Hossain, Keshav Dahal, Fadi Thabtah

2010 International Conference on Multimedia Computing and Information Technology (MCIT) > 9 - 12

2010 International Conference on Multimedia Computing and Information Technology (MCIT 2010)

This paper presents a novel approach to overcome the difficulty and complexity in detecting and predicting e-banking phishing website. We proposed an intelligent resilient and effective model that is based on using association and classification Data Mining algorithms. These algorithms were used to characterize and identify all the factors and rules in order to classify the phishing website and the...

chapter

Using machine learning on sensor data

A Moraru, M Pesko, M Porcius, C Fortuna, more

Proceedings of the ITI 2010, 32nd International Conference on Information Technology Interfaces > 573 - 578

2010 32nd International Conference on Information Technology Interfaces (ITI 2010)

Developing hardware, algorithms and protocols, as well as collecting data in sensor networks are all important challenges in building good systems. We describe a vertical system integration of a sensor node and a toolkit of machine learning algorithms. Based on a dataset that combines sensor data with additional introduced data we predict the number of persons in a closed space. We analyze the dataset...

chapter

Improving the accuracy of Tagging Recommender System by Using Classification

Jian Song, Liang He, Xin Lin

2010 The 12th International Conference on Advanced Communication Technology (ICACT) > 1 > 387 - 391

2010 12th International Conference on Advanced Communication Technology (ICACT 2010)

Collaborative tagging system has become more and more popular and recently achieved widespread success due to flexibility and conceptual comprehensibility of tagging systems. Recommender system has the access to adopt tagging systems to achieve better performance. In this paper we consider that the items can be categorized into different classifications in which users show different interests. Here...

chapter

Bio-inspired machine learning in microarray gene selection and cancer classification

S.H. Aljahdali, M.E. El-Telbany

2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) > 339 - 343

2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2009)

Microarray technology today has the ability of having the whole genome spotted on a single chip. It allows the biologist to inspect thousands of gene activities simultaneously. Machine learning approaches are suited and used to discovering the complex relationships between genes under controlled experimental conditions and classify microarray data by identifying a subset of informative genes embedded...

chapter

A Novel Feature Selection Approach and Feature Weight Adjustment Technique in Text Classification

Yixing Liao, Xuezeng Pan

2009 Seventh ACIS International Conference on Software Engineering Research, Management and Applications > 41 - 44

2009 7th ACIS International Conference on Software Engineering Research, Management and Applications (SERA 2009)

Feature selection and feature weight calculating are key preprocesses in text classification. A new feature selection approach based on average interaction gain (AIG) is presented and a new feature weight adjustment technique (WA) taking inter-class distribution and intra-class distribution into consideration is presented too. Then a new approach combining AIG with WA called AIG-WA is presented. In...

INFONA - science communication portal
