Advanced search

chapter

A systematic framework to discover pattern for web spam classification

Hamed Jelodar, Yongli Wang, Chi Yuan, Xiaohui Jiang

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) > 32 - 39

2017 8th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)

Web spam is a big problem for search engine users in World Wide Web. They use deceptive techniques to achieve high rankings. Although many researchers have presented the different approach for classification and web spam detection still it is an open issue in computer science. Analyzing and evaluating these websites can be an effective step for discovering and categorizing the features of these websites...

chapter

Prediction and diagnosis of leukemia using classification algorithms

Khaled A. S. Abu Daqqa, Ashraf Y. A. Maghari, Wael F. M. Al Sarraj

2017 8th International Conference on Information Technology (ICIT) > 638 - 643

2017 8th International Conference on Information Technology (ICIT)

Algorithms used in data mining techniques are of great importance in the field of health care, especially in the case of getting patterns or models that are undiscovered in databases. In the area of health care, leukemia affects the blood status and can be discovered by using the Blood Cell Counter (CBC). This study aims to predict the leukemia existence by determining the relationships of blood properties...

chapter

Anomaly detection on a real-time server using decision trees step by step procedure

Georges Chaaya, Hoda Maalouf

2017 8th International Conference on Information Technology (ICIT) > 127 - 133

2017 8th International Conference on Information Technology (ICIT)

Anomaly detection is the process of finding outlying records from a given data set. The aim of this paper is to study a well-known anomaly detection technique on the “Short Message Service Centre” server, used in the telecommunications field to handle and store messages. This server was studied in details, a script was written to gather all the required data that went through a cleaning phase and...

chapter

Analyzing Titanic disaster using machine learning algorithms

Aakriti Singh, Shipra Saraswat, Neetu Faujdar

2017 International Conference on Computing, Communication and Automation (ICCCA) > 406 - 411

2017 International Conference on Computing, Communication and Automation (ICCCA)

Titanic disaster occurred 100 years ago on April 15, 1912, killing about 1500 passengers and crew members. The fateful incident still compel the researchers and analysts to understand what can have led to the survival of some passengers and demise of the others. With the use of machine learning methods and a dataset consisting of 891 rows in the train set and 418 rows in the test set, the research...

chapter

Predictive analytics for E learning system

Madhav S. Vyas, Reshma Gulwani

2017 International Conference on Inventive Systems and Control (ICISC) > 1 - 4

2017 International Conference on Inventive Systems and Control (ICISC)

E Learning courses are much in demand in recent times. The need to study student's performance and predicting their performance is increasing along with it. With the growing popularity of educational technology, various data mining algorithms suitable for predicting student performance have been reviewed. The best algorithm depends on the nature of prediction the faculty wants to make. As the amount...

chapter

An Improved Decision Tree Method Base on RELIEFF for Medical Diagnosis

Quanjun Liu, Xiaowei Xu, Ye Tao, Xiaodong Wang

2016 6th International Conference on Digital Home (ICDH) > 133 - 138

2016 6th International Conference on Digital Home (ICDH)

There emerges an increasing need to mine and analyze the health data from smart home medical systems and community medical organizations. Regarding to the influence of irrelevant attributes, in this study, an improved C4.5 decision tree method based on RELIEFF attribute weighting techniques is proposed for medical diagnosis. This method includes two steps: the first step is to delete the irrelevant...

chapter

Intrusion detection system using data mining a review

Varsha Singh, Shubha Puthran

2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC) > 587 - 592

2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC)

Everyday huge amount of information are transferred from one network to another, the information may be exposed to attacks. The information and information system should be protected from unauthorized users. To provide and maintain the Confidentiality and Integrity of the information is a very tedious job so Intrusion Detection plays a very important role. Although various methods are used to protect...

chapter

Study of machine learning algorithms for special disease prediction using principal of component analysis

B. Dhomse Kanchan, M. Mahale Kishor

2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC) > 5 - 10

2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC)

The worldwide study on causes of death due to heart disease/syndrome has been observed that it is the major cause of death. If recent trends are allowed to continue, 23.6 million people will die from heart disease in coming 2030. The healthcare industry collects large amounts of heart disease data which unfortunately are not “mined” to discover hidden information for effective decision making. In...

chapter

Prediction of User's Purchase Intention Based on Machine Learning

Liu Bing, Shi Yuliang

2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI) > 99 - 103

2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI)

In recent years, the use of machine learning methods to deal with the problem of user interest prediction has become a hot research direction in the field of electronic commerce. In the present stage, a naive Bayesian algorithm has the advantages of simple implementation and high classification efficiency. However, this method is too dependent on the distribution of samples in the sample space, and...

chapter

The application of decision tree C4.5 algorithm to soil quality grade forecasting model

Li Dongming, Li Yan, Yuan Chao, Li Chaoran, more

2016 First IEEE International Conference on Computer Communication and the Internet (ICCCI) > 552 - 555

2016 First IEEE International Conference on Computer Communication and the Internet (ICCCI)

This paper based on the analysis of the basic meaning in data mining and the structure of decision tree uses the decision tree algorithm — C4.5 to establish a soil quality grade prediction model and combines the soil composition in Lishu to be a training sample. C4.5 algorithm also expresses the acquired knowledge by means of quantitative rules. The experiment results manifest that the expression...

chapter

Performance analysis of cart and C5.0 using sampling techniques

M Balamurugan, S Kannan

2016 IEEE International Conference on Advances in Computer Applications (ICACA) > 72 - 75

2016 IEEE International Conference on Advances in Computer Applications (ICACA)

Data mining is the process of extracting the hidden predictive model from large databases. It has various methods and algorithms. Classification is a supervised method, which builds a model for predicting the new instances. Different algorithms like decision tree, neural networks, support vector machines, k nearest neighbour, Bayesian classification are available for the classification. Decision tree...

chapter

An approach to sample selection from big data for classification

Sheng Xing, Yulin He, Hong Zhu, Xizhao Wang

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 2928 - 2935

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

When traditional sample selection methods are used to compress large data sets, the computational complexity turns out to be very high and it is really time consuming. To avoid these shortcomings, we propose a new method to select samples based on non-stable cut points. With the basic characteristic of convex function that its extreme values occur at the endpoints of intervals, the method measures...

chapter

An analysis of machine learning techniques (J48 & AdaBoost)-for classification

Poonam Pandey, Radhika Prabhakar

2016 1st India International Conference on Information Processing (IICIP) > 1 - 6

2016 1st India International Conference on Information Processing (IICIP)

Extraction of relevant Information from data Is a challenging task. Many times an analyst may end up with an erroneous classifier because of huge, redundant, unreliable and noisy data. It may also be due to misinterpretation of results and usage of inappropriate techniques for a specific situation. In our study, we have investigated the two main approaches in data mining which are Decision Tree (J48...

chapter

Comparison of machine learning algorithms for breast cancer

Palli Suryachandra, P Venkata Subba Reddy

2016 International Conference on Inventive Computation Technologies (ICICT) > 3 > 1 - 6

2016 International Conference on Inventive Computation Technologies (ICICT)

Machine learning algorithms are computer programs that try to predict cancer type based on the past data. The eventual goal of Machine learning algorithms in cancer diagnosis is to have a trained machine learning algorithm that gives the gene expression levels from cancer patient, can accurately predict what type and severity of cancer they have, aiding the doctor in treating it. The existing technology...

chapter

A hybrid ensemble of machine and statistical learning using confidence-based boosting

Nattawut Chairatanasongporn, Saichon Jaiyen

2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE) > 41 - 45

2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE)

Nowadays, the classification problems have become more challenging due to the various types of data set. Some data are appropriated for machine learning techniques and some data are appropriated for statistical leaning techniques. This work proposes a new hybrid ensemble of machine and statistical learning models using confidence-based boosting. The proposed method which uses variants of based classifiers...

chapter

An Experimental Evaluation of Data Mining Algorithms Using Hyperparameter Optimization

Rayrone Z.N. Marques, Luciano R. Coutinho, Tiago B. Borchartt, Samyr B. Vale, more

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI) > 152 - 156

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)

The challenge to choose the best algorithm and its best parameters for a given problem is known as Combined Algorithm Selection and Hyperparameter Optimization Problem. Among all the classification algorithms available are those based on human comprehensible representations, such as decision trees and classification rule induction. These algorithms are usually chosen by the clarity of the results...

chapter

Decision tree algorithm optimization research based on MapReduce

Fangfang Yuan, Fusheng Lian, Xingjian Xu, Zhaohua Ji

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 1010 - 1013

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

With the advent of the computer science, the data volume that needed to be processed under many practical situations increases dramatically, challenging many traditional machine learning techniques. Bearing this in mind, we made an intensive study on the optimization of decision tree algorithm and its corresponding porting to the big data analysis in this paper. An optimized genetic algorithm is merged...

chapter

On machine learning technique selection for classification

Rahmad Kurniawan, Mohd Zakree Ahmad Nazri, M. Irsyad

2015 International Conference on Electrical Engineering and Informatics (ICEEI) > 540 - 545

2015 International Conference on Electrical Engineering and Informatics (ICEEI)

Extracting meaningful pattern from data can be challenging. Irrelevant, redundant, noisy and unreliable data, misinterpretation of results and incompatibility of a technique to extract unknown patterns from data may lead analyst to develop an erroneous classifier. This research is encouraged by ‘No Free Lunch’ theorem that can be simplified as no classification technique that works best for every...

chapter

Using of a matrix method of building of nonbinary decision trees for determining of stability of fixation of a tibia fracture

M. S. Kupriyanov, J. A. Shichkina, E. Y. Shukeilo

2015 XVIII International Conference on Soft Computing and Measurements (SCM) > 283 - 286

2015 XVIII International Conference on Soft Computing and Measurements (SCM)

Rapid development of information technologies, in particular, progress in methods of collection, storage and processing of data has allowed to collect huge data arrays with the purpose of their analysis in many organizations. Opportunities of experts are not enough because amount of these data are too much. This generates demand for methods of automatic data analysis number of which annually increase...

chapter

The Based on Rough Set Theory Development of Decision Tree after Redundant Dimensional Reduction

Priya Pal, Deepak Motwani

2015 Fifth International Conference on Advanced Computing & Communication Technologies > 278 - 282

2015 Fifth International Conference on Advanced Computing & Communication Technologies (ACCT)

Decision tree technologists have been examined to be a helpful way to find out the human decision making within a host. Decision tree performs variable screening or feature selection. It requires relatively lesser effort from the users for the preparation of the data. In the proposed algorithm firstly we have undertaken to minimize the unnecessary redundancy in the decision tree, reducing the volume...

INFONA - science communication portal

Advanced search

Advanced search in people

A systematic framework to discover pattern for web spam classification

Prediction and diagnosis of leukemia using classification algorithms

Anomaly detection on a real-time server using decision trees step by step procedure

Analyzing Titanic disaster using machine learning algorithms

Predictive analytics for E learning system

An Improved Decision Tree Method Base on RELIEFF for Medical Diagnosis

Intrusion detection system using data mining a review

Study of machine learning algorithms for special disease prediction using principal of component analysis

Prediction of User's Purchase Intention Based on Machine Learning

The application of decision tree C4.5 algorithm to soil quality grade forecasting model

Performance analysis of cart and C5.0 using sampling techniques

An approach to sample selection from big data for classification

An analysis of machine learning techniques (J48 & AdaBoost)-for classification

Comparison of machine learning algorithms for breast cancer

A hybrid ensemble of machine and statistical learning using confidence-based boosting

An Experimental Evaluation of Data Mining Algorithms Using Hyperparameter Optimization

Decision tree algorithm optimization research based on MapReduce

On machine learning technique selection for classification

Using of a matrix method of building of nonbinary decision trees for determining of stability of fixation of a tibia fracture

The Based on Rough Set Theory Development of Decision Tree after Redundant Dimensional Reduction

Filter options

Publication date

Keywords

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options