Search results

Items from 61 to 80 out of 725 results

chapter

An Improvement to Feature Selection of Random Forests on Spark

Ke Sun, Wansheng Miao, Xin Zhang, Ruonan Rao

2014 IEEE 17th International Conference on Computational Science and Engineering > 774 - 779

2014 IEEE 17th International Conference on Computational Science and Engineering (CSE)

The Random Forests algorithm belongs to the class of ensemble learning methods, which are common used in classification problem. In this paper, we studied the problem of adopting the Random Forests algorithm to learn raw data from real usage scenario. An improvement, which is stable, strict, high efficient, data-driven, problem independent and has no impact on algorithm performance, is proposed to...

chapter

Sequence Classification Based on Delta-Free Sequential Patterns

Pierre Holat, Marc Plantevit, Chedy Raissi, Nadi Tomeh, more

2014 IEEE International Conference on Data Mining > 170 - 179

2014 IEEE International Conference on Data Mining (ICDM)

Sequential pattern mining is one of the most studied and challenging tasks in data mining. However, the extension of well-known methods from many other classical patterns to sequences is not a trivial task. In this paper we study the notion of &#x3B4;-freeness for sequences. While this notion has extensively been discussed for item sets, this work is the first to extend it to sequences. We...

chapter

SNOC: Streaming Network Node Classification

Ting Guo, Xingquan Zhu, Jian Pei, Chengqi Zhang

2014 IEEE International Conference on Data Mining > 150 - 159

2014 IEEE International Conference on Data Mining (ICDM)

Many real-world networks are featured with dynamic changes, such as new nodes and edges, and modification of the node content. Because changes are continuously introduced to the network in a streaming fashion, we refer to such dynamic networks as streaming networks. In this paper, we propose a new classification method for streaming networks, namely streaming network node classification (SNOC). For...

chapter

Modeling risk prediction of diabetes — A preventive measure

Bakshi Rohit Prasad, Sonali Agarwal

2014 9th International Conference on Industrial and Information Systems (ICIIS) > 1 - 6

2014 9th International Conference on Industrial and Information Systems (ICIIS)

Databases in clinical scenario have tremendous amount of data regarding patients and clinical history associated. Here, data mining plays vital role in searching for patterns within huge clinical data that could provide useful basis of knowledge for efficient and effective decision-making. Classification mechanism is widely used tool of data mining employed in healthcare applications to facilitate...

chapter

Clustering high-dimensional data via random sampling and consensus

Panagiotis A. Traganitis, Konstantinos Slavakis, Georgios B. Giannakis

2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 307 - 311

2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

In response to the urgent need for learning tools tuned to big data analytics, the present paper introduces a feature selection approach to efficient clustering of high-dimensional vectors. The resultant method leverages random sampling and consensus (RANSAC) arguments, originally developed for robust regression tasks in computer vision, to yield novel dimensionality reduction schemes. The advocated...

chapter

Optimization of feature selection method for high dimensional data using fisher score and minimum spanning tree

Bharat Singh, Jitendra Singh Sankhwar, Om Prakash Vyas

2014 Annual IEEE India Conference (INDICON) > 1 - 6

2014 Annual IEEE India Conference (INDICON)

For classification of High Dimensional data, feature selection is the most important step for obtaining optimal result with respect to processing power required and time taken. Feature selection is a method by which the most relevant feature is selected from a set of features containing redundant and irrelevant features thereby reducing the load on the classification algorithm. This paper proposes...

chapter

Feature Extraction Model to Identify At -- Risk Level of Students in Academia

Mamta Singh, Jyoti Singh, Arpana Rawal

2014 International Conference on Information Technology > 221 - 227

2014 International Conference on Information Technology (ICIT)

Since four decades, a sincere concern has aroused among managerial, professional, towards the satisfaction of teaching-learning objective in Academia. Huge span of time has already been spent revealing student's profile patterns using predictive modeling methods, however, very little effort is put up in identifying the causative features responsible for varied students' performances followed by decisive...

chapter

Feature selection using RapidMiner and classification through probabilistic neural network for fault diagnostics of power transformer

Hasmat Malik, Sukumar Mishra

2014 Annual IEEE India Conference (INDICON) > 1 - 6

2014 Annual IEEE India Conference (INDICON)

The diagnosis of incipient fault is important for power transformer condition monitoring. The incipient faults are monitored by conventional and artificial intelligence based models. The key gases, percentage value of gases and ratio of Doernenburg, Roger, IEC methods are input variables to artificial intelligence (AI) models which affects the accuracy of incipient fault diagnosis so selection of...

chapter

Reliable condition monitoring of an induction motor using a genetic algorithm based method

Won-Chul Jang, Myeongsu Kang, Jaeyoung Kim, Jong-Myon Kim, more

2014 IEEE Symposium on Computational Intelligence for Engineering Solutions (CIES) > 37 - 41

2014 IEEE Symposium on Computational Intelligence for Engineering Solutions (CIES)

Condition monitoring is a vital task in the maintenance of industry machines. This paper proposes a reliable condition monitoring method using a genetic algorithm (GA) which selects the most discriminate features by taking a transformation matrix. Experimental results show that the features selected by the GA outperforms original and randomly selected features using the same k-nearest neighbor (k-NN)...

chapter

Random forest algorithm for improving the performance of speech/non-speech detection

Sincy V. Thambi, K. T. Sreekumar, C. Santhosh Kumar, P. C Reghu Raj

2014 First International Conference on Computational Systems and Communications (ICCSC) > 28 - 32

2014 First International Conference on Computational Systems and Communications (ICCSC)

Speech/non-speech detection (SND) distinguishes between speech and non-speech segments in recorded audio and video documents. SND systems can help reduce the storage space required when only speech segments from the audio documents are required, for example content analysis, spoken language identification, etc. In this work, we experimented with the use of time domain, frequency domain and cepstral...

chapter

A framework of preparing corpora from social network sites for sentiment analysis

Walaa Medhat, Ahmed H. Yousef, Hoda Korashy

International Conference on Information Society (i-Society 2014) > 32 - 39

2014 International Conference on Information Society (i-Society)

This paper proposes a framework for preparing and using corpora from online social networks and review sites for sentiment analysis task. The framework consists of three phases. The first phase is the preprocessing and cleaning of data collected, then data annotation. The second phase is applying various text processing techniques including: removing stopwords, replacing the negation words and the...

chapter

Customer relationship management classification using data mining techniques

S. Ummugulthum Natchiar, S. Baulkani

2014 International Conference on Science Engineering and Management Research (ICSEMR) > 1 - 5

2014 International Conference on Science Engineering and Management Research (ICSEMR)

Customer Relationship Management possess Business Intelligence by incorporating information acquisition, information storage, and decision support functions to provide customized customer service. It enables customer representatives to analyze and classify data to address customer needs in order to promote greater customer satisfaction and retention, but in reality we have learned CRM classification...

chapter

Improving textual data classification and discrimination using an ad-hoc metric: Application to a famous text discrimination challenge

Jean-Charles Lamirel, Pascal Cuxac

2014 4th International Symposium ISKO-Maghreb: Concepts and Tools for knowledge Management (ISKO-Maghreb) > 1 - 6

2014 4th International Symposium ISKO-Maghreb: Concepts and Tools for knowledge Management (ISKO-Maghreb)

Labelling maximization (F-max) is an unbiased metric for estimation of the quality of non-supervised classification (clustering) that promotes the clusters with a maximum value of feature F-measure. In this paper, we show that an adaptation of this metric within the supervised classification allows to perform a selection of features and to calculate for each of them a function of contrast. The method...

chapter

Improved Intrusion Detection in DDoS Applying Feature Selection Using Rank & Score of Attributes in KDD-99 Data Set

Aditya Harbola, Jyoti Harbola, Kunwar Singh Vaisla

2014 International Conference on Computational Intelligence and Communication Networks > 840 - 845

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

In today's networked environment, massive volume of data being generated, gathered and stored in databases across the world. This trend is growing very fast, year after year. Today it is normal to find databases with terabytes of data, in which vital information and knowledge is hidden. The unseen information in such databases is not feasible to mine without efficient mining techniques for extracting...

chapter

A Biologically Verified Classification of Microarray Data

Ritwik Mondal, Bholanath Mahata, Srirupa Dasgupta

2014 International Conference on Computational Intelligence and Communication Networks > 686 - 690

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

A micro array represents thousands of gene expression levels across a few samples. Determination of an optimal set of features from such a high dimensional dataset requires a good feature selection method. Based on statistical significance of the features, an elimination of insignificant genes can be performed. However such methods lack biological validation. In this paper we propose a method where...

chapter

A Novel Approach for Classification of Schizophrenia Patients and Healthy Subjects Using Auditory Oddball Functional MRI

Akanksha Juneja, Bharti Rana, R.K. Agrawal

2014 13th Mexican International Conference on Artificial Intelligence > 75 - 81

2014 13th Mexican International Conference on Artificial Intelligence (MICAI)

Schizophrenia is a serious psychiatric illness which needs early and accurate diagnosis. Difference in activation patterns of schizophrenia patients and healthy subjects can be identified with the help of functional magnetic resonance imaging (fMRI). However, manual diagnosis using fMRI depends on subjective observation and may be erroneous. This has motivated the pattern recognition and machine learning...

chapter

Comparative analysis of multiple kernel learning on learning emotion recognition

Oryina Kingsley Akputu, Yunli Lee, Kah Phooi Seng

Proceedings of the 6th International Conference on Information Technology and Multimedia > 357 - 362

2014 International Conference on Information Technology and Multimedia (ICIMU)

Local appearance descriptors are widely used on facial emotion recognition tasks. With these descriptors, image filters, such as Gabor wavelet or local binary patterns (LBP) are applied on the whole or specific regions of the face to extract facial appearance changes. But it is also clear that beside feature descriptor; choice of suitable learning method that integrates feature novelty is vital. The...

chapter

Center-based group genetic algorithm for attribute clustering

Tzung-pei Hong, Chun-hao Chen, Feng-shih Lin, Shyue-liang Wang

2014 International Conference on Fuzzy Theory and Its Applications (iFUZZY2014) > 178 - 182

2014 International Conference on Fuzzy Theory and Its Applications (iFuzzy)

In our previous study, a grouping-geneticalgorithm- based (GGA-based) attribute clustering process has been proposed for grouping features. In this paper, we further improve its performance and propose a center-based GGA for attribute clustering (CGGA). A new encoding scheme with corresponding crossover and mutation operators are designed, and an improved fitness function is proposed to achieve better...

chapter

Detection and classification of voice pathology using feature selection

Malak Al Mojaly, Ghulam Muhammad, Mansour Alsulaiman

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA) > 571 - 577

2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA)

The aim of this study is to apply automatic speech recognition (ASR) mechanism to improve the amount of information extracted from the voice and to increase the accuracy of the system by using selective highly discriminative features among different types of acoustic features. For feature extraction, we applied three techniques which are Mel Frequency Cepstral Coefficient (MFCC), Linear Prediction...

chapter

Experiments with Feature-Prior Hybrid Ensemble Method for Classification

Junyang Zhao, Zhili Zhang, Chongzhao Han, Lijiang Sun

2014 Tenth International Conference on Computational Intelligence and Security > 223 - 227

2014 Tenth International Conference on Computational Intelligence and Security (CIS)

In multiple classifier systems, base classifiers are trained to be accurate and diverse by a set of training data. The generation of training data is necessary and important in classifier ensemble, which can be achieved by instance selection (IS) or feature selection (FS) on initial data. In this paper, a feature-prior FS-IS hybrid ensemble method is proposed by integrating feature selection with...

Keywords:
ACCURACY
FEATURE SELECTION

Publication date

Set your own date range

INFONA - science communication portal

Search results

An Improvement to Feature Selection of Random Forests on Spark

Sequence Classification Based on Delta-Free Sequential Patterns

SNOC: Streaming Network Node Classification

Modeling risk prediction of diabetes — A preventive measure

Clustering high-dimensional data via random sampling and consensus

Optimization of feature selection method for high dimensional data using fisher score and minimum spanning tree

Feature Extraction Model to Identify At -- Risk Level of Students in Academia

Feature selection using RapidMiner and classification through probabilistic neural network for fault diagnostics of power transformer

Reliable condition monitoring of an induction motor using a genetic algorithm based method

Random forest algorithm for improving the performance of speech/non-speech detection

A framework of preparing corpora from social network sites for sentiment analysis

Customer relationship management classification using data mining techniques

Improving textual data classification and discrimination using an ad-hoc metric: Application to a famous text discrimination challenge

Improved Intrusion Detection in DDoS Applying Feature Selection Using Rank & Score of Attributes in KDD-99 Data Set

A Biologically Verified Classification of Microarray Data

A Novel Approach for Classification of Schizophrenia Patients and Healthy Subjects Using Auditory Oddball Functional MRI

Comparative analysis of multiple kernel learning on learning emotion recognition

Center-based group genetic algorithm for attribute clustering

Detection and classification of voice pathology using feature selection

Experiments with Feature-Prior Hybrid Ensemble Method for Classification

Filter options

Publication date

Content availability

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options