The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper we deal with the problem of feature selection by introducing a new approach based on Gravitational Search Algorithm (GSA). The proposed algorithm combines the optimization behavior of GSA together with the speed of Optimum-Path Forest (OPF) classifier in order to provide a fast and accurate framework for feature selection. Experiments on datasets obtained from a wide range of applications,...
This paper discusses a method for improving accuracy of fuzzy-rule-based classifiers using particle swarm optimization (PSO). Two different fuzzy classifiers are considered and optimized. The first classifier is based on Mamdani fuzzy inference system (M_PSO fuzzy classifier). The second classifier is based on Takagi-Sugeno fuzzy inference system (TS_PSO fuzzy classifier). The parameters of the proposed...
Recognition of named entities (people, companies, locations, etc) is an essential task of text analytics. We address the subproblem of this task, namely, named entity classification. We propose a novel approach that constructs an effective fine-grained named entity classifier. Its key highlights are semi-automatic training set construction from Wikipedia articles and additional feature selection....
Nowadays many sensors for information acquisition are widely employed in remote sensing and different properties of the objects can be revealed. Unfortunately each imaging sensor has its own limits on scene recognition in the sense of thematic, temporal, and other interpretation. Integration (fusion) of different data types is expected to increase the quality of scene interpretation and decision making...
In this article, a new Multi Agent System based on Online Sequential Extreme Learning Machine (OSELM) neural network and Bayesian Formalism (MAS-OSELM-BF) is introduced. It is an improvement of a single OSELM (single agent) by combined multiple OSELMs (multi agents) with a final decision making module (parent agent). Here, the development of the parent agent is motivated by the Bayesian Formalism...
Discriminative subgraphs can be used to characterize complex graphs, construct graph classifiers and generate graph indices. The search space for discriminative subgraphs is usually prohibitively large. Most measurements of interestingness of discriminative subgraphs are neither monotonic nor antimonotonic with respect to subgraph frequencies. Therefore, branch-and-bound algorithms are unable to mine...
A classifier model for satellite image data by using Partitioned-Feature based Classifier (PFC)is proposed in this paper. The PFC does not use concatenated feature vectors extracted from the original data at once to classify each datum, but uses extracted feature vectors to classify data separately. In the training stage, the contribution rate calculated from each feature vector group is drawn throughout...
In general, music retrieval and classification methods using music moods use a lot of acoustic features similar to music genre classification. These features are used as the spectral features, the rhythm features, the harmony features, and so on. However, all of these features may not be efficient for music retrieval and classification using music moods. Hence, in this paper, we propose a feature...
Natural Language processing (NLP) is a field of computer science and linguistics concerned with the interactions between computers and human (natural) languages. Ambiguity is one of these problems which have been a great challenge for computational linguists. This paper concentrates on the problem of target word selection in Myanmar to English machine translation, for which the approach is directly...
Kernel Principal Component Analysis (KPCA) is a widely used technique in the dimension reduction, de-noising and discovering nonlinear intrinsic dimensions of data set. In this paper we describe a reweighing kernel-based classification method for improving recognition problem. Firstly, we map the training samples to the feature space by non-linear transformation, and then perform principal component...
The infant cry signals with asphyxia have distinct patterns which can be recognized using pattern classifiers such as Artificial Neural Network (ANN). This study investigates the effect of selecting infant cry features using the Binary Particle Swarm Optimization on the performance of Multilayer Perceptron (MLP) classifier in discriminating between healthy and infants with asphyxia from cry signals...
Classifying biomedical spectra is often difficult due to the bir voluminous nature; typically, only a small subset of spectral features is discriminatory, while the large majority tends to have a confounding effect on pattern classifiers. We present a two-pronged approach to dealing with this issue. First, we describe an iterative technique whereby many classifier instances operate on different feature...
Currently, network intrusion detection is in face of the conflict between the difficult to label data and the high accuracy request to detect intrusion. In this paper, we propose a SVM co-training based method to detect network intrusion. It exploits the large amount of unlabeled data, and increase the detection accuracy and stability by co-training two classifiers. The simulation results show that...
Decision trees induction is among powerful and commonly encountered architecture for extracting of classification knowledge from datasets of labeled instances. However, learning decision trees from large irrelevant datasets is quite different from learning small and moderate sized datasets. In this paper, we propose a simple yet effective composite splitting criterion equal to a random sampling approach...
In real world classification tasks, the original instances are represented by raw features. Usually domain related algorithms are needed to extract discriminative features. But the algorithms selection and additional parameters tuning are difficult for people with little domain knowledge and experience. In this paper, a new machine learning framework called "decompose learning" is proposed...
In this paper, we present a consumption pattern recognition system based on SVM. It can produce an optimized classification pattern using SVM algorithm and use the pattern to predict consumer behaviors. In this system, three dimension reduction methods including Principal Component Analysis (PCA), correlation analysis and data cubes are applied to reduce dimension of features and two training methods...
E-mail is a major revolution taking place over traditional communication systems due to its convenient, economical, fast, and easy to use nature. A major bottleneck in electronic communications is the enormous dissemination of unwanted, harmful emails known as spam emails. In this paper, a novel spam filtering framework (NSFF) is proposed, which is based on particle swarm optimization, fuzzy logic...
This article proposes such a question classification approach that integrates multiple semantic features. It is aimed at these two questions in Chinese question classification models: inaccurate semantic information extraction and too slow processing speed caused by too high Eigenvector dimension. With the help of HowNet and the support vector machine and syntactic and semantic information of question...
Nowadays, text classification has been one of the key subjects in intelligent information processing. Owing to the complex features of natural language, the feature space dimensions will be particularly high. How to improve the accuracy of text classification is an important and hard problem. As rough set is a useful tool to deal with uncertain information, a hybrid algorithm for text classification...
This paper describes an application of the Orthogonal Least Squares (OLS) algorithm for feature selection of spoken letters. Traditionally used for system identification purposes, the OLS method was used to select important Mel-Frequency Cepstrum Coefficients (MFCC) for classification of two spoken letters - `A' and `S' using Multi-Layer Perceptron (MLP) neural network. We evaluated several network...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.