The paper considers the problem of feature selection in learning using privileged information (LUPI), where some of the features (referred to as privileged ones) are only available for training, while being absent for test data. In the latest implementation of LUPI, these privileged features are approximated using regressions constructed on standard data features, but this approach could lead to polluting...
One of the most challenging current problems in Gaussian process regression (GPR) is handling large-scale datasets and accommodating an online learning setting where data arrive irregularly on the fly. In this paper, we introduce a novel online Gaussian process model that can scale to massive datasets. Our approach is formulated based on an alternative representation of the Gaussian process under...
A hybrid sampling technique is proposed by combining Complementary Fuzzy Support Vector Machine (CMTFSVM) and Synthetic Minority Oversampling Technique (SMOTE) for handling the imbalanced classification problem. The proposed technique uses an optimised membership function to enhance the classification performance and it is compared with three different classifiers. The experiments consisted of four...
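For readers unfamiliar with the SMOTE component named in the abstract above, the following is a minimal sketch of its core interpolation step only. It is not the hybrid technique proposed in the paper, and the function name `smote`, its parameters, and the toy data are assumptions made for illustration. Each synthetic point is placed at a random position on the line segment between a minority sample and one of its nearest minority neighbours.

```python
import random


def smote(minority, n_synthetic, k=2, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (illustrative sketch)."""
    rng = random.Random(seed)

    def sq_dist(a, b):
        # Squared Euclidean distance between two feature vectors.
        return sum((x - y) ** 2 for x, y in zip(a, b))

    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority samples.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sq_dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy minority class with two features.
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
new_points = smote(minority, n_synthetic=6)
```

Because every synthetic point is a convex combination of two existing minority samples, the new points stay inside the region already occupied by the minority class.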
Electroencephalography (EEG) data record vast amounts of human cerebral activity, yet they are still reviewed primarily by human readers. Most of the time, the data are contaminated with signals of non-cerebral origin, called artifacts, which can be very difficult to detect visually and, if undiscovered, can damage the analysis of neural information. The purpose of our work is to detect the artifacts...
In this paper, a method for reducing coding artifacts introduced by lossy image compression is proposed. The method is similar to sample adaptive offset (SAO), which is adopted in the H.265/HEVC video coding standard as one of the in-loop filtering tools. In SAO, samples of the reconstructed image are classified into several categories based on some simple algorithms, and an optimum offset value is...
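The classify-then-offset idea described in the abstract above can be sketched as follows. This is a simplified illustration, not the paper's method and not the exact H.265/HEVC procedure: the classification by intensity band, the choice of four bands, the use of the mean error as the offset, and the names `band_offsets` and `apply_offsets` are all assumptions. The mean of (original - reconstructed) within a category is the constant offset that minimises squared error for that category.

```python
def band_index(value, n_bands, max_value=255):
    """Map a sample value to an intensity band of equal width."""
    band_width = (max_value + 1) // n_bands
    return min(value // band_width, n_bands - 1)


def band_offsets(original, reconstructed, n_bands=4, max_value=255):
    """Per-band offset: rounded mean of (original - reconstructed) over the
    samples whose reconstructed value falls in that band."""
    sums = [0.0] * n_bands
    counts = [0] * n_bands
    for o, r in zip(original, reconstructed):
        band = band_index(r, n_bands, max_value)
        sums[band] += o - r
        counts[band] += 1
    return [round(s / c) if c else 0 for s, c in zip(sums, counts)]


def apply_offsets(reconstructed, offsets, max_value=255):
    """Add each sample's band offset and clip to the valid range."""
    n_bands = len(offsets)
    return [
        min(max(r + offsets[band_index(r, n_bands, max_value)], 0), max_value)
        for r in reconstructed
    ]


# Hypothetical 8-bit samples: the decoder output drifts differently per band.
original = [10, 12, 100, 104, 200, 202]
reconstructed = [8, 10, 103, 107, 201, 203]
offsets = band_offsets(original, reconstructed)
corrected = apply_offsets(reconstructed, offsets)
```

In this toy case the error is constant within each band, so a single offset per band is enough to restore the original samples; on real images the offset only reduces the average error of each category.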
Context: Software Bug Severity Classification can help to improve the software bug triaging process. However, severity levels present a high level of data imbalance that needs to be taken into account. Aim: We investigate cost-sensitive strategies in multi-class bug severity classification to counteract data imbalance. Method: We transform datasets from three severity classification papers to a common...
Recognizing secondary structures in proteins can be a highly computationally expensive task that may not always yield good results. Using Restricted Boltzmann Machines (RBM), we were able to train a simple neural network to recognize an alpha-helix with a good degree of accuracy. By modifying the RBM implementation to be much simpler and more efficient than the standard implementation, we are able to see...
Feature representation plays an important role in text classification. Feature mapping based on label information is an algorithm suitable for Binary Relevance. Compared with conventional text representation, it keeps the dimension of the text representation under control by means of word embedding. More importantly, it takes full advantage of the general characteristics of the label on text representation...
Traffic anomaly detection is primarily concerned with identifying malicious traffic patterns in a much larger stream of benign traffic. Traditionally, this is achieved by selecting a very specialized set of traffic-based features that are used both for training a model and for detection at runtime. This paper introduces a novel method of anomaly detection that breaks the assumption that the...
Pooling second-order local feature statistics to form a high-dimensional bilinear feature has been shown to achieve state-of-the-art performance on a variety of fine-grained classification tasks. To address the computational demands of high feature dimensionality, we propose to represent the covariance features as a matrix and apply a low-rank bilinear classifier. The resulting classifier can be evaluated...
The support vector machine (SVM) is a popular machine learning method and has been widely applied in many real-world applications. Since SVM is sensitive to noise, fuzzy SVM (FSVM) has been proposed to relieve the over-fitting problem caused by noise by assigning a fuzzy membership to each sample. Different samples then make different contributions to the learning of the classification hyperplane...
When beginners practice Chinese calligraphy, they often copy from ancient calligraphic works and try to imitate the style as closely as possible. However, there are inevitably some characters whose styles are not correctly followed. Thus we are motivated to detect the style consistency of all written characters in one practice. With the styles extracted by using stacked autoencoders of deep neural...
This paper extends the idea of Universum learning to regression problems. We propose a new Universum-SVM formulation for regression problems that incorporates a priori knowledge in the form of additional data samples. These additional data samples, or Universum samples, belong to the same application domain as the training samples, but they follow a different distribution. Several empirical comparisons...
In order to analyze the economic performance of a thermal power plant, a partial least squares support vector machine coupling model was constructed. First, the coal consumption rate, an important indicator of the economy of a power plant, was selected as the evaluating indicator. At the same time, the physical quantities were established, which are closely related to coal consumption...
Unknown awareness is very important for many applications such as face recognition. In a typical unknown aware classifier, an “unknown” label is assigned to strange test instances. This study proposes an unknown aware classifier known as UAkNN by extending the well-known kNN classifier. In UAkNN, unknown awareness is achieved by exploiting distances between instances of individual classes. These distances...
A biometric system is a pattern recognition system that automatically identifies people according to their physiological and behavioral properties. Among the physiological properties, the hand has a special place, since all of its features, such as palm lines, inner knuckles, external knuckles and geometry, can be used. More recently, the use of the blood vessel pattern in the palm, in addition to the high acceptability,...
Automatic spoken digit recognition is one of the important areas in speech recognition. Local-language spoken digit recognition is the next stage in this technological advancement. This paper presents a new approach for Pashto digit recognition using spectral and prosodic feature extraction. Very little or almost no work has been done in Pashto spoken digit recognition. That is why no standard...
The P300 Speller is a Brain Computer Interface that enables communication using the EEG signal. The P300 wave is an Event Related Potential that occurs as a response to a familiar stimulus. This system can be used to aid persons who are unable to communicate via conventional methods. In this paper, the P300 Speller has been modified to allow communication in three languages: English, Sinhala and Tamil...
Micro-expression recognition is a challenging task in the computer vision field due to the repressed facial appearance and short duration. Previous work on micro-expression recognition has used hand-crafted features like LBP-TOP, Gabor filters and optical flow. This paper is the first work to explore the possible use of deep learning for the micro-expression recognition task. Due to the lack of data for...
Domain adaptation (DA) aims to eliminate the difference between the distribution of the labeled source domain, on which a classifier is trained, and that of the unlabeled or partly labeled target domain, to which the classifier is to be applied. Compared with semi-supervised domain adaptation, where some labeled data from the target domain are used to help train the classifier, the unsupervised domain adaptation...