The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper provides a novel and unified framework of representation based classification technique. The proposed atomic representation based classification (ARC) framework includes, but not limited to, sparse representation based classification (SRC), low-rank representation based classification (LRRC) as special cases. Despite good performance, most existing classification methods are heavily reliant...
In this letter, we present a novel feature extraction method for sound event classification, based on the visual signature extracted from the sound's time-frequency representation. The motivation stems from the fact that spectrograms form recognisable images, that can be identified by a human reader, with perception enhanced by pseudo-coloration of the image. The signal processing in our method is...
This paper presents a new approach for the classification of non-stationary signal patterns in an electric power network using a modified wavelet transform and neural network. The wavelet transform is phase corrected to yield a new transform known as S-transform, which has an excellent time-frequency resolution characteristic. The phase correction absolutely references the phase of the wavelet transform...
A new loss function has been introduced for Minimum Classification Error, that approaches optimal Bayes' risk and also gives an improvement in performance over standard MCE systems when evaluated on the Aurora connected digits database.
In this paper we propose a novel Support Vector Machine(SVM) based approach for noisy data removal from datasets. It is observed that the instability present in the dataset greatly affects the overall performance of the any classifier. Hence, we propose a methodology for removal of such instabilities. In the proposed approach, we proceed by determining the clusters formed using support equilibrium...
Parallel corpora are essential for training statistical machine translation models. Since parallel sentence-aligned corpora are usually noisy due to inexact automatic methods when generated from parallel or comparable documents, we need to clean parallel corpora. In this paper, new features are introduced to assess the correctness of a sentence pair. Also, the impact of new features in combination...
Worms are self-contained programs that spread over the Internet. Worms cause problems such as lost of information, information theft and denial-of-service attacks. The first part of the paper evaluates the detection of worms based on content classification by using all machine learning techniques available in WEKA data mining tools. Four most accurate and quite fast classifiers are identified for...
We propose a medically driven data mining application: system for diagnosing of gait patterns related to health problems of elderly to support their independent living. Gait of elderly is captured with motion capture system, which consists of tags attached to the body and sensors situated in the apartment. Position of the tags is acquired by the sensors and the resulting time series of position coordinates...
The presence of noise is common in any real data set and may adversely affect the accuracy, construction time and complexity of the classifiers. Models built by Fuzzy Rule Based Classification Systems are recognised for their interpretability, but traditionally these methods have not considered the presence of noise in the data, so it would be interesting to quantify its effect on them. The aim of...
A tool for discovery of gait anomalies of elderly from motion sensor data is proposed. The gait of the user is captured with the motion capture system, which consists of tags attached to the body and sensors situated in the apartment. Position of the tags is acquired by the sensors and the resulting time series of position coordinates are analyzed with dynamic time warping and machine learning algorithms...
This paper analyzes the existing decision tree classification algorithms based on variable precision rough set and finds that these algorithms have better classification accuracies and can tolerate the noise data. But when choosing the best attribute using variable precision rough set, these algorithms still have the shortages in ID3. That is, these algorithms also tend to choose the attribute with...
Tracking concept drifts in data streams has recently become a hot topic in data mining. Most of the existing work is built on a single-window-based mechanism to detect concept drifts. Due to the inherent limitation of the single-window-based mechanism, it is a challenge to handle different types of drifts. Motivated by this, a new classification algorithm based on a double-window mechanism for handling...
Classification on noisy data streams has recently become one of the most important topics in streaming data mining. In this paper, a Classification algorithm for mining Data Streams based on Mixture Models of C4.5 and NB is proposed called CDSMM. In this algorithm, C4.5 is used as the base classifiers, the hypothesis testing method is introduced for the detection of concept drifts, and a Naïve Bayes...
Choquet integral with regards to a non-additive set function μ is a useful combination tool when we consider the interactions between classifiers. This combination method works very well at the expense of run time and the memory space. This paper introduces samples reduction technology to degrade the complexity of determining the non-additive set functions μ which is determined by genetic algorithm...
A new method for detecting and classifying loudspeaker faults is presented in this paper. Total response of high-order harmonics groups is measured and used as defect features of loudspeaker. Based on support vector machine (SVM), we built a classification system combined with one-class SVM and Directed Acyclic Graphic SVM (DAGSVM). Comparing with K-nearest neighbor (k-NN) classifier, the accuracy...
Hyper surface classification (HSC) based on Jordan Curve Theorem is proven to be a simple and effective method to classify large datasets. Like most of classification algorithms, noise could also impact its accuracy even if the HSC algorithm limits the influence of noise in a local small region. In this paper, we propose a method that intuitively captures the primary goal of improving the accuracy...
The problem of classification on highly imbalanced datasets has been studied extensively in the literature. Most classifiers show significant deterioration in performance when dealing with skewed datasets. In this paper, we first examine the underlying reasons for SVM's deterioration on imbalanced datasets. We then propose two modifications for the soft margin SVM, where we change or add constraints...
Nearest Neighbor Classifier is one of the most classical lazy learning schemes. The basic nearest neighbor classifiers suffer from the common problem that the instances used to train the classifier are all stored indiscriminately, and as a result, the required memory storage is huge and response time becomes slow with a large database. In this paper, a new Instances Selection algorithm based on Classification...
Accurate software effort estimation is essential for successful project management. To improve the accuracy, a number of estimation techniques have been developed. Among those, Analogy-Based Estimation (ABE) has become one of the mainstreams of effort estimation. In general, ABE infers the effort to accomplish a new project from the efforts of the historical projects which possess similar characteristics...
This paper presents details of a convenient and unobtrusive system for monitoring daily activities. A smart phone equipped with an embedded 3D-accelerometer was worn on the belt for the purposes of data recording. Once collected the data was processed to identify 6 activities offline (walking, posture transition, gentle motion, standing, sitting and lying). The processing technique adopted a novel...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.