The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
The adaptive elastic net has been widely studied in the microarray classification due to the elegant performances in gene selection. However, the classification accuracy will be affected if the noise is included. As such, this paper proposes a weighted adaptive elastic net for the binary microarray classification with noise by using the distances from the sample points to both class centers. Furthermore,...
This paper presents a method for designing binary trees for SVM classification. The proposed algorithm, multi-modal binary tree (MBT) tolerates misclassification in the upper nodes of the tree, allowing points to be classified in either output regardless of the initial specified class groupings. MBT can separate classes that are inseparable with a single classifier by using a piecewise division. The...
In this paper, we present a novel multi-scale and multi-position classification (MSPC) method for detection of clustered pleomorphic micro-calcifications in digital mammograms. With this method, mammograms are divided into sub-images from which the image features are extracted and a cascaded Support Vector Machine (SVM) classifier is used to detect pleomorphic calcifications. Using the MSPC method,...
Medicine is one of the major fields where the application of artificial intelligence primarily deals with construction of programs that perform diagnosis and make therapy recommendations. In digital mammography, data mining techniques are used to detect and characterize abnormalities in images and clinical reports. In the existing approaches, the mammogram image classification is done in either clinical...
Two of the most challenging problems in data mining are working with imbalanced datasets and with datasets which have a large number of attributes. In this study we compare three different approaches for handling both class imbalance and high dimensionality simultaneously. The first approach consists of sampling followed by feature selection, with the training data being built using the selected features...
Machine Learning algorithms have been widely used for gene expression data classification, despite the fact that these data have often intrinsic limitations, such as high dimensionality and a small number of examples. Few studies try to characterize to which extent these aspects can influence the performance of the classification models induced. In this paper we compute different measures characterizing...
Class imbalance is one of the challenging problems for machine learning algorithms. When learning from highly imbalanced data, most classifiers are overwhelmed by the majority class examples, so the false negative rate is always high. Although researchers have introduced many methods to deal with this problem, including resampling techniques and cost-sensitive learning (CSL), most of them focus on...
Microarray technology has been broadly used for monitoring the expression levels of thousands of genes simultaneously, providing the opportunities of identifying disease-related genes by finding differentially expressed genes in different conditions. However, a great challenge of analyzing microarray data is the significant noise brought by different experimental settings, laboratory procedures, genetic...
Microarray technology has been widely applied to search for biomarkers of diseases, diagnose diseases and analyze gene regulatory network. Abundance of expression data from microarray experiments are processed by informatics tools, such as supporting vector machines (SVM), artificial neural network (ANN), and so on. These methods achieve good results in single dataset. Nevertheless, most analyses...
Microarray datasets are often limited to a small number of samples with a large number of gene expressions. Therefore, dimensionality reduction through a feature/gene selection process is highly important for classification purposes. In this paper, a feature perturbation method we previously introduced is applied to do gene selection from microarray data. A publicly available colon cancer dataset...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.