Search results

Items from 1 to 20 out of 328 results

chapter

On classification of biological data using outlier detection

Yushan Qiu, Xiaoqing Cheng, Wenpin Hou, Wai-Ki Ching

12th International Symposium on Operations Research and its Applications in Engineering, Technology and Management (ISORA 2015) > 1 - 7

12th International Symposium on Operations Research and its Applications in Engineering, Technology and Management (ISORA 2015)

With the rapid development of information technology, the number of datasets, as well as their complexity and dimension, has been growing dramatically. This dramatic growth of biological data and non-biological commercial databases has become a challenging issue in data mining. Classification is one of the major tools in this research area. However, the performance of classification may...

chapter

A new gene subset selection approach based on linearly separating gene pairs

A Jafarian, A Ngom

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) > 105 - 110

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)

The concept of linear separability of gene expression data sets with respect to two classes has recently been studied in the literature. The problem is to efficiently find all pairs of genes which induce a linear separation of the data. It has been suggested that an underlying molecular mechanism relates together the two genes of a separating pair to the phenotype under study, such as a specific cancer...

chapter

Invited: Multiclass RNA function classification using next-generation sequencing

P Ryvkin, Yuk Yee Leung, Li-San Wang, B D Gregory

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) > 10

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)

RNA-seq produces detailed information including length, strand and pairing states, which can be leveraged to characterize RNA functional categories using machine-learning approaches. Using fruit fly small-RNA-seq data, we demonstrate that by combining read length correlation with multi-class classifier models, we can classify four non-coding RNA function classes with high precision.

chapter

Keynote: High-resolution sequence and chromatin signatures predict transcription factor binding in the human genome

Christina Leslie

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) > 2

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)

Accurately modeling the DNA sequence preferences of transcription factors and predicting their genomic binding sites are key problems in regulatory genomics. These efforts have long been frustrated by the limited availability and accuracy of TF binding site motifs. Today, protein binding microarray (PBM) experiments and chromatin immunoprecipitation followed by sequencing (ChIP-seq) experiments are...

chapter

Evaluation of missing values imputation methods in cDNA microarrays based on classification accuracy

V F Ghoneim, N H Solouma, Y M Kadah

2011 1st Middle East Conference on Biomedical Engineering > 367 - 370

2011 1st Middle East Conference on Biomedical Engineering (MECBME)

Many attempts have been made to deal with missing values (MV) in microarray data representing gene expression. This is a problematic issue as many data analysis techniques are not robust to missing data. Most of the MV imputation methods currently being used have been evaluated only in terms of the similarity between the original and imputed data. While imputed expression values themselves...

chapter

Poster: Linear B-cell epitope prediction based on Support Vector Machine and propensity scales

Hsin-Wei Wang, Ya-Chi Lin, Tun-Wen Pai, Hao-Teng Chang

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) > 264

2011 IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)

B-cell epitopes play an important role in developing synthetic peptide vaccines and inducing antibody responses. Applying biological experiments for epitope identification is time consuming and demands a lot of experimental resources. Designing a computer-aided B-cell linear epitope prediction system with high precision rates is therefore an important yet challenging task. In this paper,...

chapter

Fuzzy Discretization for Rough Set Based Gene Selection Algorithm

S Paul, P Maji

2011 Second International Conference on Emerging Applications of Information Technology > 317 - 320

Second International Conference on Emerging Applications of Information Technology (EAIT 2011)

Selection of reliable genes from microarray gene expression data is essential to carry out a diagnostic test and successful treatment. In this regard, a rough set based gene selection algorithm was recently developed to select genes from microarray data. In this paper, a fuzzy discretization method is proposed for the rough set based gene selection algorithm to compute relevance and significance of continuous...

chapter

Stacked spatial-pyramid kernel: An object-class recognition method to combine scores from random trees

N Larios, J Lin, M Zhang, D Lytle, et al.

2011 IEEE Workshop on Applications of Computer Vision (WACV) > 329 - 335

2011 IEEE Workshop on Applications of Computer Vision (WACV)

The combination of local features, complementary feature types, and relative position information has been successfully applied to many object-class recognition tasks. Stacking is a common classification approach that combines the results from multiple classifiers, having the added benefit of allowing each classifier to handle a different feature space. However, the standard stacking method by its...

chapter

A self-training semi-supervised support vector machine method for recognizing transcription start sites

Jun Cai Huang, Feng Bi Wang, Huan Zhang Mao, Ming Tian Zhou

The 2010 International Conference on Apperceiving Computing and Intelligence Analysis Proceeding > 372 - 375

2010 International Conference on Apperceiving Computing and Intelligence Analysis (ICACIA 2010)

The task of finding transcription start sites (TSSs) can be modeled as a classification problem. Semi-Supervised Support Vector Machines (S³VMs) are an appealing method for using unlabeled data in classification. By incorporating prior biological knowledge for recognizing TSSs, we propose a Self-Training S³VM (ST-S³VM) algorithm. ST-S³VM builds an SVM classifier based on small amounts of labeled data...

chapter

Using Randomised Vectors in Transcription Factor Binding Site Predictions

F Rezwan, Yi Sun, N Davey, R Adams, et al.

2010 Ninth International Conference on Machine Learning and Applications > 523 - 527

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Finding the location of binding sites in DNA is a difficult problem. Although the locations of some binding sites have been experimentally identified, other parts of the genome may or may not contain binding sites. This poses problems with negative data in a trainable classifier. Here we show that using randomized negative data gives a large boost in classifier performance when compared to the original...

chapter

Plant Species Classification Using a 3D LIDAR Sensor and Machine Learning

U Weiss, P Biber, S Laible, K Bohlmann, et al.

2010 Ninth International Conference on Machine Learning and Applications > 339 - 345

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

In the domain of agricultural robotics, one major application is crop scouting, e.g., for the task of weed control. For this task a key enabler is a robust detection and classification of the plant and species. Automatically distinguishing between plant species is a challenging task, because some species look very similar. It is also difficult to translate the symbolic high level description of the...

chapter

Peptide Sequence Tag-Based Blind Identification-based SVM Model

Hui Li, Chunmei Liu, Xumin Liu, M Diakite, et al.

2010 Ninth International Conference on Machine Learning and Applications > 979 - 984

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Identifying the ion types for a mass spectrum is essential for interpreting the spectrum and deriving its peptide sequence. In this paper, we proposed a novel method for identifying ion types and deriving matched peptide sequences for tandem mass spectra. We first divided our dataset into a training set and a testing set and then preprocessed the data using a Support Vector Machine and a 5-fold cross...

chapter

QSAR Studies on Toxicity of Organic Compounds to Chlorella vulgaris

Xin-Qi Lv, Yun-Tao Zhang

2010 Second WRI Global Congress on Intelligent Systems > 3 > 119 - 122

2010 Second WRI Global Congress on Intelligent Systems (GCIS 2010)

The quantitative structure-activity relationship (QSAR) studies on the toxicity of 91 organic compounds to Chlorella vulgaris have been performed using the ν-support vector machine (ν-SVM) algorithm, taking the 2D-autocorrelation descriptors as the structural parameters, based on variable selection with the particle swarm optimization (PSO) method. The correlation coefficient (R²) and Q_cv² of PSO-ν-SVM model...

chapter

Computational prediction of MicroRNA regulatory pathways

Dong Yue, Yidong Chen, Shou-Jiang Gao, Yufei Huang

2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW) > 386 - 391

2010 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW 2010)

MicroRNAs (miRNAs) are known to regulate transcription and/or protein translation of hundreds of genes. Despite their importance, the functions of most human miRNAs are still poorly understood. In this paper, we proposed an SVM-based algorithm, PathMicrO, that elucidates miRNA function by predicting miRNA-regulated pathways. PathMicrO combines the sequence-level target predictions with the...

chapter

Prediction of low coverage prone regions for Illumina sequencing projects using a support vector machine

Zejun Zheng, B Schmidt, G Bourque

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 13 - 16

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2010)

Applications of next-generation sequencing technologies have the potential to bring revolutionary changes to medicine and biology. However, coverage bias can pose a challenge to short read data analysis tools, which rely on high coverage. To address this issue we have developed a support vector machine (SVM) based method for predicting low coverage prone (LCP) regions on a given genome. The developed...

chapter

Protein structural class prediction using support vector machine

Gazi Mohammad Shafiullah, Hawlader Abdullah Al-Mamun

International Conference on Electrical & Computer Engineering (ICECE 2010) > 179 - 182

2010 6th International Conference on Electrical & Computer Engineering (ICECE 2010)

Protein structural class prediction can play a vital role in protein 3-D structure prediction by reducing the search space of 3-D structure prediction algorithms. In this paper we used a support vector machine to predict protein structural class based solely on amino acid sequence, i.e. mainly α, mainly β, α-β and fss from the CATH protein structure database; all-α, all-β, α/β, α+β from SCOP protein...

chapter

Linear vs. non-linear dimensionality reduction techniques in predicting class-II MHC peptide binding

F A Chakik, A M Shahin, W H Moudani, B El-Hassan, et al.

2010 5th Cairo International Biomedical Engineering Conference > 125 - 128

2010 5th Cairo International Biomedical Engineering Conference (CIBEC 2010)

A key step in the development of an adaptive immune response to vaccines is the binding of peptides to molecules of the Major Histocompatibility Complex (MHC) for presentation to T lymphocytes, which are thereby activated. Several algorithms have been proposed for such binding predictions, but are limited to a small number of MHC molecules and present imperfect prediction power. We are undertaking...

chapter

Non-Alignment Features Based Enzyme/Non-Enzyme Classification Using an Ensemble Method

Nicholas J Davidson, Xueyi Wang

2010 Ninth International Conference on Machine Learning and Applications > 546 - 551

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

As a growing number of protein structures are resolved without known functions, using computational methods to help predict protein functions from the structures becomes more and more important. Some computational methods predict protein functions by aligning to homologous proteins with known functions, but they fail to work if such homology cannot be identified. In this paper we classify enzymes/non-enzymes...

chapter

Local linear multi-SVM method for gene function classification

Benhui Chen, Feiran Sun, Jinglu Hu

2010 Second World Congress on Nature and Biologically Inspired Computing (NaBIC) > 183 - 188

2010 Second World Congress on Nature and Biologically Inspired Computing (NaBIC 2010)

This paper proposes a local linear multi-SVM method based on composite kernel for solving classification tasks in gene function prediction. The proposed method realizes a nonlinear separating boundary by estimating a series of piecewise linear boundaries. Firstly, according to the distribution information of training data, a guided partitioning approach composed of separating boundary detection and...

chapter

Application of rectangular features for the localization of fertile material in plant images

Upeka Premaratne

2010 Fifth International Conference on Information and Automation for Sustainability > 20 - 25

2010 5th International Conference on Information and Automation for Sustainability (ICIAfS)

Analysis of fertile material such as flowers and fruit is a key factor in the proper identification of plant species. Despite object recognition being a mature research area, the use of it in automated plant identification is still relatively new. This paper describes a novel method of detecting fertile material in plant images using rectangular features. Rectangular features are obtained for the...

Keywords:
SUPPORT VECTOR MACHINES
BIOLOGY COMPUTING

Content availability

Available (318)
None (10)

Keywords

PROTEINS (139)
PATTERN CLASSIFICATION (125)
SUPPORT VECTOR MACHINE (120)
MOLECULAR BIOPHYSICS (98)
GENETICS (83)
ACCURACY (76)
LEARNING (ARTIFICIAL INTELLIGENCE) (71)
TRAINING (65)
FEATURE EXTRACTION (61)
SVM (59)
CLASSIFICATION ALGORITHMS (49)
KERNEL (43)
BIOINFORMATICS (42)
DATA MINING (42)
DNA (42)
CANCER (35)
CELLULAR BIOPHYSICS (35)
MACHINE LEARNING (35)
AMINO ACIDS (31)
GENE EXPRESSION (26)
NEURAL NETS (26)
ARTIFICIAL NEURAL NETWORKS (23)
FEATURE SELECTION (23)
CLASSIFICATION (22)
DISEASES (22)
GENOMICS (22)
MACROMOLECULES (22)
MOLECULAR CONFIGURATIONS (22)
REGRESSION ANALYSIS (22)
PROTEIN SEQUENCE (20)
PREDICTIVE MODELS (19)
PREDICTION ALGORITHMS (17)
MICROARRAY DATA (16)
SUPPORT VECTOR MACHINE CLASSIFICATION (16)
BIOCHEMISTRY (15)
BIOLOGICAL TECHNIQUES (15)
CORRELATION (15)
DATA ANALYSIS (15)
GENE SELECTION (15)
PATTERN CLUSTERING (15)
PRINCIPAL COMPONENT ANALYSIS (15)
MICROORGANISMS (14)
GENETIC ALGORITHMS (13)
IMAGE CLASSIFICATION (13)
BIOLOGICAL SYSTEM MODELING (12)
HUMANS (12)
MEDICAL COMPUTING (12)
SEQUENCES (12)
SVM CLASSIFIER (12)
DECISION TREES (11)
PROTEIN SEQUENCES (11)
STATISTICAL ANALYSIS (11)
DATABASES (10)
SENSITIVITY (10)
SUPPORT VECTOR REGRESSION (10)
BAYES METHODS (9)
CANCER CLASSIFICATION (9)
COLON (9)
PATTERN RECOGNITION (9)
SUPPORT VECTOR MACHINE CLASSIFIER (9)
TESTING (9)
COMPUTATIONAL BIOLOGY (8)
LEAST SQUARES APPROXIMATIONS (8)
NOISE (8)
ORGANIC COMPOUNDS (8)
PROTEIN SECONDARY STRUCTURE PREDICTION (8)
PROTEOMICS (8)
BIOLOGY (7)
BIOMEDICAL COMPUTING (7)
CLUSTERING (7)
DECISION TREE (7)
DRUGS (7)
ENCODING (7)
FUZZY SET THEORY (7)
HISTOGRAMS (7)
MATRIX ALGEBRA (7)
MICROARRAY DATA ANALYSIS (7)
NEURAL NETWORKS (7)
PROTEIN-PROTEIN INTERACTION (7)
SPECTROSCOPY (7)
TUMOURS (7)
COMPOUNDS (6)
DATA MODELS (6)
DRUG DISCOVERY (6)
GENE EXPRESSION DATA (6)
GENETIC ALGORITHM (6)
IMAGE SEGMENTATION (6)
MATHEMATICAL MODEL (6)
MICROARRAY TECHNOLOGY (6)
PEPTIDES (6)
PROBABILITY (6)
PROTEIN (6)
PROTEIN ENGINEERING (6)
AMINO ACID (5)
AMINO ACID COMPOSITION (5)
BIOLOGICAL PROCESS (5)
BIOLOGICAL TISSUES (5)
CALIBRATION (5)

INFONA - science communication portal