Search results

Items from 1 to 20 out of 22 results

chapter

NMF-Based LncRNA-Disease Association Inference and Bi-Clustering

Ashis Kumer Biswas, Jean X. Gao, Baoju Zhang, Xiaoyong Wu

2014 IEEE International Conference on Bioinformatics and Bioengineering > 97 - 104

2014 IEEE International Conference on Bioinformatics and Bioengineering (BIBE)

Long non-coding RNAs (lncRNAs) have been implicated in various biological processes, and are linked to many dysregulations. Researchers have reported a large number of lncRNA-associated human diseases over the past decade. In this article we employed the Non-negative Matrix Factorization method to develop a low-dimensional computational model that can describe the existing knowledge about lncRNA-disease...

chapter

An efficient computational intelligence technique for classification of protein sequences

Muhammad Javed Iqbal, Ibrahima Faye, Abas Md Said, Brahim Belhaouari Samir

2014 International Conference on Computer and Information Sciences (ICCOINS) > 1 - 6

2014 International Conference on Computer and Information Sciences (ICCOINS)

Many artificial intelligence techniques have been developed to process the constantly increasing volume of data to extract meaningful information from it. The accurate annotation of the unknown protein using the classification of the protein sequence into an existing superfamily is considered a critical and challenging task in bioinformatics and computational biology. This classification would be...

chapter

A Comparison Study of Virus Classification by Genome Sequences

Jing-Doo Wang

2011 IEEE 11th International Conference on Bioinformatics and Bioengineering > 270 - 273

2011 IEEE 11th International Conference on Bioinformatics & Bioengineering (BIBE)

In this study, instead of traditional approaches to virus classification, we proposed a novel approach in the vector space model for virus classification via two types of genome sequences, DNA and CDS. For DNA sequence, in this study, the k-mer approach was adopted for pattern extraction and the entropy of the pattern frequency distribution among classes was for pattern weighting. For CDS sequence,...

chapter

Evaluation of DNA mapping schemes for exon detection

S D Sharma, K Shakya, S N Sharma

2011 International Conference on Computer, Communication and Electrical Technology (ICCCET) > 71 - 74

2011 International Conference on Computer, Communication and Electrical Technology (ICCCET)

Identification of protein coding regions (exons) in eukaryotic genomic sequences is an active area of research at present. Mapping of symbolic genomic sequences to numeric sequences is the first step required for processing them using digital signal processing (DSP) tools. For DFT-based methods paired numeric and frequency of nucleotide are reported as the best mapping schemes. In this work performance...

chapter

Sparse nonnegative matrix factorization with the elastic net

Weixiang Liu, Songfeng Zheng, Sen Jia, Linlin Shen, more

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 265 - 268

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2010)

Nonnegative matrix factorization is used extensively for feature extraction and clustering analysis. Recently many sparsity/sparseness constraints, such as L₁ penalty, are introduced for sparse nonnegative matrix factorization. Inspired by sparsity measures from linear regression model, this paper proposes to integrate nonnegative matrix factorization with another sparsity constraint, the elastic...

chapter

Performance analysis of different DNA to numerical mapping techniques for identification of protein coding regions using tapered window based short-time discrete Fourier transform

M K Hota, V K Srivastava

2010 International Conference on Power, Control and Embedded Systems > 1 - 4

2010 International Conference on Power, Control and Embedded Systems (ICPCES 2010)

Prior to applying the digital signal processing techniques for identification of protein coding regions, mapping of DNA alphabet into numerical sequences is necessary. In this paper, the performance of existing DNA to numerical mapping techniques is analyzed at the nucleotide level for the identification of protein coding regions using tapered window based short-time discrete Fourier transform (ST-DFT)...

chapter

Analysis of threshold influence on the accuracy of gene-prediction methods based on power spectrum analysis

Shanglei Xu, Nini Rao, Xi Chen, Guangxiong Liu, more

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 1 - 4

2010 10th International Conference on Signal Processing (ICSP 2010)

The accuracy of methods based on power spectrum analysis depends on the threshold that is used to discriminate the coding and non-coding sequences. Due to gene structural differences of different organisms, we inferred that there is an optimal gene prediction threshold for each organism. To prove this, we analyzed real biological data, and found that there are indeed different optimal thresholds for...

chapter

Prediction of O-Glycosylation Sites in Protein Sequence by Kernel Principal Component Analysis

Xue-mei Yang, Xue-wei Cui, Xue-zhu Yang

2010 International Conference on Computational Aspects of Social Networks > 267 - 270

2010 International Conference on Computational Aspects of Social Networks (CASoN 2010)

O-glycosylation is one of the main types of the mammalian protein glycosylation, it occurs on the particular site of serine and threonine. It's important to predict the O-glycosylation site. In this paper, we propose a new method of kernel principal component analysis (KPCA) to predict the O-glycosylation site with window size w=9. The samples for experiment are encoded by the sparse coding and projected...

chapter

3D protein model assessment using geometric and biological features

Anjum Reyaz-Ahmed, Robert Harrison, Yan-Qing Zhang

The 2nd International Conference on Software Engineering and Data Mining > 351 - 354

2nd International Conference on Software Engineering and Data Mining (SEDM 2010)

Automatic prediction of protein three-dimensional structures from its amino acid sequence has become one of the most important researched fields in bioinformatics. With that increases the importance of determining the quality of these protein models. Protein three-dimensional structure evaluation is a complex problem in computational structure biology. We attempt to solve this problem using SVM and...

chapter

Assessment of Gene Annotation Accuracy by Inferring Transcripts from RNA-Seq

J. Martin, Wenhan Zhu, N. Bergman, M. Borodovsky

2009 IEEE International Conference on Bioinformatics and Biomedicine > 54 - 59

2009 IEEE International Conference on Bioinformatics and Biomedicine. BIBM 2009

Next generation sequencing is quickly changing long standing paradigms of genomics in terms of what is feasible to accomplish within a "research life time" and what is supposed to remain beyond limits of reliable experimental analysis. Sequencing and mapping of a prokaryote transcriptome can provide experimental validation for computationally predicted genes annotated in a prokaryotic genome...

chapter

Optimization of Multi-classifiers for Computational Biology: Application to the Gene Finding Problem

R. Romero-Zaliz, C. del Val, I. Zwir

2009 Ninth International Conference on Intelligent Systems Design and Applications > 1233 - 1238

2009 Ninth International Conference on Intelligent Systems Design and Applications (ISDA 2009)

Genomes of many organisms have been sequenced over the last few years. However, transforming such raw sequence data into knowledge remains a hard task. A great number of prediction programs have been developed to address part of this problem: the location of genes along a genome. We propose a multiobjective methodology to combine algorithms into an aggregation scheme in order to obtain optimal methods'...

chapter

Enhancing prediction understandability for transmembane segments by BoostingFOIL

Jieyue He, Pingping Chen, Dejing Zhao, Wei Zhong

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 1 > 739 - 743

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

In recent years, many studies have focused on improving the accuracy of prediction of trans-membrane segments, and many significant results have been achieved. In spite of these considerable results, the existing methods lack the ability to explain the process of how a learning result is reached and why a prediction decision is made. The explanation of the decision process is important for acceptance...

chapter

A committee of NNA classifiers for the prediction of the binding between miRNAs and the target genes using a novel coding method

Zhisong He, Kaiyan Feng, Yudong Cai

2009 IEEE International Conference on Systems, Man and Cybernetics > 4287 - 4292

2009 IEEE International Conference on Systems, Man and Cybernetics. SMC 2009

We present a paper for the prediction of the bindings between microRNAs (miRNAs) and their target genes. A novel coding for the miRNAs, the binding sites (i.e. the target genes) and the flanking sequences of the binding sites is adopted to code the related information comprehensively. A feature selection method, Minimum Redundancy Maximum Relevance (mRMR), is used to filter out ineffective and redundant...

chapter

Splice site detection in DNA sequences using a fast classification algorithm

J. Cervantes, Xiaoou Li, Wen Yu

2009 IEEE International Conference on Systems, Man and Cybernetics > 2683 - 2688

2009 IEEE International Conference on Systems, Man and Cybernetics. SMC 2009

Support vector machines (SVMs) are known to be excellent algorithms for classification problems. The principal disadvantage of SVMs is due to its excessive training time in large data set, such as DNA sequences. This paper presents a novel SVMs classification method which reduces significantly the input data set using Bayesian technique. Using this system, we are able to predict with a high accuracy...

chapter

Distinguishing Coding from Non-coding Sequences in a Prokaryote Complete Genome Based on the Global Descriptor

Guo-Sheng Han, Zu-Guo Yu, V. Anh, R.H. Chan

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery > 5 > 42 - 46

2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009)

Recognition of coding sequences in a complete genome is an important problem in DNA sequence analysis. Their rapid and accurate recognition contributes to various relevant research and application. In this paper, we aim to distinguish the coding sequences from the non-coding sequences in a prokaryote complete genome. We select a data set of 51 available bacterial genomes. Then, we use the global descriptor...

chapter

StSUT2 Structure Prediction Based on Nucleic Acid Sequence Using GA-BP

Zhengwei Zhu, Yuying Guo

2009 Fifth International Conference on Natural Computation > 3 > 149 - 153

2009 Fifth International Conference on Natural Computation (ICNC 2009)

The protein secondary structure (PSS) prediction system presented in this paper is a subsystem of a potato bioinformation research platform. The proposed method is a novel and practical PSS prediction method, which is based on nucleic acid sequence (NAS), uses a combined neural network (CNN) and takes an improved genetic algorithm (GA) to optimize the connection weights of CNN. The experimental results...

chapter

An Improved Fourier Method for DNA Sequence Classification

Baoshan Ma, Yisheng Zhu, Yuzhen Chen

2009 3rd International Conference on Bioinformatics and Biomedical Engineering > 1 - 4

2009 3rd International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2009)

The theory and methods of signal processing are becoming increasingly important in bioinformatics and systems biology. The ordinary Fourier analysis is satisfactory for the long DNA sequences to detect period-3 property, but is without impressive success for the short DNA sequences. An improved Fourier method is proposed to increase the accuracy of gene identification by amplifying period-3 behavior...

chapter

Improved Prediction Method of Protein Contact Based on RBF Neural Network

Pengfei Sun, Jianpei Zhang

2009 3rd International Conference on Bioinformatics and Biomedical Engineering > 1 - 4

2009 3rd International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2009)

In this paper, a prediction method of protein contact on the basis of information granules and RBF neural network has been brought forward. This method improved the encoding approach of protein structure data and classifier performance to enhance the predicting accuracy of protein contact. 200 nonhomologous proteins from the PDB database were encoded according to the encoding approach and were taken...

chapter

An integrative algorithm for predicting protein coding regions

Shuo Guo, Yi-Sheng Zhu

APCCAS 2008 - 2008 IEEE Asia Pacific Conference on Circuits and Systems > 438 - 441

APCCAS 2008 - 2008 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS)

Due to the enormous amount of data in DNA sequences to be processed, the computational complexity and speed are important issues to be considered. In this paper, a new integrative method is presented for predicting protein coding regions. We first establish a Takagi-Sugeno fuzzy model to identify the first nucleotide of a codon in coding regions, then the time-frequency characteristics of the output...

chapter

Predicting translation initiation sites using a multi-agent architecture empowered with reinforcement learning

Jia Zeng, R. Alhajj

2008 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology > 241 - 248

2008 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2008)

The accurate recognition of translation initiation sites (TISs) is an important stage in genome annotation. Due to the complicated nature of the genetic information and our incomplete understanding of it, TIS prediction remains a challenging undertaking. Many computational approaches have been proposed in the literature, some of which have yielded quite impressive performance. However, most of them...

Data set:
ieee
Keywords:
ENCODING
ACCURACY
BIOINFORMATICS

