. .

chapter

OntoBio: A Biodiversity Domain Ontology for Amazonian Biological Collected Objects

Andrea C.F. Albuquerque, Jose L. Campos dos Santos, Alberto N. de Castro

2015 48th Hawaii International Conference on System Sciences > 3770 - 3779

2015 48th Hawaii International Conference on System Sciences (HICSS)

The use of ontology presents a novel data integration resource, when centred in semantic definitions and the need for interoperability. Results from previews works indicate that ontologies can drive knowledge acquisition processes for the purpose of comprehensive, transportable machine understanding and knowledge management. Applied to the biodiversity domain, ontologies can be a valuable resource...

article

A Maximum A Posteriori Probability and Time-Varying Approach for Inferring Gene Regulatory Networks from Time Course Gene Microarray Data

Shing-Chow Chan, Li Zhang, Ho-Chun Wu, Kai-Man Tsui

IEEE/ACM Transactions on Computational Biology and Bioinformatics > 2015 > 12 > 1 > 123 - 135

Unlike most conventional techniques with static model assumption, this paper aims to estimate the time-varying model parameters and identify significant genes involved at different timepoints from time course gene microarray data. We first formulate the parameter identification problem as a new maximum a posteriori probability estimation problem so that prior information can beincorporated as regularization...

chapter

Optimal Bayesian feature selection on high dimensional gene expression data

Ali Foroughi Pour, Lori A. Dalton

2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 1402 - 1405

2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

Recent work proposes a Bayesian hierarchical model for feature selection in which priors are placed over the identity of each feature, as well as over the underlying feature-label distribution. Given data, Bayesian inference can be used to find a maximum posterior probability feature set. In this work, we examine the application of this theory to microarray data for biomarker discovery. A major challenge...

chapter

Random Forest and Gene Ontology for functional analysis of microarray data

Tham Wen Shi, Kohbalan Moorthy, Mohd Saberi Mohamad, Safaai Deris, more

2014 IEEE 7th International Workshop on Computational Intelligence and Applications (IWCIA) > 29 - 34

2014 IEEE 7th International Workshop on Computational Intelligence and Applications (IWCIA)

With the development of DNA microarray technology, scientists can now measure gene expression levels. However, such high-throughput microarray technologies produce a long list of genes with small sample size and high noisy genes. The data need to be further analysed and interpreting information on biological process requires a lot of practice and usually is a time consuming process. Most of the traditional...

chapter

Budgeted transcript discovery: A framework for joint exploration and validation studies

Sheehan Khan, Russell Greiner

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 188 - 191

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

This paper presents the budgeted transcript discovery problem (BTD): deciding how to spend a given research budget collecting data, using a combination of microarrays and PCRs, to discover which transcripts are differentially expressed with respect to a given phenotype. We present algorithms that address this task by sequentially analyzing the data collected so far, to decide which data would be most...

chapter

Coreference resolution in biomedical texts

Lishuang Li, Liuke Jin, Zhenchao Jiang, Jing Zhang, more

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 12 - 14

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Coreference resolution recently plays a more and more important role for many natural language processing tasks. In this paper, we propose two methods for the biomedical coreference resolution. One is the single machine learning method (SVM ranker-learning algorithm) which selects appropriate features for the pronoun and noun phrase coreference resolution respectively. The other one is the hybrid...

chapter

Storing provenance data of genome project workflows using graph database

Rodrigo Pinheiro, Bruno Aires, Aleteia F. Araujo, Maristela Holanda, more

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 16 - 22

2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Many scientific experiments are designed as computational workflows in bioinformatics. However, the amount of data generated increases at every phase of each execution, hindering the identification of the source and the transformation of data. Therefore, it has become necessary to create new tools to store data provenance, mainly which resources and parameters were used to generate the results, among...

chapter

Select-Bagging: Effectively Combining Gene Selection and Bagging for Balanced Bioinformatics Data

David J. Dittman, Taghi M. Khoshgoftaar, Amri Napolitano, Alireza Fazelpour

2014 IEEE International Conference on Bioinformatics and Bioengineering > 413 - 419

2014 IEEE International Conference on Bioinformatics and Bioengineering (BIBE)

Bioinformatics datasets have historically been difficult to work with. However, within machine learning, there is a potentially effective tool to combat such problems: ensemble learning. Ensemble learning generates a series of models and combines their results to make a single decision. This process has the benefit of utilizing the power of multiple models but the overhead of having to compute the...

chapter

Effects of the Use of Boosting on Classification Performance of Imbalanced Bioinformatics Datasets

Taghi M. Khoshgoftaar, Alireza Fazelpour, David J. Dittman, Amri Napolitano

2014 IEEE International Conference on Bioinformatics and Bioengineering > 420 - 426

2014 IEEE International Conference on Bioinformatics and Bioengineering (BIBE)

In the domain of bioinformatics, two common problems encountered when analyzing real-world datasets are class imbalance and high dimensionality. Boosting is a technique that can be used to improve classification performance, even in the presence of class imbalance. In addition, data sampling and feature selection are two important preprocessing techniques used to counter the adverse effects of both...

chapter

An MDL analysis framework for eQTL data

Georgios Chalkidis, Sumio Sugano

Asia-Pacific World Congress on Computer Science and Engineering > 1 - 7

2014 Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE)

Rapid development of genome sequencing technologies enables novel insights into the mechanisms of complex disease through Big Data analysis. Physicians can nowadays assay a patient's gene variants and gene expression patterns in a timely manner and use the obtained data to study an individual's susceptibility to complex disease and unravel the underlying mechanisms of disease pathogenesis. Massive...

chapter

NMF-Based LncRNA-Disease Association Inference and Bi-Clustering

Ashis Kumer Biswas, Jean X. Gao, Baoju Zhang, Xiaoyong Wu

2014 IEEE International Conference on Bioinformatics and Bioengineering > 97 - 104

2014 IEEE International Conference on Bioinformatics and Bioengineering (BIBE)

Long non-coding RNAs (lncRNAs) have been implicated in various biological processes, and are linked in many dysregulations. Researchers have reported large number of lncRNA associated human diseases over the past decade. In this article we employed the Non-negative Matrix Factorization method to develop a low-dimensional computational model that can describe the existing knowledge about lncRNA-disease...

chapter

Federated clouds for biomedical research: Integrating OpenStack for ICTBioMed

Cezary Mazurek, Juliusz Pukacki, Michal Kosiedowski, Szymon Trocha, more

2014 IEEE 3rd International Conference on Cloud Networking (CloudNet) > 294 - 299

2014 IEEE 3rd International Conference on Cloud Networking (CloudNet)

Increasingly complex biomedical data from diverse sources demands large storage, efficient software and high performance computing for the data's computationally intensive analysis. Cloud technology provides flexible storage and data processing capacity to aggregate and analyze complex data; facilitating knowledge sharing and integration from different disciplines in a collaborative research environment...

chapter

Modeling and analysis of gene regulatory networks with a Bayesian-driven approach

Shuqiang Wang, Jinxing Hu, Yanyan Shen, Ling Yin, more

2014 14th International Symposium on Communications and Information Technologies (ISCIT) > 289 - 293

2014 14th International Symposium on Communications and Information Technologies (ISCIT)

Modeling of gene regulatory networks play an important role in the post genomic era. In this work, we propose a Bayesian inference based model to quantitatively analyze the transcriptional regulatory network when the structure of regulatory network is given. In the proposed model, the dynamics of transcription factors are treated as a Markov process. Besides, the sequence features of genes are employed...

chapter

Robust biomarker discovery for cancer diagnosis based on meta-ensemble feature selection

Anouar Boucheham, Mohamed Batouche

2014 Science and Information Conference > 452 - 560

2014 Science and Information Conference (SAI)

Identification of biomarkers from high dimensional data is one of the most important emerging topics in genomics and personalized medicine. Gene selection aims to find a parsimonious subset of features that has the most discriminative information for a specific disease. The variations in real clinical tests have a great impact on the diagnosis efficiency. This influence makes producing stable or robust...

chapter

A new approach for mining deep order-preserving submatrices

Zhengling Liao, Jie Luo, Meihang Li, Yun Xue, more

2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) > 341 - 345

2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

In this paper, we proposed an exact method to discover all order-preserving submatrices (OPSMs) based on frequent sequential pattern mining. Firstly, an existing algorithm calACS is adjusted to disclose all common subsequences between every two row sequences, therefore all the deep OPSMs corresponding to long patterns with few supporting sequences will not be missed. Then an improved data structure...

chapter

A Model-based approach to transcription regulatory network reconstruction from time-course gene expression data

Hong Hu, Yang Dai

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 4767 - 4770

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Time-course gene expression profiling provides valuable data on dynamic behavior of cellular responses to external stimulation. Investigation of transcription factors (TFs) that regulate co-expressed genes in a dynamic process can reveal insights on the underlying molecular mechanisms. As the ChIP-seq technology is only suitable for a fraction of TFs in mammalian organisms, the computational identification...

chapter

Structural vs. functional mechanisms of duplicate gene loss following whole genome doubling

David Sankoff, Baoyong Wang, Chunfang Zheng, Carlos Fernando Buen Abad Najar

2014 IEEE 4th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) > 1 - 2

2014 IEEE 4th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)

The process of whole genome doubling (WGD) gives rise to two copies of each chromosome in a genome, containing the same genes in the same order. Through an attrition mechanism known as fractionation, one of each pair of duplicate genes is lost over evolutionary time, resulting in an interleaving patterns of deletions from duplicated regions [1]. This differentiates the WGD/fractionation model from...

chapter

AITION: A scalable KDD platform for Big Data Healthcare

Omiros Metaxas, Harry Dimitropoulos, Yannis Ioannidis, Md Paedigree

IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI) > 601 - 604

2014 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)

We propose a comprehensive information processing, knowledge discovery and simulation platform for Big Data Healthcare. In addition, we present a related, well-defined workflow that promotes model-guided personalized medicine. We start by identifying disease signatures and homogeneous patient groups, whilst modeling case-based patient similarity. Then we analyze correlations between variables and...

chapter

A Storage Policy for a Hybrid Federated Cloud platform: A Case Study for Bioinformatics

Deric Lima, Breno Moura, Gabriel Oliveira, Edward Ribeiro, more

2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing > 738 - 747

2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)

Bioinformatics tools require large-scale processing mainly due to very large databases achieving gigabytes of size. In federated cloud environments, although services and resources may be shared, storage is particularly difficult, due to distinct computational capabilities and data management policies of several separated clouds. In this work, we propose a storage policy for BioNimbuZ, a hybrid federated...

chapter

HiCOMB Keynote and Invited Talks

Stephen Larson, Umit V. Catalyurek, Ananth Kalyanaraman

2014 IEEE International Parallel & Distributed Processing Symposium Workshops > 499

2014 IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW)

INFONA - science communication portal

Search results for: . .

OntoBio: A Biodiversity Domain Ontology for Amazonian Biological Collected Objects

A Maximum A Posteriori Probability and Time-Varying Approach for Inferring Gene Regulatory Networks from Time Course Gene Microarray Data

Optimal Bayesian feature selection on high dimensional gene expression data

Random Forest and Gene Ontology for functional analysis of microarray data

Budgeted transcript discovery: A framework for joint exploration and validation studies

Coreference resolution in biomedical texts

Storing provenance data of genome project workflows using graph database

Select-Bagging: Effectively Combining Gene Selection and Bagging for Balanced Bioinformatics Data

Effects of the Use of Boosting on Classification Performance of Imbalanced Bioinformatics Datasets

An MDL analysis framework for eQTL data

NMF-Based LncRNA-Disease Association Inference and Bi-Clustering

Federated clouds for biomedical research: Integrating OpenStack for ICTBioMed

Modeling and analysis of gene regulatory networks with a Bayesian-driven approach

Robust biomarker discovery for cancer diagnosis based on meta-ensemble feature selection

A new approach for mining deep order-preserving submatrices

A Model-based approach to transcription regulatory network reconstruction from time-course gene expression data

Structural vs. functional mechanisms of duplicate gene loss following whole genome doubling

AITION: A scalable KDD platform for Big Data Healthcare

A Storage Policy for a Hybrid Federated Cloud platform: A Case Study for Bioinformatics

HiCOMB Keynote and Invited Talks

Filter options

Publication date

Content availability

Publication type

Keywords

Journal

INFONA - science communication portal

Search results for: . .

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options