2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

chapter

Soft-bag based motif discovery for ChIP-seq datasets

Hongbo Zhang, De-Shuang Huang

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 146 - 149

The rapid development of high-throughput sequencing technology provides unique opportunities for studies of transcription factor binding, while also bringing new computational challenges. Recently, a series of discriminative motif discovery (DMD) methods have been proposed and offer promising solutions for addressing these challenges. However, because of the huge computational cost, most of them have...

chapter

Statistical selection of biological models for genome-wide association analyses

Wenjian Bi, Guolian Kang, Stanley B. Pounds

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 150 - 157

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Genome-wide association studies have discovered many biologically important associations of genes with phenotypes. Typically, genome-wide association analyses formally test the association of each genetic feature (SNP, CNV, etc) with the phenotype of interest and summarize the results with multiplicity-adjusted p-values. However, very small p-values only provide evidence against the null hypothesis...

chapter

LiDiAimc: LincRNA-disease associations through inductive matrix completion

Ashis Biswas, Jean Gao

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 158 - 163

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

The dysregulations of long intergenic non-coding RNAs (lincRNAs) have shown to be linked with a wide variety of human diseases over the past few years. However, there are only a few lincRNA-disease association inference tools available with most of them relying on very specific type of prior knowledge about the lincRNAs and the diseases. They fall short in generalized association predictions when...

chapter

MiteFinder: A fast approach to identify miniature inverted-repeat transposable elements on a genome-wide scale

Jialu Hu, Yan Zheng, Xuequn Shang

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 164 - 168

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Miniature inverted-repeat transposable element (M ITE) is a type of class II non-autonomous transposable element playing a crucial role in the process of evolution in biology. Development of bioinformatics tools that are capable of effectively identifying MITEs can enable genome-wide studies of MITE patterns in eukaryotes. Here, we present a fast, accurate and memory-efficient tool, MiteFinder, for...

chapter

Comparative analysis of alignment tools for nanopore reads

Natasha Pavlovikj, Etsuko N. Moriyama, Jitender S. Deogun

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 169 - 174

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Alignment of sequence reads is an important step of many bioinformatics workflows. While the alignment of short reads is well investigated, the alignment of long reads produced by third-generation sequencing technologies, such as Oxford Nanopore, is more challenging because they have high error rates (10–40%). Furthermore, due to their different algorithmic approaches, different tools produce varied...

chapter

Choosing optimal controls for genotyping arrays

J. Sebastian Sigmon, Leonard McMillan

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 175 - 180

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Before genotyping microarrays can be used, calling algorithms must first be calibrated with a control set. Calling algorithms that evaluate hybridization intensity data on the basis of individual markers are better able to compensate for sequence specific variations. However, they require that the control set includes samples sufficient to exercise every marker in all of its allelic states. Minimizing...

chapter

Unraveling complex local genomic rearrangements from long-read data

Zachary D. Stephens, Ravishankar K. Iyer, Chen Wang, Jean-Pierre A. Kocher

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 181 - 187

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

In this paper, we present a graph search approach for identifying arbitrarily complex structural genomic variation. Our method leverages the ability of long reads (e.g. from Pacific Biosciences platforms) to span multiple breakpoints of complicated local rearrangements, allowing us to resolve small-scale complexities that may be overlooked by other tools. We applied our method to a subset of NA12878...

chapter

What can one chromosome tell us about human biogeographical ancestry?

Tanjin Taher Toma, Zachary Williams, Jeremy Dawson, Donald Adjeroh

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 188 - 193

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

We study the problem of predicting human biogeographical ancestry using genomic data. While continental level ancestry is relatively simple using genomic information, distinguishing between individuals from closely associated subpopulations (e.g., from the same continent) is still a difficult challenge. In particular, we focus on the case where the analysis is constrained to using single nucleotide...

chapter

Multiplex confounding factor correction for genomic association mapping with squared sparse linear mixed model

Haohan Wang, Xiang Liu, Yunpeng Xiao, Ming Xu, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 194 - 201

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Genome-wide Association Study has presented a promising way to understand the association between human genomes and complex traits. Many simple polymorphic loci have been shown to explain a significant fraction of phenotypic variability. However, challenges remain in the non-triviality of explaining complex traits associated with multifactorial genetic loci, especially considering the confounding...

chapter

Differential gene expression analysis in single-cell RNA sequencing data

Tianyu Wang, Sheida Nabavi

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 202 - 207

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote...

chapter

Integrating embeddings of multiple gene networks to prioritize complex disease-associated genes

Mengmeng Wu, Wanwen Zeng, Wenqiang Liu, Yijia Zhang, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 208 - 215

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Genome-wide association study (GWAS), as one primary approach for genetic studies, has been successfully applied to a variety of complex diseases, leading to the discovery of substantial disease-associated loci. These discovered associations provide unprecedented opportunities for deepening our understanding of complex diseases, such as disease-associated risk variants, genes, and pathways. However,...

chapter

MEC: Misassembly error correction in contigs using a combination of paired-end reads and GC-contents

Binbin Wu, Jianxin Wang, Junwei Luo, Min Li, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 216 - 221

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

The de novo assembly aims to reconstruct the genome of the unknown species. Many algorithms have been proposed for de novo assemblies. Due to problems of repetitive regions and sequencing errors, contigs usually contain a large amount of misassemblies. Consequently, the misassembly correction of contigs is a challenging and significant work, which receives considerable attentions from researchers...

chapter

A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry

Runmin Yang, Daming Zhu, Qiang Kou, Poornima Bhat-Nakshatri, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 222 - 229

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Database search is the main approach for identifying proteoforms using top-down tandem mass spectra. However, it is extremely slow to align a query spectrum against all protein sequences in a large database when the target proteoform that produced the spectrum contains post-translational modifications and/or mutations. As a result, efficient and sensitive protein sequence filtering algorithms are...

chapter

Noise cancellation for robust copy number variation detection using next generation sequencing data

Fatima Zare, Sardar Ansari, Kayvan Najarian, Sheida Nabavi

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 230 - 236

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

High-throughput next generation sequencing (NGS) technologies have created an opportunity for detecting copy number variations (CNVs) more accurately. However, efficient and precise detection of CNVs remains challenging due to high levels of noise and biases, data heterogeneity and the “big data” nature of NGS data. In this work, we introduce a novel preprocessing pipeline to improve the detection...

chapter

Inversion detection using PacBio long reads

Shenglong Zhu, Scott J. Emrich, Danny Z. Chen

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 237 - 242

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Structural variation is important in disease etiology and ecological adaptation. Prior work has focused on using either only short paired-end reads or a hybrid approach that combines long and short reads to detect structural variants. Few methods have focused solely on using long reads. Here, we aim to detect a specific type of structural variation, large inversions, using only raw PacBio long reads...

chapter

A copy-number variation detection pipeline for single cell sequencing data on BGI online

Jingying Huang, Yuwen Zhou, Aodan Xu, Enhong Zhuo, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 243 - 246

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

The revolutionary invention of single-cell sequencing technology carves out a new way to delineate intra tumor heterogeneity and traces the evolution of single cells at the molecular level. To cater for fast and convenient needs in calling copy-number variations in analyzing single-cell sequencing data, a systematical protocol and a working pipeline is reported. The proposed pipeline consists of six...

chapter

Probabilistic estimation of overlap graphs for large sequence datasets

Rahul Nihalani, Sriram P Chockalingam, Shaowei Zhu, Vijay Vazirani, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 247 - 252

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Sequence overlap graphs, constructed based on suffix-prefix relationships between pairs of sequences, are an important data structure in computational biology. High throughput sequencers can read several million to a few billion DNA fragments in a single experiment, making the construction of overlap graphs for such datasets compute-intensive. In this paper, we present a Locality-Sensitive Hashing...

chapter

Genetic variant analysis of boys with Autism: A pilot study on linking facial phenotype to genotype

Tayo Obafemi-Ajayi, Luke Settles, Yuqing Su, Cynthia Germeroth, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 253 - 257

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

This work examines the validity of facial phenotypes as Autism Spectrum Disorders (ASD) biomarkers in boys with essential autism. A family-based association analysis framework is presented that uses previously identified facially-delineated (FD) clusters to examine relationship between FD clusters and known ASD genes. The hypothesis is that there are certain genetic variants, single nucleotide polymorphisms...

chapter

Improved classification model for peptide identification based on self-paced learning

Yongxiang Wang, Xijun Liang, Zhonghang Xia, Xinnan Niu, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 258 - 261

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Post-database searching is a key procedure for peptide spectrum matches (PSMs) in protein identification with mass spectrometry-based strategies. Although many machine learning-based approaches have been developed to improve the accuracy of peptide identification, the challenge remains for improvement due to the poor quality of data samples. CRanker has shown its effectiveness and efficiency in terms...

chapter

Pre-SCNAClonal: Efficient GC bias correction for SCNA based tumor subclonal populations inferring

Yanshuo Chu, Mingxiang Teng, Zhenxing Wang, Yongtian Wang, more

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 262 - 265

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. We found that, in a GC study, the GC contents and read count ratios on SCNA segments present a Log linear biased pattern. However, currently no subclonal inferring tools...

INFONA - science communication portal

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Soft-bag based motif discovery for ChIP-seq datasets

Statistical selection of biological models for genome-wide association analyses

LiDiAimc: LincRNA-disease associations through inductive matrix completion

MiteFinder: A fast approach to identify miniature inverted-repeat transposable elements on a genome-wide scale

Comparative analysis of alignment tools for nanopore reads

Choosing optimal controls for genotyping arrays

Unraveling complex local genomic rearrangements from long-read data

What can one chromosome tell us about human biogeographical ancestry?

Multiplex confounding factor correction for genomic association mapping with squared sparse linear mixed model

Differential gene expression analysis in single-cell RNA sequencing data

Integrating embeddings of multiple gene networks to prioritize complex disease-associated genes

MEC: Misassembly error correction in contigs using a combination of paired-end reads and GC-contents

A spectrum graph-based protein sequence filtering algorithm for proteoform identification by top-down mass spectrometry

Noise cancellation for robust copy number variation detection using next generation sequencing data

Inversion detection using PacBio long reads

A copy-number variation detection pipeline for single cell sequencing data on BGI online

Probabilistic estimation of overlap graphs for large sequence datasets

Genetic variant analysis of boys with Autism: A pilot study on linking facial phenotype to genotype

Improved classification model for peptide identification based on self-paced learning

Pre-SCNAClonal: Efficient GC bias correction for SCNA based tumor subclonal populations inferring

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)