Usman Roshan

chapter

Sequence-Length Requirements for Phylogenetic Methods

Bernard M.E. Moret, Usman Roshan, Tandy Warnow

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 343-356

We study the sequence lengths required by neighbor-joining, greedy parsimony, and a phylogenetic reconstruction method (DCM _NJ+MP) based on disk-covering and the maximum parsimony criterion. We use extensive simulations based on random birth-death trees, with controlled deviations from ultrametricity, to collect data on the scaling of sequence-length requirements...

chapter

Estimating the Deviation from a Molecular Clock

Luay Nakhleh, Usman Roshan, Lisa Vawter, Tandy Warnow

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 287-299

We address the problem of estimating the degree to which the evolutionary history of a set of molecular sequences violates a strong molecular clock hypothesis. We quantify this deviation formally, by defining the “stretch” of a model tree with respect to the underlying ultrametric tree (indicated by time). We then define the “minimum stretch” of a dataset for a tree and show how this can be computed...

chapter

The Performance of Phylogenetic Methods on Trees of Bounded Diameter

Luay Nakhleh, Usman Roshan, Katherine St. John, Jerry Sun, more

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 214-226

We study the convergence rates of neighbor-joining and several new phylogenetic reconstruction methods on families of trees of bounded diameter. Our study presents theoretically obtained convergence rates, as well as an empirical study based upon simulation of evolution on random birth-death trees. We find that the new phylogenetic methods offer an advantage over the neighbor-joining method, except...

chapter

Performance of Supertree Methods on Various Data Set Decompositions

Usman Roshan, Bernard M. E. Moret, Tiffani L. Williams, Tandy Warnow

Computational Biology > Phylogenetic Supertrees > Methodological considerations > 301-328

Many large-scale phylogenetic reconstruction methods attempt to solve hard optimization problems such as Maximum Parsimony (MP) and Maximum Likelihood (ML), but they are severely limited by the number of taxa that they can handle in a reasonable timeframe. A standard heuristic approach to this problem is the divide-and-conquer strategy: decompose the data set into smaller subsets, solve the subsets...

chapter

Prediction of Continuous Phenotypes in Mouse, Fly, and Rice Genome Wide Association Studies with Support Vector Regression SNPs and Ridge Regression Classifier

Abdulrhman Aljouie, Usman Roshan

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA) > 1246 - 1250

2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)

The ranking of SNPs and prediction of phenotypes in continuous genome wide association studies is a subject of increasing interest with applications in personalized medicine and animal and plant breeding. The ranking of SNPs in case control (discrete label) genome wide association studies has been examined in several previous studies with machine learning techniques but this is poorly explored for...

chapter

Cross-validation and cross-study validation of chronic lymphocytic leukemia with exome sequences and machine learning

Nihir Patel, Bharati Jhadav, Abdulrhman Aljouie, Usman Roshan

2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 1367 - 1374

2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

The era of genomics brings the potential of better DNA based risk prediction and treatment. While genome-wide association studies are extensively studied for risk prediction, the potential of using whole exome data for this purpose is unclear. We explore this problem for chronic lymphocytic leukemia that is one of the largest whole exome dataset of 186 case and 169 controls available from the NIH...

article

Chi8: a GPU program for detecting significant interacting SNPs with the Chi-square 8-df test

Abdulrhman Al-jouie, Mohammadreza Esfandiari, Srividya Ramakrishnan, Usman Roshan

BMC Research Notes > 2015 > 8 > 1 > 1-7

Background Determining interacting SNPs in genome-wide association studies is computationally expensive yet of considerable interest in genomics. Findings We present a program Chi8 that calculates the Chi-square 8 degree of freedom test between all pairs of SNPs in a brute force manner on a Graphics Processing Unit. We analyze each of the seven WTCCC genome-wide association studies that have about...

article

MaxSSmap: a GPU program for mapping divergent short reads to genomes with the maximum scoring subsequence

Turki Turki, Usman Roshan

BMC Genomics > 2014 > 15 > 1 > 1-14

Background Programs based on hash tables and Burrows-Wheeler are very fast for mapping short reads to genomes but have low accuracy in the presence of mismatches and gaps. Such reads can be aligned accurately with the Smith-Waterman algorithm but it can take hours and days to map millions of reads even for bacteria genomes. Results We introduce a GPU program called MaxSSmap with the aim of achieving...

INFONA - science communication portal

Search results for: Usman Roshan

Sequence-Length Requirements for Phylogenetic Methods

Estimating the Deviation from a Molecular Clock

The Performance of Phylogenetic Methods on Trees of Bounded Diameter

Performance of Supertree Methods on Various Data Set Decompositions

Prediction of Continuous Phenotypes in Mouse, Fly, and Rice Genome Wide Association Studies with Support Vector Regression SNPs and Ridge Regression Classifier

Cross-validation and cross-study validation of chronic lymphocytic leukemia with exome sequences and machine learning

Chi8: a GPU program for detecting significant interacting SNPs with the Chi-square 8-df test

MaxSSmap: a GPU program for mapping divergent short reads to genomes with the maximum scoring subsequence

Filter options

Publication date

Content availability

Publication type

Keywords

Data set

Journal

INFONA - science communication portal

Search results for: Usman Roshan

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options