The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
More than 100 microbial genomes have been sequenced since 1995 and thousands of microbial genomes will be sequenced in a decade. It implies that millions of open reading frames (ORFs) will be predicted and should be evaluated. Therefore, we need a high throughput system to evaluate the predicted ORFs and understand functions of genes based on comparative genomics. We established and applied a protocol...
In this paper, an integrated system called OBIEnv, which has been developed on OBIGrid, is described. In addition to automatic database transfer and deployment, it provides various functionalities for transparent and fault-tolerant processing of bioinformatics tasks on Grid. A feasibility study on the analysis of horizontal gene transfer was done using 119 heterogeneous Linux nodes in 5 different...
Many useful applications of simulation in computational cell biology, e.g. kinetic parameter estimation, Metabolic Control Analysis (MCA), and bifurcation analysis, require a large number of repetitive runs with different input parameters. The heavy requirements imposed by these analysis methods on computational resources has led to an increased interest in parallel- and distributed computing technologies...
OBIGrid provides high-throughput GRIDBLAST services (OBIGbs) for researchers who need to deal with many BLAST query sequences at one time by exploiting both distributed processing and parallel processing. A new application-oriented grid framework has been introduced to split a BLAST query into independent sub-queries and to execute the sub-queries on remote personal computers and PC clusters connected...
The rapid progress of biotechnology provides an increasing number of life science databases. These databases have been operated and managed individually on the Internet. Under such a circumstance, it is needed to develop an infrastructure that allows to share information contained in these databases and to conduct research collaboration. Grid technology is an emerging technology for seamless and loose...
High-throughput proteome annotation refers to the activity of extracting information from all proteins in a particular organism using bioinformatics software on a high performance computing platform such as the grid. The Encyclopedia of Life (EOL) project [1] aims to catalog all proteins in all species for public benefits using an Integrative Genome Annotation Pipeline [2] (iGAP). The intrinsic complexity...
The goal of the Encyclopedia of Life (EOL) Project is to predict structural information for all proteins, in all organisms. This calculation presents challenges both in terms of the scale of the computational resources required (approximately 1.8 million CPU hours), as well as in data and workflow management. While tools are available that solve some subsets of these problems, it was necessary for...
We developed OBITco (Open BioInfomatics Thermus thermophilus Cyber Outlet) for gene annotation of T. thermophilus HB8 strain. To provide system services for numbers of researchers in the project, we adopted Web based technology and high-level user authentication system with three functions which are rollback function, hierarch representative function and easy-and-systematic annotation. The robust...
Proteins are classified into families based on structural or functional similarities. Artificial intelligence methods such as Hidden Markov Models, Neural Networks and Fuzzy Logic have been used individually in the field of bioinformatics for tasks such as protein classification and microarray data analysis. We integrate these three methods into a protein classification system for the purpose of drug...
The maximum likelihood method is considered as one of the most reliable methods for phylogenetic tree inference. However, as the number of species increases, the approach quickly loses its applicability due to explosive exponential number of trees that need to be considered. An earlier work by one of the authors [3] demonstrated that, by decomposing the trees into fragments called splits, and calculating...
The plethora of bioinformatics tools currently available to the biologist, and the diversity of problems that these applications were designed to solve, has necessitated a look at providing a single environment which can serve as an interface to these many applications. At the same time, this environment should also be able to function as an infrastructure resource with adequate computational capacity...
The BRIDGES project is incrementally developing and exploring database integration over six geographically distributed research sites with the framework of a Wellcome Trust biomedical research project (the Cardiovascular Functional Genomics project) to provide a sophisticated infrastructure for bioinformaticians. Grid technologies are being used to facilitate this integration. Key issues to be investigated...
In the BiodiversityWorld project we are building a GRID to support scientific biodiversity-related research. The requirements associated with such a GRID are somewhat different from other GRIDs, and this has influenced the architecture that we have developed. In this paper we outline these requirements, most notably the need to interoperate over a diverse set of legacy databases and applications in...
In this study, a new Genetic Algorithm (GA) using the Tabu · Local Search mechanism is proposed. The GA described in this paper is considered a Mega Process GA, which has an effective mechanism to use massive processors, i.e., Mega Processors, in large-scale computing systems. Our proposed method has a GA-specific database that possesses information of searched space and performs a local search for...
This paper presents a genetic algorithm running on a grid computing environment for inference of genetic networks. In bioinformatics, inference of genetic networks is one of the most important problems, in which mutual interactions among genes are estimated by using gene-expression time-course data. Network-Structure-Search Evolutionary Algorithm (NSS-EA) is a promising inference method of genetic...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.