High-throughput proteome annotation refers to the activity of extracting information from all proteins in a particular organism using bioinformatics software on a high performance computing platform such as the grid. The Encyclopedia of Life (EOL) project aims to catalog all proteins in all species for public benefits using an Integrative Genome Annotation Pipeline (iGAP). The intrinsic complexity of the pipeline makes iGAP an ideal life sciences application to drive grid software development. It is a flagship application for the TeraGrid project. The deployment of iGAP on the grid using grid middleware and mediator software has been described previously. The heterogeneous and distributed computing environment on the grid requires an interactive user interface where jobs may be submitted and monitored. Here we describe our international collaborative effort in creating a grid portal solution for grid computing in life sciences under the auspices of PRAGMA. Specifically, the development of GridMonitor for interactive monitor of iGAP workflow, and the use of a GridSpeed generated iGAP application portal are described. The portal solution was part of the EOL demonstration at Supercomputing 2003 (SC'03), where resources from 6 institutions on 5 continents are utilized to annotate more than 36,000 proteins from various species. It is a testimony to the necessity and expediency for international collaboration in an increasingly global grid computational environment to advance life sciences research.
展开▼