Although the complete sequencing of the human genome has been the driving force behind the rapid development of systems biology, obtaining informative results relies upon the accurate representation of protein coding genes in genomic and protein databases. When these genes are inaccurately represented, the conclusions from these time intensive and often costly experiments can be compromised. Furthermore, many newly sequenced organisms with complex genomes are not amenable to proteomics experiments due to the lack of protein coding gene annotation, or are completely dependent upon computational predictions. New technologies capable of searching raw genomic sequence with mass spectrometric data will be invaluable for reannotating protein coding genes of well defined proteomes, as well as exploring these newly sequenced yet uncharted genomes.
展开▼