首页> 外文会议>Workshop on Genome Informatics >An infrastructure for comparative genomics to functionally characterize genes and proteins.
【24h】

An infrastructure for comparative genomics to functionally characterize genes and proteins.

机译:对比较基因组学的基础设施,以功能性表征基因和蛋白质。

获取原文

摘要

Current genome projects are resulting in a flood of sequence data. The interpretation of these sequences is lagging, and optimized data analysis strategies need to be developed. Much can be learned from comparing different genomes, as genomes of distant organisms may still encode proteins with high sequence similarity. The order of genes (co linearity) in genomes may also be conserved to some extend. We have employed both these observations to create a multi-functional, computational analysis system (genomeSCOUT) which allows for rapid identification and functional characterization of genes and proteins through genome comparison. With a number of independent algorithms, information about different levels of protein homology (concerning e.g. paralogs, orthologs and clusters of orthologous groups, COGs) and gene order is collected and stored in several value added databases. These databases are then used for interactive comparison of genomes and subsequent analysis. The application is based on the well established data integration system SRS. This ensures (1) fast handling of large genomic data sets, (2) straightforward access to a multitude of biological databases, (3) unique linking functions between these databases, (4) highly efficient collection of information on genes and proteins, and 5. fully integrated and user friendly graphical representations of search results. This application can be used for projects as diverse as the correct annotation of genomes, the optimization of (micro) organisms for industrial production, or the identification of drug targets.
机译:目前的基因组项目导致繁多的序列数据。对这些序列的解释是滞后的,并且需要开发优化的数据分析策略。可以从比较不同基因组来学习大量的,因为遥远生物的基因组仍然可以编码具有高序列相似性的蛋白质。基因组中的基因(Co线性度)的顺序也可能被保守到一些延伸。我们使用这些观察结果来创建多功能,计算分析系统(Genomescout),其通过基因组比较来允许基因和蛋白质的快速鉴定和功能性表征。利用许多独立算法​​,收集有关不同蛋白质同源水平的信息(关于例如,副总组,齿齿群,齿齿组,基因序列的副蛋白酶,直肠球菌和基因令中的群体。然后将这些数据库用于基因组的交互式比较和随后的分析。该应用程序基于已建立的数据集成系统SRS。这确保了(1)快速处理大型基因组数据集,(2)直接访问多种生物数据库,(3)这些数据库之间的唯一链接功能,(4)高效地对基因和蛋白质的信息集合,以及5 。完全集成和用户友好的搜索结果图形表示。本申请可用于项目,作为基因组的正确注释,工业生产的(微)生物体的优化,或鉴定药物靶标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号