Conference: 2010 DoD High Performance Computing Modernization Program Users Group Conference

Large-Scale Orthology Predictions for Inferring Gene Functions across Multiple Species

Abstract

An effective approach to inferring gene function is to use gene orthology. Because orthologous genes are likely to share similar functions, the functions of genes in an unstudied species can be inferred from the functions of their orthologs in a studied model species. To infer gene functions for a multitude of species, we developed a high-throughput orthology prediction method, termed PhyloTrace. PhyloTrace is both highly accurate and computationally efficient for large-scale applications and can infer orthologous genes across thousands of species. It proceeds in three major steps: 1) all-against-all comparison of every pair of genes, 2) pairwise orthology prediction for every pair of genomes, and 3) generation of orthologous clusters that contain orthologous genes across multiple genomes. We employed the previously developed Pipe man parallelization program to break a set of millions of input sequences into small chunks and process them in parallel. We successfully predicted orthologs for over 900 bacterial genomes with a false-positive prediction rate of 2.0%, a significant improvement over the widely used bidirectional best-hit method, which yielded a false-positive rate of 5.5%.
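For orientation, the bidirectional best-hit baseline mentioned in the abstract, followed by a simple grouping of pairwise predictions into clusters, might be sketched as below. This is not PhyloTrace, whose internals the abstract does not describe. The score dictionary layout, the `(genome, gene_id)` gene representation, the function names, and the use of single-linkage (union-find) clustering are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical input: similarity scores from an all-against-all comparison,
# keyed by (query_gene, target_gene); a gene is a (genome, gene_id) tuple.
toy_scores = {
    (("A", "a1"), ("B", "b1")): 90, (("A", "a1"), ("B", "b2")): 40,
    (("A", "a2"), ("B", "b1")): 70, (("A", "a2"), ("B", "b2")): 30,
    (("B", "b1"), ("A", "a1")): 90, (("B", "b1"), ("A", "a2")): 70,
    (("B", "b2"), ("A", "a1")): 40, (("B", "b2"), ("A", "a2")): 30,
}


def best_hits(scores, src_genome, dst_genome):
    """Map each gene of src_genome to its highest-scoring hit in dst_genome."""
    best = {}
    for (query, target), score in scores.items():
        if query[0] != src_genome or target[0] != dst_genome:
            continue
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return {query: target for query, (target, _) in best.items()}


def bidirectional_best_hits(scores, genome_a, genome_b):
    """Return gene pairs that are each other's best hit between two genomes."""
    a_to_b = best_hits(scores, genome_a, genome_b)
    b_to_a = best_hits(scores, genome_b, genome_a)
    return sorted((a, b) for a, b in a_to_b.items() if b_to_a.get(b) == a)


def cluster_pairs(pairs):
    """Group pairwise predictions into clusters via union-find.

    Single-linkage grouping is an assumption here, not the paper's method.
    """
    parent = {}

    def find(gene):
        parent.setdefault(gene, gene)
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]  # path halving
            gene = parent[gene]
        return gene

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = defaultdict(list)
    for gene in list(parent):
        groups[find(gene)].append(gene)
    return sorted(sorted(members) for members in groups.values())
```

With the toy scores, `bidirectional_best_hits(toy_scores, "A", "B")` keeps only the pair `a1`/`b1`: `a2` also ranks `b1` first, but `b1` prefers `a1`, so that hit is not reciprocal. Passing the pairs collected from every genome pair to `cluster_pairs` then yields multi-genome groups.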