...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >THE NET-HMM APPROACH: PHYLOGENETIC NETWORK INFERENCE BY COMBINING MAXIMUM LIKELIHOOD AND HIDDEN MARKOV MODELS
【24h】

THE NET-HMM APPROACH: PHYLOGENETIC NETWORK INFERENCE BY COMBINING MAXIMUM LIKELIHOOD AND HIDDEN MARKOV MODELS

机译:NET-HMM方法:通过结合最大似然模型和隐马尔可夫模型来进行系统发育网络推断

获取原文
获取原文并翻译 | 示例
           

摘要

Horizontal gene transfer (HGT) is the event of transferring genetic material from one lineage in the evolutionary tree to a different lineage. HGT plays a major role in bac_terial genome diversification and is a significant mechanism by which bacteria develop resistance to antibiotics. Although the prevailing assumption is of complete HGT, cases of partial HGT (which are also named chimeric HGT) where only part of a gene is horizontally transferred, have also been reported, albeit less frequently. In this work we suggest a new probabilistic model, the NET-HMM, for analyzing and modeling phy_logenetic networks. This new model captures the biologically realistic assumption that neighboring sites of DNA or amino acid sequences are not independent, which increases the accuracy of the inference. The model describes the phylogenetic network as a Hid_den Markov Model (HMM), where each hidden state is related to one of the network's trees. One of the advantages of the NET-HMM is its ability to infer partial HGT as well as complete HGT. We describe the properties of the NET-HMM, devise efficient algorithms for solving a set of problems related to it, and implement them in software. We also provide a novel complementary significance test for evaluating the fitness of a model (NET-HMM) to a given dataset. Using NET-HMM, we are able to answer interesting biological questions, such as inferring the length of partial HGT's and the affected nucleotides in the genomic sequences, as well as inferring the exact location of HGT events along the tree branches. These advantages are demonstrated through the analysis of synthetical inputs and three different biological inputs.
机译:水平基因转移(HGT)是将遗传物质从进化树中的一个谱系转移到另一个谱系的事件。 HGT在细菌基因组多样化中起着重要作用,并且是细菌对抗生素产生抗性的重要机制。尽管主要假设是完整的HGT,但也有报道报道了部分HGT(也称为嵌合HGT)的案例,其中仅部分基因被水平转移,尽管这种情况发生的频率较低。在这项工作中,我们建议使用一种新的概率模型NET-HMM来对phy_logenetic网络进行分析和建模。该新模型捕获了生物学上现实的假设,即DNA或氨基酸序列的相邻位点不是独立的,从而提高了推断的准确性。该模型将系统发育网络描述为Hid_den Markov模型(HMM),其中每个隐藏状态都与网络中的一棵树相关。 NET-HMM的优点之一是其推断部分HGT以及完整HGT的能力。我们描述NET-HMM的属性,设计有效的算法来解决与之相关的一系列问题,并在软件中实现它们。我们还提供了一种新颖的互补重要性检验,用于评估模型(NET-HMM)对给定数据集的适用性。使用NET-HMM,我们能够回答有趣的生物学问题,例如推断部分HGT的长度和基因组序列中受影响的核苷酸,以及推断HGT事件沿树枝的确切位置。通过分析合成投入物和三种不同的生物投入物,可以证明这些优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号