首页> 外文学位 >Logical foundations of W-Curves used for phylogenetic analysis: An investigation of W-Metrics.
【24h】

Logical foundations of W-Curves used for phylogenetic analysis: An investigation of W-Metrics.

机译:用于系统发育分析的W曲线的逻辑基础:W度量的调查。

获取原文
获取原文并翻译 | 示例

摘要

We have defined a new metric (the W-Metric) on W-Curves for the use in the development of phylogenetic trees based on the argument that similarity between the W-Curves (that is, similarity between the underlying DNA strands) is the most logical basis for development of phylogenetic trees that reflect genetic kinship. The W-Curve was developed here at Illinois Institute of Technology as a tool for efficiently visualizing long genomic sequences.; We have tested our metric on published data about genome sequences for mammals where the phylogenetic trees are known and also on genetic data from the HIV virus. The trees resulting from the mammals data set strongly support the adequacy of this method to map out the classification of species. We have tested the efficiency and effectiveness of the W-Metric in building the trees for mammalian gene data, HIV-1 sequence data and the Bacitracin Synthetase sequence data. In the analysis, the results have been compared and contrasted with the ones obtained from other approaches. Subsequent general evaluation of procedures for doing phylogenetic analysis using W-curves lead to identify some areas in which improvements can be made.; The W-Curve method is computationally attractive since it provides a fast way of visually detecting special or repeating patterns, approximating the similarity. The W-Metric allows us to do in depth analysis by using W-Curves to support the construction of phylogenetic trees. However, the auto-regressive nature of the W-Curve does not lend itself to accurately quantify dissimilarity using extensions of existing means like percent difference or Euclidean metric approximation.; This research uncovered a problem with estimating dissimilarity between two W-Curves which is inherent to the concept of the auto-regressive function itself, namely the concern about overestimation of branch length. One way to investigate this problem more closely would be to attempt to produce the W-Curve of two or three medium or short sequences which differ just by one base. The dissimilarity has been defined by many phylogeny programs as the percent difference. The iterative function would not allow an almost zero dissimilarity between two W-Curves of which representative DNA sequences differ by only one base.; Gap Stripping resolution using a consensus algorithm, as we implemented in this research, is also of major importance since it provides us some insurance that biologically meaningful information is not discarded before it has served its purpose.
机译:基于W曲线之间的相似性(即基础DNA链之间的相似性)最大的论点,我们为W曲线定义了一个新的度量标准(W度量),用于系统发育树的开发。反映遗传亲缘关系的系统发育树的逻辑基础。 W曲线是伊利诺伊理工学院在这里开发的,可以有效地可视化长基因组序列。我们已经根据已知的系统发育树的哺乳动物基因组序列的公开数据以及HIV病毒的遗传数据测试了度量标准。来自哺乳动物数据集的树木强烈支持该方法足以确定物种的分类。我们已经测试了W-Metric在为哺乳动物基因数据,HIV-1序列数据和杆菌肽合成酶序列数据构建树木时的效率和有效性。在分析中,将结果与从其他方法获得的结果进行了比较和对比。随后对使用W曲线进行系统发育分析的程序进行一般评估,从而确定了可以改进的区域。 W曲线方法在计算上具有吸引力,因为它提供了一种视觉上快速检测特殊或重复图案,近似相似度的方法。 W-Metric允许我们通过使用W-Curves支持系统发育树的构建来进行深入分析。但是,W曲线的自回归特性无法利用现有手段(例如百分比差异或欧几里德度量逼近)的扩展来准确地量化差异。这项研究发现了估计两个W曲线之间差异的问题,这是自回归函数本身的概念所固有的,即担心分支长度过高。一种更仔细研究此问题的方法是,尝试产生两个或三个中等或短序列(仅相差一个碱基)的W曲线。许多系统发育程序已将差异定义为差异百分比。迭代功能不允许两个W曲线之间的差异几乎为零,而两个W曲线的代表性DNA序列仅相差一个碱基。正如我们在本研究中所实现的那样,使用共识算法进行的间隙剥离分辨率也非常重要,因为它为我们提供了一定的保证,即在有意义的信息没有达到其目的之前就不会将其丢弃。

著录项

  • 作者

    Kiomegne, Calvin Samo.;

  • 作者单位

    Illinois Institute of Technology.;

  • 授予单位 Illinois Institute of Technology.;
  • 学科 Computer Science.; Biology Genetics.
  • 学位 Ph.D.
  • 年度 2002
  • 页码 132 p.
  • 总页数 132
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;遗传学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号