首页> 外文学位 >Evolutionary ancestor inference via genome rearrangement.
【24h】

Evolutionary ancestor inference via genome rearrangement.

机译:通过基因组重排进行进化祖先推断。

获取原文
获取原文并翻译 | 示例

摘要

Inferring ancestral gene orders in a phylgenomic tree is an important topic in comparative genomics. In this thesis, three different approaches have been used to infer ancestors, first, using common intervals in a model-free approach and extending it to using common clusters and neighbourhood parameter; second, using double cut and join operation (DCJ); third, using breakpoint distance. A statistically fair comparison between the performance of DCJ and breakpoint criteria ends the thesis.;Noticing the drawback that the concept of common intervals suffers from, we introduce the concept of generalized adjacency to find common clusters using a neighborhood parameter that turns out to be closely related to the bandwidth parameter of a graph. Our focus will be on how this parameter affects the characteristics of clusters: how numerous they are, how large they are, how rearranged they are and to what extent they are preserved from ancestor to descendant in a phylogenetic tree. Again, we use dynamic programming optimization to determine the presence of individual edges at the ancestral nodes of the phylogeny.;The DCJ (double cut and join) operation introduced by Yancopoulos et al. in 2005 is the most inclusive operation to date as it can generate all the movement rearrangements. One year later, Bergeron et al. restated the DCJ model and produced a simplified (linear) algorithm, which is now the most general existing algorithm to transform one genome into another using genome rearrangements events. Motivated by both, the most inclusive operation, DCJ, and its most general algorithm, we study the small phylogeny problem in the space of multichromosomal genomes under the DCJ metric. This is similar to the existing MGR (multiple genome rearrangements) approach, but it allows, in addition to inversion and reciprocal translocation, operations of transposition and block interchange.;Thanks to Tannier et al., the first polynomial solution to the median problem has been found in only one context, namely the case of breakpoint distance on multichromosomal genoms where chromosomes are unconstrained as to linearity or circularity. This motivated us to study the small phylogeny problem using breakpoint median as a third approach, that is different both biologically and computationally from the common intervals and DCJ approaches, and then to compare statistically the performance of both criteria, breakpoint and DCJ.;Away from any assumptions or considerations, probabilistic or combinatorial, about specific processes involved in rearranging genomes, we present a new phylogenetic reconstruction method based solely on common intervals. The objective function to be optimized is simply the sum over the tree branches of the symmetric difference between the two sets of intervals associated with the genomes at the two ends of the branch. To achieve this goal, we use dynamic programming optimization to determine the presence of common intervals at the ancestral nodes of the phylogeny.;Keywords: phylogenetic tree, genome rearrangment, inversion, reciprocal translocation, transposition, block interchange, common intervals, generalized adjacency, neighborhood parameter, graph bandwidth, multiple genome rearrangement (MGR), double cut and join (DCJ), breakpoint (BP), excess explanatory rate.
机译:推断系统发育树中的祖先基因顺序是比较基因组学中的重要课题。本文采用三种不同的方法来推断祖先,首先,在无模型方法中使用公共区间,然后将其扩展到使用公共聚类和邻域参数。其次,使用双重剪切和合并操作(DCJ);第三,使用断点距离。 DCJ的性能和断点条件之间的统计上的公平比较结束了本文。;为解决公用间隔概念所受的弊端,我们引入了广义邻接的概念,使用一个邻域参数来找到公用聚类,该邻域参数被证明是非常接近的与图形的带宽参数有关。我们的重点将放在此参数如何影响簇的特性上:簇的数量,大小,重新排列的程度以及在系统发育树中从祖先到后代的保留程度。再次,我们使用动态编程优化来确定系统发育祖先节点中单个边缘的存在。; Yancopoulos等人引入的DCJ(双重切割和合并)操作。 2005年迄今为止,这是最具包容性的行动,因为它可以产生所有机芯的重排。一年后,Bergeron等人。重新阐述了DCJ模型并产生了简化的(线性)算法,这是目前使用基因组重排事件将一个基因组转化为另一个基因组的最通用的现有算法。出于最包容的操作DCJ及其最通用的算法的推动,我们研究了在DCJ指标下多染色体基因组空间中的小系统发育问题。这类似于现有的MGR(多基因组重排)方法,但除倒置和倒易位外,还允许进行换位和块互换的操作。由于Tannier等人,中位数问题的第一个多项式解有了仅在一种情况下,即在染色体不受线性或圆度限制的多染色体基因组上,断点距离的情况才被发现。这促使我们使用断点中位数作为第三种方法来研究小型系统发育问题,这在生物学和计算上都不同于常见间隔和DCJ方法,然后在统计学上比较标准,断点和DCJ的性能。对于涉及基因组重排的特定过程的任何假设或考虑,无论是概率性的还是组合性的,我们仅基于共同的间隔就提出了一种新的系统发育重建方法。要优化的目标函数仅是与分支两端的基因组相关的两组间隔之间的对称差异在树枝上的总和。为实现这一目标,我们使用动态编程优化来确定系统发育祖先节点处共有区间的存在。关键词:系统发育树,基因组重排,倒置,相互易位,易位,移位,块互换,共同区间,广义邻接,邻域参数,图带宽,多基因组重排(MGR),双割和连接(DCJ),断点(BP),过量解释率。

著录项

  • 作者

    Adam, Zaky.;

  • 作者单位

    University of Ottawa (Canada).;

  • 授予单位 University of Ottawa (Canada).;
  • 学科 Biology Evolution and Development.;Biology Bioinformatics.;Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 99 p.
  • 总页数 99
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号