首页> 外文学位 >Computational genome analysis by alignment.
【24h】

Computational genome analysis by alignment.

机译:通过比对计算基因组分析。

获取原文
获取原文并翻译 | 示例

摘要

In the first part of this thesis, we describe a global multiple alignment program that aligns large­scale genomic sequences. Such tools are important for analyzing biological sequences. Heuristic algorithms with progressive alignment approach have previously been applied. Our approach considers matches among multiple sequences simultaneously. It adopts the idea previously used in sequencing projects and transforms the multiple alignment problem into a shortest path problem in a de Bruijn graph which can be solved in linear time. The multiple alignment is represented by a directed acyclic graph which provides a platform for further comparative studies. We have tested our method on a CFTR region consisting of 13 species with 1Mbp average size, and accurate results are obtained in time comparable to progressive alignment algorithms.;In the second part of this thesis, we describe a filtration algorithm that detects significantly overlapped optical maps among millions of optical maps in a short time. Optical mapping is a powerful technology that allows construction of ordered restriction maps. Research problems like whole genome restriction map assembly and comparison need method to quickly explore massive data to identify significant matchings without doing dynamic programming comparison. Our filtration method is based on a restriction fragments tuple matching scheme and dynamic programming. We demonstrate the capability of the program on several simulation datasets and a whole human mole data. Our method shows a good balance between the speed and accuracy.
机译:在本文的第一部分,我们描述了一个比对大规模基因组序列的整体多重比对程序。这样的工具对于分析生物学序列很重要。具有渐进对准方法的启发式算法先前已被应用。我们的方法同时考虑多个序列之间的匹配。它采用了以前在排序项目中使用的思想,并将多重比对问题转换为de Bruijn图中的最短路径问题,可以在线性时间内解决该问题。多重比对由有向无环图表示,该图为进一步的比较研究提供了平台。我们已经在CFTR区域上测试了我们的方法,该区域由13个物种组成,平均大小为1Mbp,并且在时间上可获得与渐进比对算法可比的准确结果。在本文的第二部分,我们描述了一种过滤算法,该算法可以检测到显着重叠的光学在短时间内在数百万张光学地图中绘制地图。光学映射是一项强大的技术,可用于构建有序限制图。诸如全基因组限制图装配和比较之类的研究问题需要一种方法来快速探索海量数据,以识别重要的匹配,而无需进行动态编程比较。我们的过滤方法基于限制片段元组匹配方案和动态编程。我们在几个模拟数据集和整个人类痣数据上演示了该程序的功能。我们的方法显示了速度和准确性之间的良好平衡。

著录项

  • 作者

    Yang, Yi.;

  • 作者单位

    University of Southern California.;

  • 授予单位 University of Southern California.;
  • 学科 Biology Bioinformatics.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 195 p.
  • 总页数 195
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号