首页> 外文学位 >Memory-efficient graph search applied to multiple sequence alignment.
【24h】

Memory-efficient graph search applied to multiple sequence alignment.

机译:内存有效的图搜索适用于多序列比对。

获取原文
获取原文并翻译 | 示例

摘要

Graph search is used in many areas of computer science. It is well-known that the scalability of graph-search algorithms such as A* is limited by their memory requirements. In this dissertation, I describe three complementary strategies for reducing the memory requirements of graph-search algorithms, especially for multiple sequence alignment (a central problem in computational molecular biology). These search strategies dramatically increase the range and difficulty of multiple sequence alignment problems that can be solved.; The first strategy uses a divide-and-conquer method of solution reconstruction, and one of my contributions is to show that when divide-and-conquer solution reconstruction is used, a layer-by-layer strategy for multiple sequence alignment is more memory-efficient than a best-first strategy.; The second strategy is a new approach to duplicate detection in external-memory graph search that involves partitioning the search graph based on an abstraction of the state space. For graphs with sufficient local structure, it allows graph-search algorithms to use external memory, such as disk storage, almost as efficiently as internal memory.; The third strategy is a technique for reducing the memory requirements of sub-alignment search heuristics that are stored in lookup tables. It uses the start and goal states of a problem instance to restrict the region of the state space for which a table-based heuristic is needed, making it possible to store more accurate heuristic estimates in the same amount of memory.; These three strategies dramatically improve the scalability of graph search not only for multiple sequence alignment, but for many other graph-search problems, and generalizations of these search strategies for other graph-search problems are discussed throughout the dissertation.
机译:图搜索用于计算机科学的许多领域。众所周知,诸如A *之类的图搜索算法的可伸缩性受到其内存需求的限制。在这篇论文中,我描述了三种互补的策略来减少图搜索算法的内存需求,特别是对于多序列比对(计算分子生物学中的一个中心问题)。这些搜索策略大大增加了可以解决的多个序列比对问题的范围和难度。第一种策略使用解决方案重构的分治方法,而我的贡献之一就是表明,当使用分治解决方案重构时,用于多序列比对的逐层策略具有更大的存储空间。比最佳优先策略有效。第二种策略是一种在外部存储器图搜索中进行重复检测的新方法,该方法涉及基于状态空间的抽象对搜索图进行分区。对于具有足够局部结构的图,它允许图搜索算法使用外部存储器(例如磁盘存储)的效率几乎与内部存储器一样。第三种策略是用于减少存储在查找表中的子对齐搜索启发式存储需求的技术。它使用问题实例的开始状态和目标状态来限制需要基于表的启发式的状态空间区域,从而可以在相同的内存量中存储更准确的启发式估计。这三种策略不仅极大地提高了图搜索的可扩展性,不仅适用于多序列比对,而且还解决了许多其他图搜索问题,并且本文将讨论这些搜索策略对其他图搜索问题的概括。

著录项

  • 作者

    Zhou, Rong.;

  • 作者单位

    Mississippi State University.;

  • 授予单位 Mississippi State University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 159 p.
  • 总页数 159
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号