Journal: Information and Computation

Bidirectional search in a string with wavelet trees and bidirectional matching statistics

Abstract

Searching for genes encoding microRNAs (miRNAs) is an important task in genome analysis. Because the secondary structure of miRNA (but not the sequence) is highly conserved, the genes encoding it can be determined by finding regions in a genomic DNA sequence that match the structure. It is known that algorithms using a bidirectional search on the DNA sequence for this task outperform algorithms based on unidirectional search. The data structures supporting a bidirectional search (affix trees and affix arrays), however, are rather complex and suffer from their large space consumption. Here, we present a new data structure called bidirectional wavelet index that supports bidirectional search with much less space. With this data structure, it is possible to search for candidates of RNA secondary structural patterns in large genomes, for example the complete human genome. Another important application of this data structure is short read alignment. As a second contribution, we show how bidirectional matching statistics can be computed in linear time.
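To make the idea of a bidirectional search concrete, here is a minimal sketch in Python. It is not the bidirectional wavelet index described in the abstract: it keeps an explicit list of occurrences and filters it naively, which costs time and space proportional to the number of occurrences. The function names (`all_occurrences`, `extend_left`, `extend_right`) are hypothetical and chosen only for illustration. What it shows is the interface that makes the search bidirectional: a matched pattern can be grown by one character on either side.

```python
from typing import List, Tuple

# One occurrence of the current pattern, as a half-open interval [start, end).
Occ = Tuple[int, int]


def all_occurrences(text: str, pattern: str) -> List[Occ]:
    """Naive scan: every interval of text that equals pattern."""
    m = len(pattern)
    return [(i, i + m) for i in range(len(text) - m + 1)
            if text[i:i + m] == pattern]


def extend_left(text: str, occs: List[Occ], c: str) -> List[Occ]:
    """Keep the occurrences preceded by c and grow them one step left."""
    return [(s - 1, e) for s, e in occs if s > 0 and text[s - 1] == c]


def extend_right(text: str, occs: List[Occ], c: str) -> List[Occ]:
    """Keep the occurrences followed by c and grow them one step right."""
    return [(s, e + 1) for s, e in occs if e < len(text) and text[e] == c]


if __name__ == "__main__":
    text = "ACGTACGA"
    occs = all_occurrences(text, "G")    # start from the middle: "G"
    occs = extend_left(text, occs, "C")  # pattern is now "CG"
    occs = extend_right(text, occs, "T")  # pattern is now "CGT"
    print(occs)
```

Starting in the middle and alternating directions is what a search for a structural pattern such as a hairpin needs, because the two arms of a stem have to be checked against each other as they grow. An index-based method would replace the occurrence list with a compact representation of the same set.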
