Accelerating Protein Classification Using Suffix Trees

机译：使用后缀树加速蛋白质分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Position-specific scoring matrices have been used extensively to recognize highly conserved protein regions. We present a method for accelerating these searches using a suffix tree data structure computed from the sequences to be searched. Building on earlier work that allows evaluation of a scoring matrix to be stopped early, the suffix tree-based method excludes many protein segments from consideration at once by pruning entire subtrees. Although suffix trees are usually expensive in space, the fact that scoring matrix evaluation requires an in-order traversal allows nodes to be stored more compactly without loss of speed, and our implementation requires only 17 bytes of primary memory per input symbol. Searches are accelerated by up to a factor of ten.

机译：定位特异性评分矩阵已被广泛用于识别高度保守的蛋白质区域。我们介绍了一种使用从要搜索的序列计算的后缀树数据结构加速这些搜索的方法。在早期的工作中建立允许早期评估得分矩阵，基于后缀的树木的方法通过修剪整个子树来不包括考虑的许多蛋白质区段。虽然后缀树通常在空间中昂贵，但得分矩阵评估需要按顺序遍历允许节点更加紧凑地存储而不会损失速度，并且我们的实现每输入符号只需要17个字节的主存储器。搜索达到10倍。

著录项

来源
《International Conference on Intelligent Systems for Molecular Biology》|2000年||共6页
会议地点
作者
Bogdan Dorohonceanu; C. G. Nevill-Manning;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类分子生物学;
关键词

相似文献

外文文献
中文文献
专利

1. Linearized Suffix Tree: an Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays [J] . Dong Kyue Kim, Minhwan Kim, Heejin Park Algorithmica . 2008,第3期

机译：线性化后缀树：具有后缀树和后缀数组功能的高效索引数据结构
2. Linearized Suffix Tree: An Efficient Index Data Structure With The Capabilities Of Suffix Trees And Suffix Arrays [J] . Dong Kyue Kim, Minhwan Kim, Heejin Park Algorithmica . 2008,第3期

机译：线性化后缀树：具有后缀树和后缀数组功能的高效索引数据结构
3. Toward optimizing the cache performance of suffix trees for sequence analysis algorithms suffix tree cache performance optimization. [J] . Lee C, Huang CH Advances in Experimental Medicine and Biology . 2010,第Null期

机译：为了优化后缀树的缓存性能，以进行序列分析算法后缀树的缓存性能优化。
4. Accelerating Protein Classification Using Suffix Trees [C] . Bogdan Dorohonceanu, C.G. Nevill-Manning International Conference on Intelligent Systems for Molecular Biology; 20000816-23; La Jolla,CA(US) . 2000

机译：使用后缀树加速蛋白质分类
5. Suffix trees and suffix arrays in primary and secondary storage [D] . Ko, Pang 2007

机译：主存储和辅助存储中的后缀树和后缀数组
6. Using the Sadakane Compressed Suffix Tree to Solve the All-Pairs Suffix-Prefix Problem [O] . Maan Haj Rachid, Qutaibah Malluhi, Mohamed Abouelhoda -1

机译：使用Sadakane压缩后缀树来解决全对后缀前缀问题
7. A Suffix Tree Or Not A Suffix Tree? [O] . Tatiana Starikovskaya, Hjalte Wedel Vildhøj 2016

机译：后缀树或不是后缀树？

Accelerating Protein Classification Using Suffix Trees

摘要

著录项

相似文献

相关主题

期刊订阅