Variable-length intervals in homology search

Abstract

Fast, accurate, and scalable search techniques for homology searching of large genomic collections are becoming an increasingly important requirement as genomic sequence collections continue to double in size almost yearly. Almost all homology search techniques rely on extracting fixed-length overlapping sequences from queries and database sequences, and comparing these as the first step in query evaluation; this is a feature of well-known tools such as FASTA, BLAST, and our own CAFE technique. In this paper we discuss a novel, variable-length approach to extracting subsequences that is based on homology scoring matrices. Our motivation is to achieve a balance between the speed and accuracy of fixed-length choices, that is, to encapsulate the speed of longer subsequence lengths and the accuracy of shorter ones. We show that incorporating this approach into our CAFE technique leads to a good compromise between accuracy and retrieval efficiency when searching with BLOSUM matrices sensitive to distant evolutionary relationships. We expect the same results would be achieved with other homology search techniques.
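The fixed-length first step that the abstract attributes to most homology search tools can be illustrated with a minimal sketch. This is not the CAFE, FASTA, or BLAST implementation; the function names, the interval length `k`, and the toy sequences are assumptions chosen for illustration. The variable-length, scoring-matrix-based extraction proposed in the paper is not specified in the abstract, so it is not attempted here.

```python
from collections import defaultdict


def fixed_length_intervals(sequence, k):
    """Yield (offset, interval) for every overlapping length-k substring."""
    for offset in range(len(sequence) - k + 1):
        yield offset, sequence[offset:offset + k]


def build_interval_index(sequence, k):
    """Map each length-k interval to the offsets where it occurs."""
    index = defaultdict(list)
    for offset, interval in fixed_length_intervals(sequence, k):
        index[interval].append(offset)
    return index


def shared_intervals(query, database_sequence, k):
    """Return sorted (query_offset, database_offset) pairs of exact matches."""
    index = build_interval_index(database_sequence, k)
    hits = []
    for q_offset, interval in fixed_length_intervals(query, k):
        for d_offset in index.get(interval, []):
            hits.append((q_offset, d_offset))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical toy sequences, not data from the paper.
    query = "ACGTAC"
    target = "TTACGTA"
    print(shared_intervals(query, target, 3))  # [(0, 2), (1, 3), (2, 4), (3, 1)]
    print(shared_intervals(query, target, 4))  # [(0, 2), (1, 3)]
```

In this toy case, raising `k` from 3 to 4 halves the number of candidate hits. That is the trade-off the abstract describes: longer intervals give less work per query but fewer matches to seed an alignment, while shorter intervals are more sensitive but slower.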

Bibliographic details

Source: Proceedings of the Second conference on Asia-Pacific bioinformatics, 2004, pp. 85-91 (7 pages)
Conference location: Dunedin (NZ)
Authors: Abhijit Chattaraj; Hugh E. Williams
Affiliation: RMIT University, Melbourne, Australia
Original format: PDF
Language: English
Chinese Library Classification: Bioengineering (biotechnology)
Keywords: scoring matrices
Date indexed: 2022-08-26 14:30:49
