首页> 外文会议>IEEE 12th International Conference on Bioinformatics amp; Bioengineering : Final Program amp; Abstract Book. >Annotation guided local similarity search in multiple sequences and its application to mitochondrial genomes
【24h】

Annotation guided local similarity search in multiple sequences and its application to mitochondrial genomes

机译:注释指导的多序列局部相似性搜索及其在线粒体基因组中的应用

获取原文
获取原文并翻译 | 示例

摘要

Given a set of nucleotide sequences and corresponding gene annotations which might contain a moderate number of errors we consider the problem to identify common substrings occurring in homologous genes and to identify putative errors in the given annotations. The problem is solved by identifying nodes in a suffix tree that contains all substrings occurring in the data set. Due to the large size of the targeted data set our approach employs a truncated version of suffix trees. The approach is successfully applied to the mitochondrial nucleotide sequences and the corresponding annotations available in RefSeq for more than 2000 metazoan species. We demonstrate that the approach finds appropriate subsequences despite of errors in the given annotations. Moreover, it identifies several hundred errors within the RefSeq annotations.
机译:给定一组核苷酸序列和相应的基因注释,其中可能包含中等数量的错误,我们考虑该问题以鉴定同源基因中常见的子串,并确定给定注释中的假定错误。通过在后缀树中标识包含数据集中出现的所有子字符串的节点来解决该问题。由于目标数据集的规模很大,我们的方法采用了后缀树的截短版本。该方法已成功应用于线粒体核苷酸序列和RefSeq中2000多个后生物种的相应注释。我们证明,尽管给定注释存在错误,该方法仍可以找到适当的子序列。此外,它可以识别RefSeq批注中的数百个错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号