首页> 外文会议>BIBE 2012 >Annotation Guided Local Similarity Search in Multiple Sequences and its Application to Mitochondrial Genomes
【24h】

Annotation Guided Local Similarity Search in Multiple Sequences and its Application to Mitochondrial Genomes

机译:注释在多个序列中引导局部相似性搜索及其在线粒体基因组中的应用

获取原文

摘要

Given a set of nucleotide sequences and corresponding gene annotations which might contain a moderate number of errors we consider the problem to identify common substrings occurring in homologous genes and to identify putative errors in the given annotations. The problem is solved by identifying nodes in a suffix tree that contains all substrings occurring in the data set. Due to the large size of the targeted data set our approach employs a truncated version of suffix trees. The approach is successfully applied to the mitochondrial nucleotide sequences and the corresponding annotations available in RefSeq for more than 2000 metazoan species. We demonstrate that the approach finds appropriate subsequences despite of errors in the given annotations. Moreover, it identifies several hundred errors within the RefSeq annotations.
机译:给定一组核苷酸序列和相应的基因注释,其可能含有适度误差的误差,我们考虑了识别在同源基因中发生的常见亚的常见底片并识别给定注释中的推定误差的问题。通过识别包含在数据集中发生的所有子字符串的后缀树中的节点来解决问题。由于目标数据集的大尺寸,我们的方法采用了截断的后缀树。该方法成功地应用于线粒体核苷酸序列和REFSeq的相应注释,超过2000种Metazoan物种。我们证明,尽管在给定的注释中,但该方法在虽然在给定的注释中存在错误。此外,它识别了Refseq注释中的数百个错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号