...
首页> 外文期刊>Concurrency and Computation >Answering biological questions by querying k-mer databases
【24h】

Answering biological questions by querying k-mer databases

机译:通过查询k-mer数据库回答生物学问题

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper describes a k-mer approach to analysing DNA data and quickly answering certain types of ad hoc biological questions. These k-mers (short DNA strings) are stored in a conventional relational database and indexed to support efficient exact match operations. We show that k-mers around 20-25 bases long have interesting and useful uniqueness properties that can be used to compute a 'relatedness' metric and also allow k-mers to be used as 'unique enough' tags to identify organisms and genes. This relatedness metric is used in SQL queries that can directly answer questions such as how two related species differ, and what genes are unique to an organism. The k-mer tags have proven useful in applications, largely metagenomic ones that can quickly process large volumes of sequencing data to say something about what organisms and genes might be present in an environmental sample. All of this work is based on simple and fast exact matches of k-mer strings using a database, rather than conventional alignment based on inexact matches of much longer strings. These k-mer tools provide ways of rapidly exploring large genome spaces and handling large volumes of sequence data, and complement rather than replace existing alignment and assembly tools.
机译:本文介绍了一种k-mer方法,用于分析DNA数据并快速回答某些类型的特殊生物学问题。这些k-mers(短DNA字符串)存储在常规的关系数据库中,并进行索引以支持有效的精确匹配操作。我们表明,大约20至25个碱基长的k-mers具有有趣且有用的独特性,可用于计算“相关性”量度,还可以将k-mers用作“足够独特”的标签来识别生物和基因。此关联性度量标准用于SQL查询中,该查询可以直接回答诸如两个相关物种之间的差异以及某个生物体独特的基因之类的问题。事实证明,k-mer标签在应用中很有用,主要是宏基因组标签,可以快速处理大量测序数据,以说出环境样品中可能存在哪些生物和基因。所有这些工作都是基于使用数据库对k-mer字符串进行简单,快速的精确匹配,而不是基于更长字符串的不精确匹配的常规比对。这些k-mer工具提供了快速探索大型基因组空间和处理大量序列数据的方法,并且补充而不是替代现有的比对和组装工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号