
Nucleotide Sequence Similarity Search Using Techniques from Content-Based Image Retrieval



Abstract

The amount of DNA data continues to increase exponentially as a result of high-throughput next-generation sequencing. Current state-of-the-art tools for nucleotide sequence similarity search are not equipped to deal with this growth, and new thinking is needed to tackle the rising scalability challenges. This thesis investigates the experimental approach of translating DNA sequences into images and applying state-of-the-art techniques from the field of content-based image retrieval to index and search the resulting images. The challenges of translating DNA sequences into images are discussed and two algorithms for image generation are proposed. We look into the different feature descriptors that are available and evaluate them in the context of the generated images. Lastly, the approach as a whole is evaluated with the mean average precision metric, using BLAST as the gold standard reference. The results show that the proposed approach does not approach BLAST in retrieval performance, but offers a significant reduction in index size and thus better performance and scalability on large DNA databases.
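The abstract names mean average precision (MAP) as the evaluation metric, with BLAST hits serving as the reference set. The following is a minimal sketch of how that metric is commonly computed, not the thesis's own evaluation code. The function names, the query and sequence identifiers, and the normalisation by the number of relevant items are assumptions made for illustration.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision of one ranked result list against a relevant set."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant item occurs.
            precision_sum += hits / rank
    # Assumption: normalise by the size of the relevant set, so relevant
    # items that were never retrieved lower the score.
    return precision_sum / len(relevant)


def mean_average_precision(
    results: Dict[str, List[str]], gold: Dict[str, Set[str]]
) -> float:
    """Mean of the per-query average precision over all queries in results."""
    if not results:
        return 0.0
    total = sum(
        average_precision(ranked, gold.get(query, set()))
        for query, ranked in results.items()
    )
    return total / len(results)


# Hypothetical data: ranked hits from the system under test, and the
# sequences a reference tool (BLAST, in the thesis) reports for each query.
results = {"q1": ["s1", "s4", "s2"], "q2": ["s9", "s3"]}
gold = {"q1": {"s1", "s2"}, "q2": {"s3"}}
map_score = mean_average_precision(results, gold)
```

With this toy data, query `q1` scores (1/1 + 2/3) / 2 and query `q2` scores 1/2, so the mean is 2/3.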

Bibliographic record

  • Author

    Lysne Eivind

  • Year 2015
  • Original format PDF
  • Text language eng
