
Optimizing DNA sequences using Tetra-nucleotide RankList


Abstract

Recent advances in the life sciences have generated massive amounts of genomic data, and storing that much data requires a great deal of memory. An efficient algorithm, however, can reduce the size of the dataset and save storage. In this paper we propose an algorithm that uses a Tetra-nucleotide RankList to reduce the storage required for DNA sequences in a database while keeping them easy to retrieve in a time-efficient manner. The Tetra-nucleotide RankList was generated by testing several DNA sequences to confirm that the RankList is uniform over all DNA sequences. The algorithm was applied to the genome sequences of different bacteria and viruses, and the space needed to store those DNA sequences was reduced by up to 30%.
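
The abstract does not spell out how the RankList is built or how sequences are encoded with it, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes the RankList is an ordering of all 256 tetranucleotides by their frequency in a set of training sequences, and that each non-overlapping tetranucleotide is then stored as its one-byte rank. The function names (build_ranklist, encode, decode) and the one-byte encoding are hypothetical, the input is assumed to contain only A, C, G and T, and the sketch makes no attempt to reproduce the reported 30% figure.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
# All 4**4 = 256 possible tetranucleotides, in alphabetical order.
ALL_TETRAMERS = ["".join(p) for p in product(BASES, repeat=4)]


def build_ranklist(sequences):
    """Rank the 256 tetranucleotides by frequency in the training sequences.

    Assumption: frequency is counted over non-overlapping 4-base windows.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(0, len(seq) - 3, 4):
            counts[seq[i:i + 4]] += 1
    # Most frequent first; ties broken alphabetically for a deterministic order.
    return sorted(ALL_TETRAMERS, key=lambda t: (-counts[t], t))


def encode(seq, ranklist):
    """Store each tetranucleotide as its one-byte rank (hypothetical encoding).

    Returns the encoded bytes plus the 0-3 leftover bases that do not fill
    a complete tetranucleotide.
    """
    rank_of = {t: r for r, t in enumerate(ranklist)}
    seq = seq.upper()
    cut = len(seq) - len(seq) % 4
    body = bytes(rank_of[seq[i:i + 4]] for i in range(0, cut, 4))
    return body, seq[cut:]


def decode(body, tail, ranklist):
    """Invert encode(): map each rank back to its tetranucleotide."""
    return "".join(ranklist[r] for r in body) + tail
```

A round trip such as decode(*encode(seq, ranklist), ranklist) should return the original sequence. Because frequent tetranucleotides receive small rank values under this scheme, the encoded bytes are skewed toward low values, which a later entropy-coding stage could exploit; whether the paper takes that route is not stated in the abstract.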
