AN EFFICIENT METHOD FOR COMPRESSING AND SEARCHING GENOMIC DATABASES

Abstract

Biological databases are growing significantly, as is the number of queries directed at them. In 2005, the genomic databases at the National Center for Biotechnology Information (NCBI) received about 50 million web hits per day, at peak rates of about 1,900 hits per second. As these databases become more popular, there is increased demand to make them faster and more efficient. In this paper, we propose a method for compressing and searching selected genome databases using techniques appropriate for computers of virtually any size. This search technique is expected to produce its best results with large search sequences against large DNA databases, and lends itself to parallel computation with little communication overhead. Because the compression algorithm uses a lossless binary encoding format, search results are exact, not approximate. Furthermore, searches take place on the compressed data, obviating the need for decompression prior to executing a search.
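
The abstract does not give the encoding or the search procedure, so the following is only a minimal sketch of the general idea (a lossless binary encoding that is searched without decompression), not the authors' method. It assumes a hypothetical 2-bit-per-base code over the alphabet A, C, G, T; the names `CODE`, `pack`, and `search_packed` are invented for illustration.

```python
from typing import List, Tuple

# Hypothetical 2-bit code per nucleotide (an assumption; the paper's actual
# encoding is not stated in the abstract).
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}


def pack(seq: str) -> Tuple[int, int]:
    """Pack a DNA string into one integer, 2 bits per base.

    Returns (bits, length). The length is kept separately because leading
    'A' bases encode as zero bits and would otherwise be lost.
    """
    bits = 0
    for base in seq:
        bits = (bits << 2) | CODE[base]
    return bits, len(seq)


def search_packed(db: Tuple[int, int], query: Tuple[int, int]) -> List[int]:
    """Return 0-based positions of exact matches of query in db.

    Only packed bit patterns are compared; nothing is decoded back to text.
    """
    db_bits, n = db
    q_bits, m = query
    if m == 0 or m > n:
        return []
    mask = (1 << (2 * m)) - 1  # selects a window of m bases
    hits = []
    for pos in range(n - m + 1):
        # The first base sits in the most significant bits, so the window
        # starting at pos ends 2 * (n - m - pos) bits above the low end.
        shift = 2 * (n - m - pos)
        if (db_bits >> shift) & mask == q_bits:
            hits.append(pos)
    return hits


database = pack("ACGTACGT")
print(search_packed(database, pack("CGT")))
```

Because the code is lossless and fixed-width, a bit-pattern match corresponds to an exact sequence match. A practical implementation would work on fixed-size machine words and could split the database into independent chunks (overlapping by the query length minus one base) for parallel search; this sketch shows neither.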

Bibliographic details

Source: European Conference on Modelling and Simulation, 2007, 7 pages
Authors: Jeffrey B. Wallace; Gregory L. Vert; Sara Nasser
Original format: PDF
Chinese Library Classification: TP15-53
Keywords: Genomic sequence; Search; Algorithm; Compression; Bioinformatics; Database

Similar literature

1. An Efficient Analysis Method for LTCC Ridge Waveguide Bandpass Filters via Database Searching [J]. Xinmi Yang, Jiayue Li, Xueguan Liu. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 2021, No. 2.
2. An Efficient Method for k Nearest Neighbor Searching in Obstructed Spatial Databases [J]. Yu Gu, Ge Yu, Xiaonan Yu. Journal of Information Science and Engineering. 2014, No. 5.
3. Compressed Binary Bit Trees: A New Data Structure For Accelerating Database Searching [J]. Smellie A. Journal of Chemical Information and Modeling. 2009, No. 2.
4. An Efficient Method for Compressing and Searching Genomic Databases [C]. Jeffrey B. Wallace, Gregory L. Vert, Sara Nasser. European Conference on Modelling and Simulation; ECMS 2007; High Performance Computing and Simulation Conference; HPCS 2007. 2007.
5. An efficient method for searching compressed genomic databases [D]. Wallace, Jeffrey B. 2008.
6. Efficient strategies for genomic searching using the affected-pedigree-member method of linkage analysis [O]. D. L. Brown, M. B. Gorin, D. E. Weeks. 1994.
7. An efficient index-based protein structure database searching method [O]. Zeyar Aung, Wei Fu, Kian-lee Tan. 2003.
