DISTRIBUTED STORAGE METHOD AND ARCHITECTURE FOR GENE VARIATION DATA
Abstract
A distributed storage method and architecture for gene variation data. The method comprises a distributed data storage process, a distributed bitmap index creation process, and a distributed query and retrieval process. The architecture comprises a distributed column storage module, a distributed bitmap index module, and a query and retrieval module. Data is stored in distributed form using a new columnar storage engine, Kudu, and distributed local bitmap indexes are built over the sample columns. This resolves the low random-access performance of existing HDFS-based solutions and the poor batch-analysis performance of HBase-based solutions, simplifies the storage architecture model, and removes the genotype query tool's dependence on multiple other tools. The distributed local bitmap index scheme also enables high concurrency and improves scalability.
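The idea of a local bitmap index over sample columns can be illustrated with a minimal sketch. It is not the patented implementation and does not use Kudu: the shards are plain Python lists standing in for storage partitions, and the data, the round-robin partitioning, the "non-reference genotype" bit rule, and all function names are assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical toy data: (variant_id, {sample: genotype}).
# "0/0" is taken to mean homozygous reference (no variant carried).
VARIANTS = [
    ("chr1:1000:A>G", {"S1": "0/1", "S2": "0/0", "S3": "1/1"}),
    ("chr1:2000:C>T", {"S1": "0/0", "S2": "0/1", "S3": "0/1"}),
    ("chr2:500:G>A",  {"S1": "1/1", "S2": "0/1", "S3": "0/0"}),
    ("chr2:900:T>C",  {"S1": "0/1", "S2": "0/0", "S3": "0/1"}),
]


def partition(rows, num_shards):
    """Round-robin split; an assumed stand-in for the engine's partitions."""
    shards = [[] for _ in range(num_shards)]
    for i, row in enumerate(rows):
        shards[i % num_shards].append(row)
    return shards


def build_local_index(shard):
    """Build one bitmap per sample column for a single shard.

    Each bitmap is a Python int; bit i is set when row i of this shard
    has a non-reference genotype for that sample.
    """
    index = defaultdict(int)
    for pos, (_, genotypes) in enumerate(shard):
        for sample, gt in genotypes.items():
            if gt != "0/0":
                index[sample] |= 1 << pos
    return dict(index)


def query_shard(shard, index, samples):
    """AND the bitmaps of the requested samples, then decode the set bits."""
    bits = (1 << len(shard)) - 1  # start with every row selected
    for sample in samples:
        bits &= index.get(sample, 0)  # unknown sample matches nothing
    return [shard[pos][0] for pos in range(len(shard)) if (bits >> pos) & 1]


def query(shards, indexes, samples):
    """Run the query on each shard independently and merge the hits."""
    hits = []
    for shard, index in zip(shards, indexes):
        hits.extend(query_shard(shard, index, samples))
    return sorted(hits)


shards = partition(VARIANTS, 2)
indexes = [build_local_index(s) for s in shards]
result = query(shards, indexes, ["S1", "S3"])
print(result)
```

Because each shard holds its own index, a shard can answer its part of the query without consulting the others, which is the property the abstract associates with high concurrency and scalability. A real deployment would likely use compressed bitmaps rather than Python integers.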