DISTRIBUTED STORAGE METHOD AND ARCHITECTURE FOR GENE VARIATION DATA

Abstract

Disclosed are a distributed storage method and architecture for gene variation data. The method comprises a distributed data storage process, a distributed bitmap index creation process, and a distributed query and retrieval process. The architecture comprises a distributed column storage module, a distributed bitmap index module, and a query and retrieval module. The method stores data in a distributed manner using the new columnar storage engine Kudu and builds distributed local bitmap indexes on the sample columns. This addresses the low random-access performance of existing HDFS-based solutions and the poor batch-analysis performance of HBase-based solutions, simplifies the storage architecture model, and removes the genotype query tool's dependence on multiple separate tools. The distributed local bitmap index scheme provides high concurrency and improves scalability.
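The abstract does not describe how the local bitmap indexes are laid out, so the following is only a minimal sketch of the general idea, written in plain Python rather than against the Kudu API. It assumes that variant rows are split across partitions, that each partition keeps one bitmap per (sample, genotype) pair over its own rows, and that a query is answered by every partition independently before the results are merged. The names `PartitionBitmapIndex`, `build_local_indexes`, and `distributed_query`, the round-robin partitioning, and the use of Python integers as bitsets are all illustrative assumptions, not details taken from the patent.

```python
from collections import defaultdict


class PartitionBitmapIndex:
    """Hypothetical local bitmap index over the sample columns of one partition."""

    def __init__(self, rows, samples):
        # rows: list of (variant_id, {sample_name: genotype}) held by this partition.
        self.variant_ids = [variant_id for variant_id, _ in rows]
        # (sample, genotype) -> Python int used as a bitset; bit i refers to local row i.
        self.bitmaps = defaultdict(int)
        for pos, (_, genotypes) in enumerate(rows):
            for sample in samples:
                self.bitmaps[(sample, genotypes[sample])] |= 1 << pos

    def query(self, conditions):
        # Start with every local row selected, then AND in one bitmap per condition.
        hits = (1 << len(self.variant_ids)) - 1
        for condition in conditions:
            hits &= self.bitmaps.get(condition, 0)
        return [v for pos, v in enumerate(self.variant_ids) if (hits >> pos) & 1]


def build_local_indexes(rows, samples, n_partitions):
    # Assumed round-robin partitioning; a real deployment would use the
    # storage engine's own partitioning scheme instead.
    partitions = [[] for _ in range(n_partitions)]
    for i, row in enumerate(rows):
        partitions[i % n_partitions].append(row)
    return [PartitionBitmapIndex(part, samples) for part in partitions]


def distributed_query(indexes, conditions):
    # Each partition answers from its own index; only the matches are merged.
    matches = []
    for index in indexes:
        matches.extend(index.query(conditions))
    return sorted(matches)
```

With indexes built over a handful of rows, a call such as `distributed_query(indexes, [("S1", "0/1"), ("S2", "0/1")])` returns the identifiers of variants for which both samples are heterozygous. Because each partition consults only its own bitmaps, the partitions could in principle be queried concurrently, which is the property the abstract attributes to the local index scheme.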