首页> 外文期刊>Emerging Topics in Computing, IEEE Transactions on >FacetsBase: A Key-Value Store Optimized for Querying on Scholarly Data
【24h】

FacetsBase: A Key-Value Store Optimized for Querying on Scholarly Data

机译:Facetsbase:针对学术数据查询的键值存储优化

获取原文
获取原文并翻译 | 示例
           

摘要

As an emerging topic, scholarly big data is the vast quantity of research output that requires sophisticated platforms and tools for creating applications that can benefit the research community. This paper addresses the applied research in storing, indexing, and querying scholarly big data. The relational databases, which employ a pre-defined and well-partitioned data model are not flexible, while the NoSQL databases lack sophisticated index and partition mechanisms. The proposed FacetsBase, which is a Hadoop-based key-value data store, combines the performance advantages of a relational database, the flexibility of a NoSQL database and the parallelism of a distributed file system. It partitions and indexes the publication information using the concept of facets, it stores facetsin a multi-dimensional logical data model and lower-cost file format, and it provides the attribute-specified query and attribute-unspecific query. In experiments, FacetsBase was compared with Hive, HBase, MongoDB, and Cassandra in terms of query performance. The results indicate that FacetsBase performs 1.4x, 3.8x, 1.4x, and 2.9x faster on average, respectively.
机译:作为一种新兴主题,学术大数据是大量的研究输出,需要精致的平台和工具来创建可以使研究界受益的应用程序。本文解决了存储,索引和查询学术大数据的应用研究。使用预定义和良好分区数据模型的关系数据库不灵活,而NoSQL数据库缺少复杂的索引和分区机制。所提出的FacetsBase是基于Hadoop的键值数据存储,结合了关系数据库的性能优势,NoSQL数据库的灵活性和分布式文件系统的并行性。它使用面部概念分区和索引发布信息,它将Facetsin存储了多维逻辑数据模型和较低成本的文件格式,它提供了属性指定的查询和属性非特定查询。在实验中,在查询性能方面与蜂巢,HBase,MongoDB和Cassandra进行比较。结果表明,平均值平均分别执行1.4倍,3.8倍,1.4倍和2.9倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号