首页> 外文会议> >Improving RDF Query Performance Using In-memory Virtual Columns in Oracle Database
【24h】

Improving RDF Query Performance Using In-memory Virtual Columns in Oracle Database

机译:使用Oracle数据库中的内存虚拟列提高RDF查询性能

获取原文

摘要

Many RDF Knowledge Graph Stores use IDs to represent triples to save storage and for ease of maintenance. Oracle is no exception. While this design is good for a small footprint on disk, it incurs overhead in query processing, as it requires joins with the value table to return results or process aggregates/filter/order-by queries. It becomes especially problematic as the result size increases or the number of projected variables increases. Depending on queries, the value table join could take up most of the query processing time. In this paper, we propose to use in-memory virtual columns to avoid value table joins. It has advantages in that it does not increase the footprint on disk, and at the same time it avoids the value table joins by utilizing in-memory virtual columns. The idea is to materialize the values only in memory and utilize the compression and vector processing that come with Oracle Database In-Memory technology. Typically, the value table in RDF is small compared to the triples table. Therefore, its footprint is manageable in memory, especially with compression in columnar format. The same mechanism can be applied to any application where there exists a one-to-one mapping between an ID and its value, such as data warehousing or data marts. The mechanism has been implemented in Oracle 18c. Experimental results using the LUBM1000 benchmark show up to two orders of magnitude query performance improvement.
机译:许多RDF知识图存储使用ID来表示三元组,以节省存储空间并简化维护。 Oracle也不例外。尽管此设计适合占用较小的磁盘空间,但由于需要与值表联接以返回结果或处理聚合/筛选/排序查询,因此会增加查询处理的开销。随着结果大小的增加或预计变量数量的增加,这尤其成问题。根据查询,值表联接可能会占用大部分查询处理时间。在本文中,我们建议使用内存中的虚拟列来避免值表联接。它的优点是它不会增加磁盘上的占用空间,并且同时通过利用内存中的虚拟列避免了值表的联接。想法是仅在内存中实现值,并利用Oracle数据库内存技术随附的压缩和矢量处理。通常,与三元组表相比,RDF中的值表较小。因此,其占用空间可在内存中管理,尤其是采用列格式压缩时。可以将相同的机制应用于ID与ID值之间存在一对一映射的任何应用程序,例如数据仓库或数据集市。该机制已在Oracle 18c中实现。使用LUBM1000基准测试的实验结果显示查询性能提高了两个数量级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号