首页> 外文会议>IEEE International Conference on Data Engineering >An Efficient Parallel Keyword Search Engine on Knowledge Graphs
【24h】

An Efficient Parallel Keyword Search Engine on Knowledge Graphs

机译:知识图中的有效并行关键字搜索引擎

获取原文

摘要

Keyword search has recently become popular as a way to query relational databases, and even graphs, since it allows users to issue queries without learning a complex query language and data schema. Evaluating a keyword query is usually significantly more expensive than evaluating an equivalent selection query, since the query specification is less complete, and many alternative answers have to be considered by the system, requiring considerable effort to generate and compare. Current interest in big data and AI are putting even more demands on the efficiency of keyword search. In particular, searching of knowledge graphs is gaining popularity. As knowledge graphs often comprise many millions of nodes and edges, performing real-time search on graphs of this size is an open challenge. In this paper, we attempt to address this need by leveraging advances in hardware technologies, e.g. multi-core CPUs and GPUs. Specifically, we implement a parallel keyword search engine for Knowledge Bases (KB). To be able to do so, and to exploit parallelism, we devise a new approach to keyword search, based on a concept we introduce called Central Graph. Unlike the Group Steiner Tree (GST) model, widely used for keyword search, our approach can naturally work in parallel and still return compact answer graphs with rich information. Our approach can work in either multi-core CPUs or a single GPU. In particular, our GPU implementation is two to three orders of magnitudes faster than state-of-the-art keyword search method. We conduct extensive experiments to show that our approach is both efficient and effective.
机译:关键字搜索最近成为查询关系数据库的一种方式,甚至是图形,因为它允许用户在不学习复杂的查询语言和数据模式的情况下发出查询。评估关键字查询通常比评估等效选择查询更昂贵,因为查询规范不太完整,并且系统必须考虑许多替代答案,需要大量努力生成和比较。目前对大数据和AI的兴趣在于对关键字搜索的效率造成更多要求。特别是,搜索知识图表是越来越受欢迎的。作为知识图形通常包括许多数百万节点和边缘,执行对该大小的图形的实时搜索是开放挑战。在本文中,我们试图通过利用硬件技术的进步来解决这种需求,例如,多核CPU和GPU。具体而言,我们实施一个并行关键字搜索引擎,用于知识库(KB)。为了能够这样做,并利用并行性,我们根据我们介绍中央图的概念设计了一种新的关键字搜索方法。与Group Steiner树(GST)模型不同,广泛用于关键字搜索,我们的方法可以自然地正常工作,仍然返回具有丰富信息的紧凑答案图。我们的方法可以在多核CPU或单个GPU中工作。特别是,我们的GPU实现比最先进的关键字搜索方法快两到三个数量级。我们进行了广泛的实验,以表明我们的方法既有效又有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号