首页> 外文会议>IEEE International Conference on Data Engineering >An Efficient Parallel Keyword Search Engine on Knowledge Graphs
【24h】

An Efficient Parallel Keyword Search Engine on Knowledge Graphs

机译:基于知识图的高效并行关键字搜索引擎

获取原文

摘要

Keyword search has recently become popular as a way to query relational databases, and even graphs, since it allows users to issue queries without learning a complex query language and data schema. Evaluating a keyword query is usually significantly more expensive than evaluating an equivalent selection query, since the query specification is less complete, and many alternative answers have to be considered by the system, requiring considerable effort to generate and compare. Current interest in big data and AI are putting even more demands on the efficiency of keyword search. In particular, searching of knowledge graphs is gaining popularity. As knowledge graphs often comprise many millions of nodes and edges, performing real-time search on graphs of this size is an open challenge. In this paper, we attempt to address this need by leveraging advances in hardware technologies, e.g. multi-core CPUs and GPUs. Specifically, we implement a parallel keyword search engine for Knowledge Bases (KB). To be able to do so, and to exploit parallelism, we devise a new approach to keyword search, based on a concept we introduce called Central Graph. Unlike the Group Steiner Tree (GST) model, widely used for keyword search, our approach can naturally work in parallel and still return compact answer graphs with rich information. Our approach can work in either multi-core CPUs or a single GPU. In particular, our GPU implementation is two to three orders of magnitudes faster than state-of-the-art keyword search method. We conduct extensive experiments to show that our approach is both efficient and effective.
机译:关键字搜索最近已成为查询关系数据库甚至图表的一种方式,因为它允许用户在不学习复杂的查询语言和数据模式的情况下发出查询。与查询等效选择查询相比,评估关键字查询通常比评估等效选择查询昂贵得多,因为查询规范不那么完整,并且系统必须考虑许多替代答案,这需要花费大量精力来生成和比较。当前对大数据和AI的兴趣对关键字搜索的效率提出了更高的要求。尤其是,知识图的搜索正变得越来越流行。由于知识图通常包含数百万个节点和边,因此在这种大小的图上执行实时搜索是一个开放的挑战。在本文中,我们尝试通过利用硬件技术的进步来满足这一需求,例如多核CPU和GPU。具体来说,我们为知识库(KB)实现了并行的关键字搜索引擎。为了做到这一点并利用并行性,我们基于引入的称为“中心图”的概念,设计了一种新的关键字搜索方法。与广泛用于关键字搜索的Group Steiner Tree(GST)模型不同,我们的方法可以自然并行运行,并且仍然返回具有丰富信息的紧凑型答案图。我们的方法可以在多核CPU或单个GPU中工作。特别是,我们的GPU实现比最先进的关键字搜索方法快两到三个数量级。我们进行了广泛的实验,以证明我们的方法既有效又有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号