An Efficient Parallel Keyword Search Engine on Knowledge Graphs

机译：知识图中的有效并行关键字搜索引擎

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Keyword search has recently become popular as a way to query relational databases, and even graphs, since it allows users to issue queries without learning a complex query language and data schema. Evaluating a keyword query is usually significantly more expensive than evaluating an equivalent selection query, since the query specification is less complete, and many alternative answers have to be considered by the system, requiring considerable effort to generate and compare. Current interest in big data and AI are putting even more demands on the efficiency of keyword search. In particular, searching of knowledge graphs is gaining popularity. As knowledge graphs often comprise many millions of nodes and edges, performing real-time search on graphs of this size is an open challenge. In this paper, we attempt to address this need by leveraging advances in hardware technologies, e.g. multi-core CPUs and GPUs. Specifically, we implement a parallel keyword search engine for Knowledge Bases (KB). To be able to do so, and to exploit parallelism, we devise a new approach to keyword search, based on a concept we introduce called Central Graph. Unlike the Group Steiner Tree (GST) model, widely used for keyword search, our approach can naturally work in parallel and still return compact answer graphs with rich information. Our approach can work in either multi-core CPUs or a single GPU. In particular, our GPU implementation is two to three orders of magnitudes faster than state-of-the-art keyword search method. We conduct extensive experiments to show that our approach is both efficient and effective.

机译：关键字搜索最近成为查询关系数据库的一种方式，甚至是图形，因为它允许用户在不学习复杂的查询语言和数据模式的情况下发出查询。评估关键字查询通常比评估等效选择查询更昂贵，因为查询规范不太完整，并且系统必须考虑许多替代答案，需要大量努力生成和比较。目前对大数据和AI的兴趣在于对关键字搜索的效率造成更多要求。特别是，搜索知识图表是越来越受欢迎的。作为知识图形通常包括许多数百万节点和边缘，执行对该大小的图形的实时搜索是开放挑战。在本文中，我们试图通过利用硬件技术的进步来解决这种需求，例如，多核CPU和GPU。具体而言，我们实施一个并行关键字搜索引擎，用于知识库（KB）。为了能够这样做，并利用并行性，我们根据我们介绍中央图的概念设计了一种新的关键字搜索方法。与Group Steiner树（GST）模型不同，广泛用于关键字搜索，我们的方法可以自然地正常工作，仍然返回具有丰富信息的紧凑答案图。我们的方法可以在多核CPU或单个GPU中工作。特别是，我们的GPU实现比最先进的关键字搜索方法快两到三个数量级。我们进行了广泛的实验，以表明我们的方法既有效又有效。

著录项

来源
《IEEE International Conference on Data Engineering》|2019年|721p|共12页
会议地点
作者
Yueji Yang; Divykant Agrawal; H. V. Jagadish; Anthony K. H. Tung; Shuang Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据处理、数据处理系统;
关键词
Keyword search; Steiner trees; Resource description framework; Database languages; Hardware; Knowledge based systems; Relational databases;

机译：关键字搜索;施泰纳树;资源描述框架;数据库语言;硬件;基于知识的系统;关系数据库;

相似文献

外文文献
中文文献
专利

1. Ontology Based Efficient Multi Keyword Query Interface for Search Engines [J] . S. Jayasundar, V.N. Rajavarman, V. Saishanmuga Raja Asian Journal of Information Technology . 2018,第3期

机译：基于本体的搜索引擎高效多关键字查询界面
2. An Efficient Approach for Keyword Selection ; Improving Accessibility of Web Contents by General Search Engines [J] . H. H. Kian, M. Zahedi International Journal of Web & Semantic Technology . 2011,第4期

机译：一种有效的关键字选择方法;通用搜索引擎改善Web内容的可访问性
3. Sync/Async parallel search for the efficient design and construction of web search engines [J] . Mauricio Marin, Veronica Gil-Costa, Carolina Bonacic, Parallel Computing . 2010,第4期

机译：同步/异步并行搜索，可有效设计和构建Web搜索引擎
4. An Efficient Parallel Keyword Search Engine on Knowledge Graphs [C] . Yueji Yang, Divykant Agrawal, H. V. Jagadish, IEEE International Conference on Data Engineering . 2019

机译：基于知识图的高效并行关键字搜索引擎
5. Studying the effect of parallelization on the performance of Andromeda Search Engine: A search engine for peptides. [D] . Shah, Jigna. 2015

机译：研究并行化对仙女座搜索引擎性能的影响：肽搜索引擎。
6. Interest in Anesthesia as Reflected by Keyword Searches using Common Search Engines [O] . Renyu Liu, Paul S. García, Lee A. Fleisher -1

机译：使用常用搜索引擎的关键字搜索反映的麻醉感兴趣
7. Engineering Knowledge Graph for Keyword Discovery in Patent Search [O] . Serhad Sarica, Binyang Song, En Low, 2019

机译：专利搜索中的关键字发现的工程知识图

An Efficient Parallel Keyword Search Engine on Knowledge Graphs

摘要

著录项

相似文献

相关主题

期刊订阅