An Efficient Parallel Keyword Search Engine on Knowledge Graphs

机译：基于知识图的高效并行关键字搜索引擎

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Keyword search has recently become popular as a way to query relational databases, and even graphs, since it allows users to issue queries without learning a complex query language and data schema. Evaluating a keyword query is usually significantly more expensive than evaluating an equivalent selection query, since the query specification is less complete, and many alternative answers have to be considered by the system, requiring considerable effort to generate and compare. Current interest in big data and AI are putting even more demands on the efficiency of keyword search. In particular, searching of knowledge graphs is gaining popularity. As knowledge graphs often comprise many millions of nodes and edges, performing real-time search on graphs of this size is an open challenge. In this paper, we attempt to address this need by leveraging advances in hardware technologies, e.g. multi-core CPUs and GPUs. Specifically, we implement a parallel keyword search engine for Knowledge Bases (KB). To be able to do so, and to exploit parallelism, we devise a new approach to keyword search, based on a concept we introduce called Central Graph. Unlike the Group Steiner Tree (GST) model, widely used for keyword search, our approach can naturally work in parallel and still return compact answer graphs with rich information. Our approach can work in either multi-core CPUs or a single GPU. In particular, our GPU implementation is two to three orders of magnitudes faster than state-of-the-art keyword search method. We conduct extensive experiments to show that our approach is both efficient and effective.

机译：关键字搜索最近已成为查询关系数据库甚至图表的一种方式，因为它允许用户在不学习复杂的查询语言和数据模式的情况下发出查询。与查询等效选择查询相比，评估关键字查询通常比评估等效选择查询昂贵得多，因为查询规范不那么完整，并且系统必须考虑许多替代答案，这需要花费大量精力来生成和比较。当前对大数据和AI的兴趣对关键字搜索的效率提出了更高的要求。尤其是，知识图的搜索正变得越来越流行。由于知识图通常包含数百万个节点和边，因此在这种大小的图上执行实时搜索是一个开放的挑战。在本文中，我们尝试通过利用硬件技术的进步来满足这一需求，例如多核CPU和GPU。具体来说，我们为知识库（KB）实现了并行的关键字搜索引擎。为了做到这一点并利用并行性，我们基于引入的称为“中心图”的概念，设计了一种新的关键字搜索方法。与广泛用于关键字搜索的Group Steiner Tree（GST）模型不同，我们的方法可以自然并行运行，并且仍然返回具有丰富信息的紧凑型答案图。我们的方法可以在多核CPU或单个GPU中工作。特别是，我们的GPU实现比最先进的关键字搜索方法快两到三个数量级。我们进行了广泛的实验，以证明我们的方法既有效又有效。

著录项

来源
《IEEE International Conference on Data Engineering》|2019年|338-349|共12页
会议地点
作者
Yueji Yang; Divykant Agrawal; H. V. Jagadish; Anthony K. H. Tung; Shuang Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Keyword search; Steiner trees; Resource description framework; Database languages; Hardware; Knowledge based systems; Relational databases;

机译：关键字搜索; Steiner树;资源描述框架;数据库语言;硬件;基于知识的系统;关系数据库;

相似文献

外文文献
中文文献
专利

1. Ontology Based Efficient Multi Keyword Query Interface for Search Engines [J] . S. Jayasundar, V.N. Rajavarman, V. Saishanmuga Raja Asian Journal of Information Technology . 2018,第3期

机译：基于本体的搜索引擎高效多关键字查询界面
2. An Efficient Approach for Keyword Selection ; Improving Accessibility of Web Contents by General Search Engines [J] . H. H. Kian, M. Zahedi International Journal of Web & Semantic Technology . 2011,第4期

机译：一种有效的关键字选择方法;通用搜索引擎改善Web内容的可访问性
3. Sync/Async parallel search for the efficient design and construction of web search engines [J] . Mauricio Marin, Veronica Gil-Costa, Carolina Bonacic, Parallel Computing . 2010,第4期

机译：同步/异步并行搜索，可有效设计和构建Web搜索引擎
4. An Efficient Parallel Keyword Search Engine on Knowledge Graphs [C] . Yueji Yang, Divykant Agrawal, H. V. Jagadish, IEEE International Conference on Data Engineering . 2019

机译：知识图中的有效并行关键字搜索引擎
5. Studying the effect of parallelization on the performance of Andromeda Search Engine: A search engine for peptides. [D] . Shah, Jigna. 2015

机译：研究并行化对仙女座搜索引擎性能的影响：肽搜索引擎。
6. Interest in Anesthesia as Reflected by Keyword Searches using Common Search Engines [O] . Renyu Liu, Paul S. García, Lee A. Fleisher -1

机译：使用常用搜索引擎的关键字搜索反映的麻醉感兴趣
7. Engineering Knowledge Graph for Keyword Discovery in Patent Search [O] . Serhad Sarica, Binyang Song, En Low, 2019

机译：专利搜索中的关键字发现的工程知识图

An Efficient Parallel Keyword Search Engine on Knowledge Graphs

摘要

著录项

相似文献

相关主题

期刊订阅