Towards the Completion of a Domain-Specific Knowledge Base with Emerging Query Terms

机译：朝完成具有新兴查询条款的域特定知识库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Domain-specific knowledge bases play an increasingly important role in a variety of real applications. In this paper, we use the product knowledge base in the largest Chinese e-commerce platform, Taobao, as an example to investigate a completion procedure of a domain-specific knowledge base. We argue that the domain-specific knowledge bases tend to be incomplete, and are oblivious to their incompleteness, without a continuous completion procedure in place. The key component of this completion procedure is the classification of emerging query terms into corresponding properties of categories in existing taxonomy. Our proposal is that we use query logs to complete the product knowledge base of Taobao. However, the query driven completion usually faces many challenges including distinguishing the fine-grained semantic of unrecognized terms, handling the sparse data and so on. We propose a graph based solution to overcome these challenges. We first construct a lot of positive evidence to establish the semantical similarity between terms, and then run a shortest path or alternatively a random walk on the similarity graph under a set of constraints derived from a set of negative evidence to find the best candidate property for emerging query terms. We finally conduct extensive experiments on real data of Taobao and a subset of CN-DBpedia. The results show that our solution classifies emerging query terms with a good performance. Our solution is already deployed in Taobao, helping it find nearly 7 million new values for properties. The complete product knowledge base significantly improves the ratio of recognized queries and recognized terms by more than 25% and 32%, respectively.

机译：域特定知识库在各种真实应用中起着越来越重要的作用。在本文中，我们使用中国最大的电子商务平台淘宝的产品知识库作为调查域特定知识库的完成过程的示例。我们认为，具体领域的知识库往往是不完整的，并且不完全不完整，没有连续完成程序。此完成程序的关键组成部分是在现有分类中的类别的相应属性中进行新出现的查询条款的分类。我们的提议是我们使用查询日志来完成淘宝的产品知识库。然而，查询驱动的完成通常面临许多挑战，包括区分微粒语义的无法识别的术语，处理稀疏数据等。我们提出了一种基于图形的解决方案来克服这些挑战。我们首先构建许多积极的证据来建立术语之间的语义相似性，然后在一组否定证据的一组约束下运行最短路径或在相似性图中运行随机步行，以找到最佳候选物业新兴查询条款。我们终于对淘宝的真实数据和CN-DBPedia的子集进行了广泛的实验。结果表明，我们的解决方案通过良好的性能对新兴查询术语进行分类。我们的解决方案已部署在淘宝中，帮助它找到近700万的物业价值。完整的产品知识库显着提高了公认的查询与公认的术语，分别超过25％和32％。

著录项

来源
《IEEE International Conference on Data Engineering》|2019年|721p|共12页
会议地点
作者
Sihang Jiang; Jiaqing Liang; Yanghua Xiao; Haihong Tang; Haikuan Huang; Jun Tan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据处理、数据处理系统;
关键词
Knowledge based systems; Semantics; User experience; Shape; Dogs; Painting;

机译：基于知识的系统;语义;用户体验;形状;狗;绘画;

相似文献

外文文献
中文文献
专利

1. Learning from homologous queries and semantically related terms for query auto completion [J] . Fei Cai, Maarten de Rijke Information Processing & Management . 2016,第4期

机译：从同源查询和语义相关术语中学习以实现查询自动完成
2. Constructing Query Context Knowledge Bases for Relevant Term Suggestion [J] . Wang Jenq-Haur, Shih Meng-Han Journal of information science and engineering . 2015,第2期

机译：构建相关术语建议的查询上下文知识库
3. KnowPoetry: A Knowledge Service Platform for Tang Poetry Research Based on Domain-Specific Knowledge Graph [J] . LIANG HONG, WENJUN HUO, LINA ZHOU Library trends . 2020,第1期

机译：知识：基于域特定知识图的唐诗研究的知识服务平台
4. Towards the Completion of a Domain-Specific Knowledge Base with Emerging Query Terms [C] . Sihang Jiang, Jiaqing Liang, Yanghua Xiao, IEEE International Conference on Data Engineering . 2019

机译：借助新兴查询词完成领域特定的知识库
5. Domain-specific knowledge-based information retrieval model using knowledge reduction. [D] . Yoon, Changwoo. 2005

机译：使用知识约简的基于特定领域知识的信息检索模型。
6. Knowledge-Based Query Construction Using the CDSS Knowledge Base for Efficient Evidence Retrieval [O] . Muhammad Afzal, Maqbool Hussain, Taqdir Ali, 2015

机译：使用CDSS知识库的基于知识的查询构造可进行有效的证据检索
7. Using Domain-Specific Term Frequencies to Identify and Classify Health Queries [O] . Carla Teixeira Lopes, Daniela Dias, Cristina Ribeiro 2013

机译：使用域特定的术语频率来标识和分类健康查询
8. Natural Language Data Base Query:Using the Data Base Itself as the Definition of World Knowledge and as an Extension of the Dictionary. [R] . harris,larry r. 1977

机译：自然语言数据库查询：使用数据库本身作为世界知识的定义和词典的扩展。

Towards the Completion of a Domain-Specific Knowledge Base with Emerging Query Terms

摘要

著录项

相似文献

相关主题

期刊订阅