【24h】

The Role of Linked Data in Content Selection

机译:链接数据在内容选择中的作用

获取原文

摘要

This paper explores the appropriateness of utilizing Linked Data as a knowledge source for content selection. Content Selection is a crucial subtask in Natural Language Generation which has the function of determining the relevancy of contents from a knowledge source based on a communicative goal. The recent online era has enabled us to accumulate extensive amounts of generic online knowledge some of which has been made available as structured knowledge sources for computational natural language processing purposes. This paper proposes a model for content selection by utilizing a generic structured knowledge source, DBpedia, which is a replica of the unstructured counterpart, Wikipedia. The proposed model uses log likelihood to rank the contents from DBpedia Linked Data for relevance to a communicative goal. We performed experiments using DBpedia as the Linked Data resource using two keyword datasets as communicative goals. To optimize parameters we used keywords extracted from QALD-2 training dataset and QALD-2 testing dataset is used for the testing. The results was evaluated against the verbatim based selection strategy. The results showed that our model can perform 18.03% better than verbatim selection.
机译:本文探讨了利用链接数据作为内容选择的知识来源的适当性。内容选择是自然语言生成中至关重要的子任务,其功能是基于交流目标从知识源确定内容的相关性。最近的在线时代使我们能够积累大量的通用在线知识,其中一些已作为结构化知识源提供,用于计算自然语言处理。本文提出了一种利用通用结构化知识源DBpedia进行内容选择的模型,该资源是非结构化对应物Wikipedia的副本。所提出的模型使用对数似然来对DBpedia链接数据中的内容进行排名,以实现沟通目标。我们使用两个关键词数据集作为交流目标,使用DBpedia作为链接数据资源进行了实验。为了优化参数,我们使用从QALD-2训练数据集中提取的关键字,并使用QALD-2测试数据集进行测试。根据逐字选择策略对结果进行了评估。结果表明,我们的模型可以比逐字选择更好地执行18.03%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号