Conference: International Conference on Applications of Natural Language to Information Systems

Document Aboutness via Sophisticated Syntactic and Semantic Features

Abstract

The document aboutness problem asks for a succinct representation of a document's subject matter in the form of keywords, sentences, or entities drawn from a Knowledge Base. In this paper we propose an approach to this problem that improves on the known solutions across all known datasets [4,19]. The approach is based on a wide and detailed experimental study of syntactic and semantic features extracted from the input document with IR/NLP tools. To encourage and support reproducible experimental results on this task, we will make our system accessible via a public API: this is the first, and best performing, publicly available tool for the document aboutness problem.
