首页> 外文会议>International Conference on Intelligent Information Systems >Related Entity Finding Using Semantic Clustering Based on Wikipedia Categories
【24h】

Related Entity Finding Using Semantic Clustering Based on Wikipedia Categories

机译:相关实体使用基于维基百科类别的语义聚类

获取原文

摘要

We present a system that performs Related Entity Finding, that is, Question Answering that exploits Semantic Information from the WWW and returns URIs as answers. Our system uses a search engine to gather all candidate answer entities and then a linear combination of Information Retrieval measures to choose the most relevant. For each one we look up its Wikipedia page and construct a novel vector representation based on the tokenization of the Wikipedia category names. This novel representation gives our system the ability to compute a measure of semantic relatedness between entities, even if the entities do not share any common category. We use this property to perform a semantic clustering of the candidate entities and show that the biggest cluster contains entities that are closely related semantically and can be considered as answers to the query. Performance measured on 20 topics from the 2009 TREC Related Entity Finding task shows competitive results.
机译:我们展示了一个执行相关实体查找的系统,即问题应答从WWW中利用语义信息并将URIS作为答案返回。我们的系统使用搜索引擎收集所有候选答案实体,然后是信息检索措施的线性组合,以选择最相关的。对于每个我们查找其维基百科页面并根据Wikipedia类别名称的标记构建一个新颖的向量表示。这种新颖的表示使我们的系统能够计算实体之间的语义相关性度量的能力,即使实体不共享任何共同类别。我们使用此属性执行候选实体的语义群集,并显示最大的群集包含语义密切相关的实体,并且可以被视为查询的答案。从2009年TREC相关实体查找任务的20个主题测量的性能显示竞争结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号