首页> 外文会议>International Conference on Theory and Practice of Digital Libraries >Context-Sensitive Ranking Using Cross-Domain Knowledge for Chemical Digital Libraries
【24h】

Context-Sensitive Ranking Using Cross-Domain Knowledge for Chemical Digital Libraries

机译:使用跨域知识进行化学数字图书馆的上下文敏感等级

获取原文
获取外文期刊封面目录资料

摘要

Today, entity-centric searches are common tasks for information gathering. But, due to the huge amount of available information the entity itself is often not sufficient for finding suitable results. Users are usually searching for entities in a specific search context which is important for their relevance assessment. Therefore, for digital library providers it is inevitable to also consider this search context to allow for high quality retrieval. In this paper we present an approach enabling context searches for chemical entities. Chemical entities play a major role in many specific domains, ranging from biomedical over biology to material science. Since most of the domain specific documents lack of suitable context annotations, we present a similarity measure using crossdomain knowledge gathered from Wikipedia. We show that structure-based similarity measures are not suitable for chemical context searches and introduce a similarity measure combining entity- and context similarity. Our experiments show that our measure outperforms structure-based similarity measures for chemical entities. We compare against two baseline approaches: a Boolean retrieval model and a model using statistical query expansion for the context term. We compared the measures computing mean average precision (MAP) using a set of queries and manual relevance assessments from domain experts. We were able to get a total increase of the MAP of 30% (from 31% to 61%). Furthermore, we show a personalized retrieval system which leads to another increase of around 10%.
机译:如今,以实体为中心的搜索是信息收集的常见任务。但是,由于巨额可用信息,实体本身通常不足以找到合适的结果。用户通常在特定的搜索上下文中搜索实体,这对其相关性评估很重要。因此,对于数字库提供商,也不可避免地考虑此搜索上下文以允许高质量的检索。在本文中,我们提出了一种能够对化学实体进行上下文搜索的方法。化学实体在许多具体领域发挥了重要作用,从生物医学的生物医学中对材料科学的影响。由于大多数域的具体文件缺乏合适的上下文注释,我们使用从维基百科收集的横域知识呈现相似度措施。我们表明,基于结构的相似度测量不适合于化学语境搜索并引入相似度测量组合实体和上下文相似度。我们的实验表明,我们的测量优于化学实体的基于结构的相似措施。我们与两种基线方法进行比较:使用统计查询扩展的布尔检索模型和模型,用于上下文术语。我们将使用来自域专家的一组查询和手动相关性评估来比较计算平均平均精度(MAP)的措施。我们能够总增加30%的地图(从31%到61%)。此外,我们显示了个性化的检索系统,导致另一个增加约10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号