首页> 美国卫生研究院文献>Database: The Journal of Biological Databases and Curation >Baseline and extensions approach to information retrieval of complex medical data: Poznans approach to the bioCADDIE 2016
【2h】

Baseline and extensions approach to information retrieval of complex medical data: Poznans approach to the bioCADDIE 2016

机译:复杂医学数据信息检索的基线和扩展方法:波兹南的bioCADDIE 2016方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Information retrieval from biomedical repositories has become a challenging task because of their increasing size and complexity. To facilitate the research aimed at improving the search for relevant documents, various information retrieval challenges have been launched. In this article, we present the improved medical information retrieval systems designed by Poznan University of Technology and Poznan University of Medical Sciences as a contribution to the bioCADDIE 2016 challenge—a task focusing on information retrieval from a collection of 794 992 datasets generated from 20 biomedical repositories. The system developed by our team utilizes the Terrier 4.2 search platform enhanced by a query expansion method using word embeddings. This approach, after post-challenge modifications and improvements (with particular regard to assigning proper weights for original and expanded terms), allowed us achieving the second best infNDCG measure (0.4539) compared with the challenge results and infAP 0.3978. This demonstrates that proper utilization of word embeddings can be a valuable addition to the information retrieval process. Some analysis is provided on related work involving other bioCADDIE contributions. We discuss the possibility of improving our results by using better word embedding schemes to find candidates for query expansion. >Database URL:
机译:由于其不断增加的规模和复杂性,从生物医学存储库中检索信息已成为一项具有挑战性的任务。为了促进旨在改善相关文件搜索的研究,已经提出了各种信息检索挑战。在本文中,我们介绍了波兹南工业大学和波兹南医科大学设计的改进的医学信息检索系统,以应对bioCADDIE 2016挑战—该任务的重点是从20种生物医学产生的794 992个数据集中收集信息仓库。我们团队开发的系统利用Terrier 4.2搜索平台,该平台通过使用单词嵌入的查询扩展方法进行了增强。经过挑战后的修改和改进(特别是为原始词条和扩展词条分配适当的权重),这种方法使我们获得了比挑战结果和infAP 0.3978更好的infNDCG度量(0.4539)。这表明正确利用单词嵌入可以成为信息检索过程的宝贵补充。对涉及其他bioCADDIE贡献的相关工作进行了一些分析。我们讨论了通过使用更好的单词嵌入方案来查找查询扩展候选者来改善结果的可能性。 >数据库网址

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号