首页> 美国卫生研究院文献>Journal of Digital Imaging >Analyzing Medical Image Search Behavior: Semantics and Prediction of Query Results
【2h】

Analyzing Medical Image Search Behavior: Semantics and Prediction of Query Results

机译:分析医学图像搜索行为:查询结果的语义和预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Log files of information retrieval systems that record user behavior have been used to improve the outcomes of retrieval systems, understand user behavior, and predict events. In this article, a log file of the ARRS GoldMiner search engine containing 222,005 consecutive queries is analyzed. Time stamps are available for each query, as well as masked IP addresses, which enables to identify queries from the same person. This article describes the ways in which physicians (or Internet searchers interested in medical images) search and proposes potential improvements by suggesting query modifications. For example, many queries contain only few terms and therefore are not specific; others contain spelling mistakes or non-medical terms that likely lead to poor or empty results. One of the goals of this report is to predict the number of results a query will have since such a model allows search engines to automatically propose query modifications in order to avoid result lists that are empty or too large. This prediction is made based on characteristics of the query terms themselves. Prediction of empty results has an accuracy above 88 %, and thus can be used to automatically modify the query to avoid empty result sets for a user. The semantic analysis and data of reformulations done by users in the past can aid the development of better search systems, particularly to improve results for novice users. Therefore, this paper gives important ideas to better understand how people search and how to use this knowledge to improve the performance of specialized medical search engines.
机译:记录用户行为的信息检索系统的日志文件已用于改善检索系统的结果,了解用户行为并预测事件。本文分析了包含222,005个连续查询的ARRS GoldMiner搜索引擎的日志文件。时间戳可用于每个查询以及被屏蔽的IP地址,从而可以识别同一个人的查询。本文介绍了医生(或对医学图像感兴趣的Internet搜索者)进行搜索的方式,并通过建议查询修改来提出潜在的改进建议。例如,许多查询只包含很少的术语,因此不是特定的。其他包含拼写错误或非医学术语,可能会导致不良或空洞的结果。此报告的目标之一是预测查询将要获得的结果数,因为这种模型允许搜索引擎自动提出查询修改建议,以避免结果列表为空或太大。该预测是基于查询字词本身的特征进行的。空结果的预测精度超过88%,因此可以用于自动修改查询以避免用户空结果集。过去用户进行的语义分析和重新安排的数据可以帮助开发更好的搜索系统,尤其是可以改善新手用户的搜索结果。因此,本文提出了重要的想法,以更好地了解人们如何进行搜索以及如何使用这些知识来提高专业医学搜索引擎的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号