Conference proceedings: ASIST Annual Meeting

Exploring the Use of Natural Language Systems for Fact Identification: Towards the Automatic Construction of Healthcare Portals


Abstract

In prior work we observed that expert searchers follow well-defined search procedures in order to obtain comprehensive information on the Web. Motivated by that observation, we developed a prototype domain portal called the Strategy Hub that provides expert search procedures to benefit novice searchers. The search procedures in the prototype were entirely handcrafted by search experts, making further expansion of the Strategy Hub cost-prohibitive. However, a recent study on the distribution of healthcare information on the Web suggested that search procedures can be automatically generated from pages that have been rated based on the extent to which they cover facts relevant to a topic. This paper presents the results of experiments designed to automate the process of rating the extent to which a page covers relevant facts. To automatically generate these ratings, we used two natural language systems, Latent Semantic Analysis and MEAD, to compute the similarity between sentences on the page and each fact. We then used an algorithm to convert these similarity scores to a single rating that represents the extent to which the page covered each fact. We compared these automatic ratings with manual ratings using inter-rater reliability statistics. Analysis of these statistics reveals the strengths and weaknesses of each tool, and suggests avenues for improvement.
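The pipeline described in the abstract (sentence-to-fact similarity, conversion to a single rating, comparison with manual ratings) can be illustrated with a minimal sketch. This is not the authors' method: plain bag-of-words cosine similarity stands in for Latent Semantic Analysis and MEAD, the rule "take the best-matching sentence and bucket it with two thresholds" is an assumed conversion algorithm, the threshold values and the 0 to 2 rating scale are hypothetical, and Cohen's kappa is only one possible inter-rater reliability statistic (the abstract does not name the one used). All function names are illustrative.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def rate_fact(page_sentences, fact, thresholds=(0.3, 0.6)):
    """Convert sentence-to-fact similarities into one rating (0, 1, or 2).

    Assumption: the page's coverage of a fact is judged by its single
    best-matching sentence; the thresholds are hypothetical.
    """
    fact_vec = Counter(tokenize(fact))
    best = max(
        (cosine(Counter(tokenize(s)), fact_vec) for s in page_sentences),
        default=0.0,
    )
    # Count how many thresholds the best similarity reaches.
    return sum(best >= t for t in thresholds)


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa: chance-corrected agreement between two raters."""
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(
        count_a[k] * count_b[k] for k in count_a.keys() | count_b.keys()
    ) / (n * n)
    # If both raters always use the same single category, agreement is trivial.
    return 1.0 if expected == 1 else (observed - expected) / (1 - expected)


if __name__ == "__main__":
    page = ["Melanoma is a type of skin cancer.", "Wear sunscreen outdoors."]
    facts = ["Melanoma is a type of skin cancer",
             "Chemotherapy has side effects"]
    automatic = [rate_fact(page, f) for f in facts]
    manual = [2, 0]  # made-up manual ratings, for illustration only
    print(automatic, cohen_kappa(automatic, manual))
```

Taking the maximum over sentences treats a fact as covered if any one sentence states it; averaging instead would penalize long pages that mention the fact only once. Either choice is a design assumption rather than something the abstract specifies.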