首页> 外文学位 >Applying Natural Language Processing to Clinical Information Retrieval.
【24h】

Applying Natural Language Processing to Clinical Information Retrieval.

机译:将自然语言处理应用于临床信息检索。

获取原文
获取原文并翻译 | 示例

摘要

Medical literature, such as medical health records are increasingly digitised. As with any large growth of digital data, methods must be developed to manage data as well as to extract any important information. Information Retrieval (IR) techniques, for instance search engines, provide an intuitive medium in locating important information among large volumes of data. With more and more patient records being digitised, the use of search engines in a healthcare setting provides a highly promising method for efficiently overcoming the problem of information overload.;Traditional IR approaches often perform retrieval based solely using term frequency counts, known as a 'bag-of-words' approach. While these approaches are effective in certain settings they fail to account for more complex semantic relationships that are often more prevalent in medical literature such as negation (e.g. 'absence of palpitations'), temporality (e.g. 'previous admission for fracture') or attribution (e.g. 'Father is diabetic'), or even term dependencies ("colon cancer"). Furthermore, the high level of linguistic variation and synonymy found in clinical reports gives rise to issues of vocabulary mismatch whereby concepts in a document and query may be the same, however given differences in their textual representation relevant documents are missed e.g. hypertension and HNT. Given the high cost associated with errors in the medical domain, precise retrieval and reduction of errors is imperative.;Given the growing number of shared tasks in the domain of Clinical Natural Language Processing (NLP), this thesis investigates how best to integrate Clinical NLP technologies into a Clinical Information Retrieval workflow in order to enhance the search engine experience of healthcare professionals. To determine this we apply three current directions in Clinical NLP research to the retrieval task. First, we integrate a Medical Entity Recognition system, developed and evaluated on I2B2 datasets, achieving an f-score of 0.85. The second technique clarifies the Assertion Status of medical conditions by determining who is the actual experiencer of the medical condition in the report, its negation and its temporality. Standalone evaluations on I2B2 datasets, have seen the system achieve a micro f-score of 0.91. The final NLP technique applied is that of Concept Normalisation, whereby textual concepts are mapped to concepts in an ontology in order to avoid problems of vocabulary mismatch. While evaluation scores on the CLEF evaluation corpus are 0.509, this concept normalisation approach is shown in the thesis to be the most effective NLP approach of the three explored in aiding Clinical IR performance.
机译:医学文献,例如医学健康记录,日益数字化。与数字数据的大量增长一样,必须开发方法来管理数据以及提取任何重要信息。信息检索(IR)技术(例如搜索引擎)为在大量数据之间定位重要信息提供了一种直观的媒介。随着越来越多的患者记录被数字化,在医疗机构中使用搜索引擎提供了一种非常有前途的方法,可以有效地解决信息过载的问题。传统的IR方法通常仅基于术语频率计数来执行检索,这称为“言行一致的方法。尽管这些方法在某些情况下是有效的,但它们却无法解决更为复杂的语义关系,而这种语义关系在医学文献中通常更为普遍,例如否定(例如“心pal缺失”),暂时性(例如“骨折前期入院”)或归因(例如“父亲是糖尿病患者”,甚至是长期依存关系(“结肠癌”)。此外,在临床报告中发现的高水平的语言变化和同义词引起了词汇失配的问题,由此文档和查询中的概念可能是相同的,但是由于文本表达方面的差异而忽略了相关文档,例如。高血压和HNT。鉴于医学领域中与错误相关的高成本,必须准确地检索和减少错误。;鉴于临床自然语言处理(NLP)领域中共享任务的数量不断增加,本论文研究了如何最好地整合临床NLP技术整合到临床信息检索工作流程中,以增强医疗保健专业人员的搜索引擎体验。为了确定这一点,我们将临床NLP研究中的三个当前方向应用于检索任务。首先,我们集成了医疗实体识别系统,并在I2B2数据集上进行了开发和评估,得出f值为0.85。第二种技术通过确定谁是报告中实际的医疗状况经验者,其否定和暂时性来阐明医疗状况的断言状态。对I2B2数据集的独立评估显示,该系统的微f得分为0.91。最终使用的NLP技术是概念规范化技术,该技术将文本概念映射到本体中的概念,以避免出现词汇不匹配的问题。尽管CLEF评估语料库的评估得分为0.509,但本文中显示的这种概念归一化方法是在帮助临床IR性能方面探索的三种方法中最有效的NLP方法。

著录项

  • 作者

    Cogley, James.;

  • 作者单位

    University College Dublin (Ireland).;

  • 授予单位 University College Dublin (Ireland).;
  • 学科 Computer Science.;Information Science.
  • 学位 Ph.D.
  • 年度 2014
  • 页码 205 p.
  • 总页数 205
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号