首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Semi-Supervised Maximum Entropy Based Approach to Acronym and Abbreviation Normalization in Medical Texts
【24h】

Semi-Supervised Maximum Entropy Based Approach to Acronym and Abbreviation Normalization in Medical Texts

机译:基于半监督的基于校长的缩写和医学文本缩写标准化的方法

获取原文

摘要

Text normalization is an important aspect of successful information retrieval from medical documents such as clinical notes, radiology reports and discharge summaries. In the medical domain, a significant part of the general problem of text normalization is abbreviation and acronym disambiguation. Numerous abbreviations are used routinely throughout such texts and knowing their meaning is critical to data retrieval from the document. In this paper I will demonstrate a method of automatically generating training data for Maximum Entropy (ME) modeling of abbreviations and acronyms and will show that using ME modeling is a promising technique for abbreviation and acronym normalization. I report on the results of an experiment involving training a number of ME models used to normalize abbreviations and acronyms on a sample of 10,000 rheumatology notes with ~89% accuracy.
机译:文本规范化是从医疗文件中取消检索的重要信息,例如临床票据,放射学报告和排放摘要。在医疗领域,文本规范化的一般问题的一部分是缩写和缩写歧义。在整个这些文本中常规使用众多缩写,并且知道它们的含义对从文档中检索数据检索至关重要。在本文中,我将演示一种自动生成培训数据的方法,用于最大熵(ME)缩写和首字母缩略词的建模,并将显示使用ME建模是一种有希望的缩写和缩写标准化的技术。我报告了一个涉及培训许多ME模型的实验的结果,用于将缩写和缩略词正常化为10,000个风湿学的样本,精度为89%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号