首页> 外文会议>Data Mining and Big Data >Extraction of Temporal Events from Clinical Text Using Semi-supervised Conditional Random Fields
【24h】

Extraction of Temporal Events from Clinical Text Using Semi-supervised Conditional Random Fields

机译:使用半监督条件随机场从临床文本中提取时间事件

获取原文
获取原文并翻译 | 示例

摘要

The huge amount of clinical text in Electronic Medical Records (EMRs) has opened a stage for text processing and information extraction for healthcare and medical research. Extracting temporal information in clinical text is much more difficult than the newswire text due to implicit expression of temporal information, domain-specific nature and lack of writing quality, among others. Despite of these constraints, some of the existing works established rule-based, machine learning and hybrid methods to extract temporal information with the help of annotated corpora. However obtaining the annotated corpora is costly, time consuming and requires much manual effort and thus their small size inevitably affects the processing quality. Motivated by this fact, in this work we propose a novel two-stage semi-supervised framework to exploit the abundant unannotated clinical text to automatically detect temporal events and then subsequently improve the stability and the accuracy of temporal event extraction. We trained and evaluated our semi-supervised model with the selected features of testing dataset, resulting F-measure of 89.76% for event extraction.
机译:电子病历(EMR)中大量的临床文本为医疗和医学研究的文本处理和信息提取打开了一个舞台。由于时态信息的隐含表达,特定领域的性质以及缺乏写作质量,因此在临床文本中提取时态信息比新闻通讯社文本困难得多。尽管有这些限制,一些现有的工作还是建立了基于规则的,机器学习和混合方法,以在带注释的语料库的帮助下提取时间信息。然而,获得带注释的语料库是昂贵的,费时的并且需要大量的人工,因此它们的小尺寸不可避免地影响处理质量。基于这一事实,在这项工作的推动下,我们提出了一种新颖的两阶段半监督框架,以利用大量未注释的临床文本来自动检测时间事件,然后提高时间事件提取的稳定性和准确性。我们使用测试数据集的选定功能训练和评估了我们的半监督模型,得出事件提取的F度量为89.76%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号