【24h】

A Bootstrapping Approach to Symptom Entity Extraction on Chinese Electronic Medical Records

机译:中国电子病历中症状实体提取的自举方法

获取原文

摘要

Symptom entities are widely distributed in Chinese electronic medical records. Previous approaches on symptom entity extraction usually extract continuous strings as symptom entities and require massive human efforts on corpus annotation. We describe the symptom entity as two-tuples of and design a soft pattern matching method to locate them in sentences in the EMR. Our bootstrapping approach which only requires a few annotated symptom tuples and it allows iterative extraction from mass electronic medical record databases without human supervision. Furthermore, the described method annotates symptom entities in EMR by the extracted tuples. Starting with 60 annotated entities, our approach reached an F value of 81.40 % in the extraction task of 3,150 entities from 992 sets of electronic medical records.
机译:症状实体在中国电子病历中广泛分布。症状实体提取的先前方法通常将连续字符串提取为症状实体,并且需要大量的人力来进行语料注释。我们将症状实体描述为的二元组,并设计一种软模式匹配方法以将其定位在EMR中的句子中。我们的引导方法仅需要几个带注释的症状元组,并且允许从大量电子病历数据库中迭代提取而无需人工监督。此外,所描述的方法通过提取的元组注释EMR中的症状实体。从60个带注释的实体开始,在从992套电子病历中提取3150个实体的过程中,我们的方法的F值达到81.40%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号