【24h】

Identification of Occupation Mentions in Clinical Narratives

机译:临床叙事中职业提及的识别

获取原文

摘要

A patient's occupation is an important variable used for disease surveillance and modeling, but such information is often only available in free-text clinical narratives. We have developed a large occupation dictionary that is used as part of both knowledge- (dictionary and rules) and data-driven (machine-learning) methods for the identification of occupation mentions. We have evaluated the approaches on both public and non-public clinical datasets. A machine-learning method using linear chain conditional random fields trained on minimalistic set of features achieved up to 88 % F_1-measure (token-level), with the occupation feature derived from the knowledge-driven method showing a notable positive impact across the datasets (up to additional 32 % F_1 -measure).
机译:患者的职业是用于疾病监测和建模的重要变量,但是此类信息通常只能在自由文本的临床叙述中获得。我们已经开发了一个大型的职业词典,既可以用作知识(词典和规则),又可以用作数据驱动(机器学习)方法的一部分,以识别职业。我们已经评估了公共和非公共临床数据集上的方法。使用在最小特征集上训练的线性链条件随机场的机器学习方法可实现高达88%的F_1测度(令牌级别),而从知识驱动方法得出的职业特征在整个数据集中显示出显着的积极影响(最多32%的F_1-测量)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号