首页> 外文期刊>Journal of Safety Research >Development and evaluation of a Naieve Bayesian model for coding causation of workers' compensation claims
【24h】

Development and evaluation of a Naieve Bayesian model for coding causation of workers' compensation claims

机译:Naieve贝叶斯模型的开发和评估,该模型用于编码工人赔偿要求的因果关系

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Introduction: Tracking and trending rates of injuries and illnesses classified as musculoskeletal disorders caused by ergonomic risk factors such as overexertion and repetitive motion (MSDs) and slips, trips, or falls (STFs) in different industry sectors is of high interest to many researchers. Unfortunately, identifying the cause of injuries and illnesses in large datasets such as workers' compensation systems often requires reading and coding the free form accident text narrative for potentially millions of records. Method: To alleviate the need for manual coding, this paper describes and evaluates a computer auto-coding algorithm that demonstrated the ability to code millions of claims quickly and accurately by learning from a set of previously manually coded claims. Conclusions: The auto-coding program was able to code claims as a musculoskeletal disorders, STF or other with approximately 90% accuracy. Impact on industry: The program developed and discussed in this paper provides an accurate and efficient method for identifying the causation of workers' compensation claims as a STF or MSD in a large database based on the unstructured text narrative and resulting injury diagnoses. The program coded thousands of claims in minutes. The method described in this paper can be used by researchers and practitioners to relieve the manual burden of reading and identifying the causation of claims as a STF or MSD. Furthermore, the method can be easily generalized to code/classify other unstructured text narratives.
机译:简介:在许多行业中,由人体工程学风险因素(例如过度劳累和重复运动(MSD)和滑倒,绊倒或跌倒(STF))引起的伤害和疾病归类为肌肉骨骼疾病的追踪和趋势趋势非常受关注。不幸的是,要在诸如工人补偿系统之类的大型数据集中发现伤害和疾病的原因,通常需要阅读和编码可能包含数百万条记录的自由格式事故文本说明。方法:为减轻对手动编码的需求,本文描述并评估了一种计算机自动编码算法,该算法展示了通过从一组先前手动编码的索赔中学习而快速而准确地对数百万个索赔进行编码的能力。结论:自动编码程序能够将声明编码为肌肉骨骼疾病,STF或其他,准确率约为90%。对行业的影响:本文开发和讨论的程序提供了一种准确有效的方法,可基于非结构化文本叙述和由此产生的伤害诊断,在大型数据库中识别出作为STF或MSD的工人赔偿要求的因果关系。该程序在几分钟之内就编码了数千个索赔。研究人员和从业人员可以使用本文所述的方法来减轻阅读和识别索赔为STF或MSD的因果关系的人工负担。此外,该方法可以容易地概括为对其他非结构化文本叙述进行编码/分类。

著录项

  • 来源
    《Journal of Safety Research》 |2012年第6期|327-332|共6页
  • 作者单位

    Industrywide Studies Branch, Division of Surveillance, Hazard Evaluations and Field Studies, The National Institute for Occupational Safety and Health, 4676 Columbia Parkway, R-15, Cincinnati, OH 45226, USA;

    National Institute for Occupational Safety and Health, Division of Surveillance, Hazard Evaluations, and Field Studies, Industrywide Studies Branch, 4676 Columbia Parkway, Cincinnati, OH 45226, USA;

    National Institute for Occupational Safety and Health, Division of Surveillance, Hazard Evaluations, and Field Studies, Industrywide Studies Branch, 4676 Columbia Parkway, Cincinnati, OH 45226, USA;

    National Institute for Occupational Safety and Health, Division of Safety Research, Analysis and Field Evaluations Branch, 1095 Wiilowdale Road, Morgantown, WV 26505, USA;

    Ohio Bureau of Workers' Compensation, Division of Safety & Hygiene, 13430 Yarmouth Drive, Pickerington, OH 43147, USA;

    Ohio Bureau of Workers' Compensation, Division of Safety & Hygiene, 13430 Yarmouth Drive, Pickerington, OH 43147, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    data-mining; text-mining; bayes; accident narratives; text classification;

    机译:数据挖掘;文本挖掘贝叶斯事故叙述;文字分类;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号