首页> 外文期刊>Automation in construction >Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports
【24h】

Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports

机译:自动化内容分析以确保施工安全:一种自然语言处理系统,可从非结构性损伤报告中提取前体和结果

获取原文
获取原文并翻译 | 示例
           

摘要

In the United States like in many other countries throughout the globe, construction workers are more likely to be injured on the job than workers in any other industry. This poor safety performance is responsible for huge human and financial losses and has motivated extensive research. Unfortunately, safety improvement in construction has decelerated in the last decade and traditional safety programs have reached saturation. Yet major construction companies and federal agencies possess a wealth of empirical knowledge in the form of huge databases of digital construction injury reports. This knowledge could be used to better understand, predict, and prevent the occurrence of construction accidents. Unfortunately, due to the lack of a clear methodology and the high costs of manual large-scale content analysis, these valuable data have yet to be extracted and leveraged. Recently, researchers have proposed a framework allowing meaningful empirical data to be extracted from accident reports. However, the resource limitations inherent to manual content analysis still remain. The present study tested the proposition that manual content analysis of injury reports can be eliminated using natural language processing (NLP). This paper describes (1) the overall strategy and methodology used in developing the system, and specifically how key challenges with decoding unstructured reports were overcome; (2) how the system was built through an iterative process of coding and testing against manual content analysis results from a team of seven independent analysts; and (3) the implications and potential uses of the data extracted. The results indicate that the NLP system is capable of quickly and automatically scanning unstructured injury reports for 101 attributes and outcomes with over 95% accuracy. The main contribution of this research is to empower any organization to quickly obtain a large and highly reliable structured attribute and outcome data set from their databases of unstructured accident reports. Such structured data are a necessary prerequisite to the application of statistical modeling techniques, allowing the extraction of new safety knowledge and finally the amelioration of safety management. (C) 2015 Elsevier B.V. All rights reserved.
机译:与全球其他许多国家一样,在美国,建筑工人比其他任何行业的工人在工作中受伤的可能性更高。这种不良的安全性能会造成巨大的人力和财力损失,并引发了广泛的研究。不幸的是,在过去的十年中,建筑安全改进的步伐有所放缓,传统的安全计划已达到饱和。然而,大型建筑公司和联邦机构以庞大的数字化建筑伤害报告数据库的形式拥有丰富的经验知识。这些知识可以用来更好地理解,预测和预防施工事故的发生。不幸的是,由于缺乏清晰的方法和手动进行大规模内容分析的高昂成本,这些有价值的数据尚未被提取和利用。最近,研究人员提出了一个框架,允许从事故报告中提取有意义的经验数据。但是,手动内容分析固有的资源限制仍然存在。本研究测试了这样的主张,即可以使用自然语言处理(NLP)消除对伤害报告的手动内容分析。本文描述(1)开发系统时使用的总体策略和方法,尤其是如何克服解码非结构化报告的关键挑战; (2)如何通过一个由七个独立分析师组成的团队对手工内容分析结果进行编码和测试的迭代过程来构建系统; (3)提取数据的含义和潜在用途。结果表明,NLP系统能够快速自动扫描非结构性损伤报告中101个属性和结果,准确率超过95%。这项研究的主要贡献是使任何组织都能从其非结构化事故报告数据库中快速获取大型且高度可靠的结构化属性和结果数据集。这样的结构化数据是应用统计建模技术的必要先决条件,从而可以提取新的安全知识,并最终改善安全管理。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号