Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports

Tixier Antoine J. -P.; Hallowell Matthew R.; Rajagopalan Balaji; Bowman Dean

首页> 外文期刊>Automation in construction >Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports

【24h】

Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports

机译：自动化内容分析以确保施工安全：一种自然语言处理系统，可从非结构性损伤报告中提取前体和结果

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the United States like in many other countries throughout the globe, construction workers are more likely to be injured on the job than workers in any other industry. This poor safety performance is responsible for huge human and financial losses and has motivated extensive research. Unfortunately, safety improvement in construction has decelerated in the last decade and traditional safety programs have reached saturation. Yet major construction companies and federal agencies possess a wealth of empirical knowledge in the form of huge databases of digital construction injury reports. This knowledge could be used to better understand, predict, and prevent the occurrence of construction accidents. Unfortunately, due to the lack of a clear methodology and the high costs of manual large-scale content analysis, these valuable data have yet to be extracted and leveraged. Recently, researchers have proposed a framework allowing meaningful empirical data to be extracted from accident reports. However, the resource limitations inherent to manual content analysis still remain. The present study tested the proposition that manual content analysis of injury reports can be eliminated using natural language processing (NLP). This paper describes (1) the overall strategy and methodology used in developing the system, and specifically how key challenges with decoding unstructured reports were overcome; (2) how the system was built through an iterative process of coding and testing against manual content analysis results from a team of seven independent analysts; and (3) the implications and potential uses of the data extracted. The results indicate that the NLP system is capable of quickly and automatically scanning unstructured injury reports for 101 attributes and outcomes with over 95% accuracy. The main contribution of this research is to empower any organization to quickly obtain a large and highly reliable structured attribute and outcome data set from their databases of unstructured accident reports. Such structured data are a necessary prerequisite to the application of statistical modeling techniques, allowing the extraction of new safety knowledge and finally the amelioration of safety management. (C) 2015 Elsevier B.V. All rights reserved.

机译：与全球其他许多国家一样，在美国，建筑工人比其他任何行业的工人在工作中受伤的可能性更高。这种不良的安全性能会造成巨大的人力和财力损失，并引发了广泛的研究。不幸的是，在过去的十年中，建筑安全改进的步伐有所放缓，传统的安全计划已达到饱和。然而，大型建筑公司和联邦机构以庞大的数字化建筑伤害报告数据库的形式拥有丰富的经验知识。这些知识可以用来更好地理解，预测和预防施工事故的发生。不幸的是，由于缺乏清晰的方法和手动进行大规模内容分析的高昂成本，这些有价值的数据尚未被提取和利用。最近，研究人员提出了一个框架，允许从事故报告中提取有意义的经验数据。但是，手动内容分析固有的资源限制仍然存在。本研究测试了这样的主张，即可以使用自然语言处理（NLP）消除对伤害报告的手动内容分析。本文描述（1）开发系统时使用的总体策略和方法，尤其是如何克服解码非结构化报告的关键挑战；（2）如何通过一个由七个独立分析师组成的团队对手工内容分析结果进行编码和测试的迭代过程来构建系统；（3）提取数据的含义和潜在用途。结果表明，NLP系统能够快速自动扫描非结构性损伤报告中101个属性和结果，准确率超过95％。这项研究的主要贡献是使任何组织都能从其非结构化事故报告数据库中快速获取大型且高度可靠的结构化属性和结果数据集。这样的结构化数据是应用统计建模技术的必要先决条件，从而可以提取新的安全知识，并最终改善安全管理。（C）2015 Elsevier B.V.保留所有权利。

著录项

来源
《Automation in construction》 |2016年第2期|45-56|共12页
作者
Tixier Antoine J. -P.; Hallowell Matthew R.; Rajagopalan Balaji; Bowman Dean;
展开▼
作者单位

Univ Colorado, Dept Civil Environm & Architectural Engn, Boulder, CO 80309 USA;

Univ Colorado, Dept Civil Environm & Architectural Engn, Boulder, CO 80309 USA;

Univ Colorado, Dept Civil Environm & Architectural Engn, CIRES, Boulder, CO 80309 USA;

Bentley Syst, Exton, PA USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Automated content analysis; Natural language processing; Text mining; Knowledge extraction; Accident; Injury; Safety; Attribute; Risk; R;

机译：自动化内容分析;自然语言处理;文本挖掘;知识提取;事故;伤害;安全性;属性;风险;R;

相似文献

外文文献
中文文献
专利

1. Discerning Tumor Status from Unstructured MRI Reports—Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing [J] . Lionel T. E. Cheng, Jiaping Zheng, Guergana K. Savova, Journal of Digital Imaging . 2010,第2期

机译：从非结构化MRI报告中识别肿瘤状态-现有报告中信息的完整性和自动自然语言处理的实用性
2. Towards automated generation of curated datasets in radiology: Application of natural language processing to unstructured reports exemplified on CT for pulmonary embolism [J] . European Journal of Radiology . 2020,第期

机译：在放射学自动化的策划数据集中：自然语言处理在肺栓塞CT中举例说明的非结构化报告
3. Extracting Diagnostic Data from Unstructured Bone Marrow Biopsy Reports of Myeloid Neoplasms Utilizing a Customized Natural Language Processing (NLP) Algorithm [J] . Kunz Isaac, Peddinti Ananth, Nguyen Tina, Blood: The Journal of the American Society of Hematology . 2018,第NovaelaTreatmentAppr期

机译：利用定制的自然语言处理（NLP）算法，从非结构化骨髓活检报告中提取骨髓肿瘤的诊断数据
4. Automated Classification of Radiology Reports for Acute Lung Injury: Comparison of Keyword and Machine Learning Based Natural Language Processing Approaches [C] . Imre Solti, Colin R. Cooke, Fei Xia, IEEE International Conference on Bioinformatics and Biomedicine Workshop . 2009

机译：急性肺损伤的放射学报告的自动分类：基于关键字和机器的自然语言处理方法的比较
5. Leveraging unstructured construction injury reports to predict safety outcomes and model safety risk using Natural Language Processing, Machine Learning, and probability theory [D] . Tixier, Antoine Jean-Pierre. 2015

机译：利用非结构化施工损伤报告以预测使用自然语言处理，机器学习和概率理论来预测安全结果和模型安全风险
6. Discerning Tumor Status from Unstructured MRI Reports—Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing [O] . Lionel T. E. Cheng, Jiaping Zheng, Guergana K. Savova, 2010

机译：从非结构化MRI报告中识别肿瘤状态-现有报告中信息的完整性和自动自然语言处理的实用性
7. Discerning Tumor Status from Unstructured MRI Reports—Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing [O] . Lionel T. E. Cheng, Jiaping Zheng, Guergana K. Savova, 2009

机译：从非结构化MRI报告中识别肿瘤状态-现有报告中信息的完整性和自动自然语言处理的实用性
8. Security Classification Using Automated Learning (SCALE): Optimizing Statistical Natural Language Processing Techniques to Assign Security Labels to Unstructured Text [R] . Brown, J. D., Charlebois, D. 2010

机译：使用自动学习的安全性分类（sCaLE）：优化统计自然语言处理技术，将安全标签分配给非结构化文本

Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports

摘要

著录项

相似文献

相关主题

期刊订阅