Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing

Seonghyeon Moon; Gitaek Lee; Seokho Chi; Hyunchul Oh

首页> 外文期刊>Journal of Construction Engineering and Management >Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing

【24h】

Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing

机译：使用自然语言处理的命名实体识别自动施工规范审查

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

When bidding on construction projects, contractors need to understand the specifications properly to manage project risks. However, specifications are mainly analyzed based on human cognitive abilities, which can take considerable time and can lead to errors due to misunderstanding. While efforts have been made to automate this process, that the existing academic efforts to automate the process have limitations. To develop an automated specification reviewing model applicable to various kinds of specifications, the authors propose information extraction frameworks consisting of five categories. In addition, a named entity recognition (NER) model was developed based on bidirectional long short-term memory architecture to extract information from text data automatically. The data set for model development comprised 56 construction specifications, which included a total of 4,659 sentences labeled according to five categories of information. Word2Vec was utilized to aconvert labeled text data to the form of numeric vectors to be input into the NER model. The NER model successfully assigned every word in the testing data to an appropriate category with a satisfactory performance of 0.919 precision and 0.914 recall. These results contribute to the automation of the construction specification review process. Although this research focused on road construction projects, the proposed information extraction framework can be applied to other types of construction projects.

机译：在竞标建筑项目时，承包商需要了解管理项目风险的规范。然而，规范主要根据人类认知能力分析，这可能需要相当长的时间，并且可能导致由于误解而导致错误。虽然已经进行了自动化这一过程的努力，但现有的自动化过程的学术努力有局限性。为了开发适用于各种规格的自动规范审查模式，作者提出了由五个类别组成的信息提取框架。此外，基于双向长期内存架构开发了命名实体识别（NER）模型，以自动从文本数据中提取信息。用于模型开发的数据集包括56个施工规范，其中总共包括4,659个句子根据五类信息标记。 Word2VEC用于将标记的文本数据视为要输入的数字向量的形式，以输入NER模型。 NER模型成功将测试数据中的每个单词分配给适当的类别，令人满意的性能为0.919精度和0.914召回。这些结果有助于建设规范审查过程的自动化。虽然这项研究专注于道路建设项目，但建议的信息提取框架可应用于其他类型的建筑项目。

著录项

来源
《Journal of Construction Engineering and Management》 |2021年第1期|04020147.1-04020147.12|共12页
作者
Seonghyeon Moon; Gitaek Lee; Seokho Chi; Hyunchul Oh;
展开▼
作者单位

Dept. of Civil and Environmental Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Gu Seoul 08826 Republic of Korea Institute of Construction and Environmental Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Gu Seoul 08826 Republic of Korea;

Dept. of Civil and Environmental Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Gu Seoul 08826 Republic of Korea;

Dept. of Civil and Environment Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Ku Seoul 08826 Republic of Korea Institute of Construction and Environmental Engineering 1 Gwanak-Ro Gwanak-Ku Seoul 08826 Republic of Korea;

Smart Construction Team Daewoo E&C 20 Suil-Ro 123beon-Gil Jangan-Gu Suwon 16297 Republic of Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements. [J] . Todd Lingren, Louise Deleger, Katalin Molnar, Journal of the American Medical Informatics Association : . 2014,第3期

机译：评估预批注对批注速度和潜在偏见的影响：在临床试验公告中为临床命名实体识别开发自然语言处理黄金标准。
2. Bidirectional LSTM joint model for intent classification and named entity recognition in natural language understanding [J] . Varghese Akson Sam, Sarang Saleha, Yadav Vipul, International Journal of Hybrid Intelligent Systems . 2020,第1期

机译：用于自然语言理解的意图分类和命名实体识别的双向LSTM联合模型
3. Mixed deep learning and natural language processing method for fake-food image recognition and standardization to help automated dietary assessment [J] . SimonMezgec, TomeEftimov, TamaraBucher, Public Health Nutrition . 2019,第7期

机译：用于伪食物图像识别和标准化的混合深度学习和自然语言处理方法，以帮助自动饮食评估
4. Identifying Motion Entities in Natural Language and A Case Study for Named Entity Recognition [C] . Ngoc Phuoc An Vo, Irene Manotas, Vadim Sheinin, International Conference on Computational Linguistics . 2020

机译：识别自然语言中的运动实体以及命名实体识别的案例研究
5. An Application of Natural Language Processing: Named Entity Recognition with BLSTM in Chinese Corpora [D] . Mao, Lihui 2019

机译：自然语言处理的应用：BLSTM在中文语料库中的命名实体识别
6. Evaluating the impact of pre-annotation on annotation speed and potential bias: natural language processing gold standard development for clinical named entity recognition in clinical trial announcements [O] . Todd Lingren, Louise Deleger, Katalin Molnar, 2014

机译：评估预批注对批注速度和潜在偏见的影响：在临床试验公告中为自然语言处理金标准开发的临床命名实体识别
7. NER-FL: A Novel Named Entity Recognizer of Farsi Language using the Web-Based Natural Language Processors and Semantic Annotations [O] . Babak Farhadi 2015

机译：NER-FL：使用基于Web的自然语言处理器和语义注释的波斯语语言的新命名实体识别器

Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing

摘要

著录项

相似文献

相关主题

期刊订阅