首页> 外文期刊>Journal of Construction Engineering and Management >Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing
【24h】

Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing

机译:使用自然语言处理的命名实体识别自动施工规范审查

获取原文
获取原文并翻译 | 示例
       

摘要

When bidding on construction projects, contractors need to understand the specifications properly to manage project risks. However, specifications are mainly analyzed based on human cognitive abilities, which can take considerable time and can lead to errors due to misunderstanding. While efforts have been made to automate this process, that the existing academic efforts to automate the process have limitations. To develop an automated specification reviewing model applicable to various kinds of specifications, the authors propose information extraction frameworks consisting of five categories. In addition, a named entity recognition (NER) model was developed based on bidirectional long short-term memory architecture to extract information from text data automatically. The data set for model development comprised 56 construction specifications, which included a total of 4,659 sentences labeled according to five categories of information. Word2Vec was utilized to aconvert labeled text data to the form of numeric vectors to be input into the NER model. The NER model successfully assigned every word in the testing data to an appropriate category with a satisfactory performance of 0.919 precision and 0.914 recall. These results contribute to the automation of the construction specification review process. Although this research focused on road construction projects, the proposed information extraction framework can be applied to other types of construction projects.
机译:在竞标建筑项目时,承包商需要了解管理项目风险的规范。然而,规范主要根据人类认知能力分析,这可能需要相当长的时间,并且可能导致由于误解而导致错误。虽然已经进行了自动化这一过程的努力,但现有的自动化过程的学术努力有局限性。为了开发适用于各种规格的自动规范审查模式,作者提出了由五个类别组成的信息提取框架。此外,基于双向长期内存架构开发了命名实体识别(NER)模型,以自动从文本数据中提取信息。用于模型开发的数据集包括56个施工规范,其中总共包括4,659个句子根据五类信息标记。 Word2VEC用于将标记的文本数据视为要输入的数字向量的形式,以输入NER模型。 NER模型成功将测试数据中的每个单词分配给适当的类别,令人满意的性能为0.919精度和0.914召回。这些结果有助于建设规范审查过程的自动化。虽然这项研究专注于道路建设项目,但建议的信息提取框架可应用于其他类型的建筑项目。

著录项

  • 来源
    《Journal of Construction Engineering and Management》 |2021年第1期|04020147.1-04020147.12|共12页
  • 作者单位

    Dept. of Civil and Environmental Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Gu Seoul 08826 Republic of Korea Institute of Construction and Environmental Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Gu Seoul 08826 Republic of Korea;

    Dept. of Civil and Environmental Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Gu Seoul 08826 Republic of Korea;

    Dept. of Civil and Environment Engineering Seoul National Univ. 1 Gwanak-Ro Gwanak-Ku Seoul 08826 Republic of Korea Institute of Construction and Environmental Engineering 1 Gwanak-Ro Gwanak-Ku Seoul 08826 Republic of Korea;

    Smart Construction Team Daewoo E&C 20 Suil-Ro 123beon-Gil Jangan-Gu Suwon 16297 Republic of Korea;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号