首页> 外文会议>Chinese Control and Decision Conference >Chinese Named Entity Recognition for Hazard And Operability Analysis Text
【24h】

Chinese Named Entity Recognition for Hazard And Operability Analysis Text

机译:危害性和可操作性分析文本的中文命名实体识别

获取原文

摘要

To solve the problem that it is difficult to identify the key Chinese entity information in the hazard and operability analysis text, a deep neural network model based on bidirectional long short-term memory and conditional random field (BiLSTM-CRF) is proposed to identify key named entities in the text. In the word vector pre-training process, bidirectional encoder representation from transformers (BERT) model is used to pre-train word vectors instead of the static word vectors in the traditional word2vec model, and then obtain context-related dynamic word vectors, to improve the representational ability of word vectors, and solve the problem of word boundary division when word vectors are used in Chinese corpus training. This model has the ability of complete the task of Chinese named entity recognition. The F1 value on the test corpus reaches 93.31%, which is 21.82% higher than conditional random field (CRF) Baseline, and 4.49% higher than the traditional BiLSTM-CRF model. The experimental results show that the BERT-BiLSTM-CRF model is effective for the named entity recognition (NER) task of the hazard and operability analysis text, and it is helpful to automatically extract the relationship between the entities in the hazard and operability analysis text and build safety analysis knowledge graph.
机译:针对危险性和可操作性分析文本中难以识别关键中文实体信息的问题,提出了一种基于双向长短期记忆和条件随机场(BiLSTM-CRF)的深度神经网络模型来识别关键。文本中的命名实体。在词向量预训练过程中,使用变压器模型(BERT)的双向编码器表示来代替传统word2vec模型中的静态词向量来对词向量进行预训练,然后获得与上下文相关的动态词向量,以进行改进解决了词向量的表示能力,解决了在汉语语料库训练中使用词向量时的词边界划分问题。该模型具有完成中文命名实体识别任务的能力。测试语料库的F1值达到93.31%,比条件随机场(CRF)基准高21.82%,比传统BiLSTM-CRF模型高4.49%。实验结果表明,BERT-BiLSTM-CRF模型对于危险性和可操作性分析文本中的命名实体识别(NER)任务是有效的,并且有助于自动提取危险性和可操作性分析文本中的实体之间的关系。并建立安全分析知识图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号