首页> 外文会议>IEEE International Conference on Software Engineering and Service Science >Chinese Named Entity Recognition Based on CNN-BiLSTM-CRF
【24h】

Chinese Named Entity Recognition Based on CNN-BiLSTM-CRF

机译:基于CNN-Bilstm-CRF的中国名称实体识别

获取原文

摘要

Named Entity Recognition (NER) is an important basic task in natural language processing (NLP). In recent years, the method of word representations enhancement by character embedding has significantly enhanced the effect of entity recognition. However, this kind of character embedding method only works on alphabetic spelling words such as English, and the same method is not suitable for Chinese. Aiming at the inherent characteristics of Chinese as morpheme writing, we propose a novel neural network model based on CNN-BiLSTM-CRF in this paper. Convolution neural network (CNN) extracts the glyph embeddings with morphological features from each Chinese character, which are concatenated with the character embeddings with semantic feature information and fed to the BiLSTM-CRF network. We evaluate our model on the third SIGHAN Bakeoff MSRA dataset for simplified Chinese NER task. The experimental results show that our model reaches 91.09% in F-scores which does not rely on the hand-designed features and domain knowledge.
机译:命名实体识别(ner)是自然语言处理中的重要基本任务(NLP)。近年来,字符嵌入方式增强的文字表示的方法显着提高了实体识别的影响。但是,这种字符嵌入方法仅适用于诸如英语之类的字母拼写词,而相同的方法不适合中文。旨在汉语作为语素写作的固有特征,我们提出了一种基于CNN-Bilstm-CRF的新型神经网络模型。卷积神经网络(CNN)提取具有来自每个汉字的形态特征的字形嵌入,这些特征与具有语义特征信息的字符嵌入的字符嵌入并馈送到Bilstm-CRF网络。我们在第三个Sighan Bakeoff Msra DataSet上评估我们的模型,用于简体中文。实验结果表明,我们的模型在F分数达到91.09%,不依赖于手工设计的功能和域知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号