首页> 外文会议>IEEE International Conference on Anti-counterfeiting, Security, and Identification >Research on Named Entity Recognition for Science and Technology Terms in Chinese Based on Dependent Entity Word Vector
【24h】

Research on Named Entity Recognition for Science and Technology Terms in Chinese Based on Dependent Entity Word Vector

机译:基于依赖实体词向量的中文科技术语命名实体识别研究

获取原文

摘要

Most of the researches on Chinese named entity recognition (NER) focus on the general field, and few on NER in the field of science and technology. On one hand, technical terms in the field of science and technology appear in general texts less frequently, with most of which being compound words, and the performance of word segmentation processing on texts in the field of science and technology is poor. On the other hand, texts in the field of science and technology are more accurate and standardized than those in general fields. By analyzing these characteristics of texts in the field of science and technology, this paper attempts to train word vectors by constructing terminology dictionaries and introducing dependency analysis. Referring to the latest NER research results in the current Chinese general field, i.e. the method of merging character vectors with word vectors, we will perform NER on texts in the field of science and technology. Through experiments, it is proved that the proposed method is more effective comparing to existing works. In addition, the effect of introducing attention mechanism on NER results is also studied.
机译:大多数关于中国名为实体识别(NER)的研究专注于一般领域,少数人在科技领域。一方面,科学和技术领域的技术术语普遍频繁出现,大多数是复合词语,以及科学技术领域文本的文字分割处理的表现差。另一方面,科学技术领域的文本比一般领域的文本更准确和标准化。通过分析科学技术领域文本的这些特征,本文试图通过构建术语词典和引入依赖性分析来培训文字矢量。参考当前中国常规领域的最新NER​​研究结果,即用词向量合并字符向量的方法,我们将在科学和技术领域的文本上执行NER。通过实验,证明该方法比现有工程更有效。此外,还研究了引入注意力机制对NER结果的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号