首页> 外文会议>International conference on advances in swarm intelligence >Improving Chinese Named Entity Recognition with Semantic Information of Character Multi-position Representation
【24h】

Improving Chinese Named Entity Recognition with Semantic Information of Character Multi-position Representation

机译:利用字符多位置表示的语义信息改进中文命名实体的识别

获取原文

摘要

Named entity recognition is an important basic task for information extraction and construction of knowledge graph, but the recognition rate needs to be further improved, especially in Chinese. There are two main implementations based on the sequence tagging method. Among them, the character-based method lacks the support of word information, and the word-based method is affected by the word segmentation efficiency. In order to comprehensively utilize the information of characters and words and to reflect the semantic information that changes due to different combinations of characters and words in a sentence. We designed a tagging scheme based on word segmentation and dictionaries. Then, neural networks are used for learning multi-position feature vectors and character-based tagging task. Experiments with MSRA datasets show that this method outperforms word-based and character-based baselines and achieves a higher recall rate compared to other methods.
机译:命名实体识别是信息提取和知识图的构建的重要基础任务,但是识别率有待进一步提高,尤其是中文。基于序列标记方法的主要实现有两种。其中,基于字符的方法缺乏单词信息的支持,并且基于单词的方法受分词效率的影响。为了全面利用字符和单词的信息并反映由于字符和单词的不同组合而在句子中发生变化的语义信息。我们设计了基于分词和词典的标记方案。然后,将神经网络用于学习多位置特征向量和基于字符的标记任务。与MSRA数据集进行的实验表明,与其他方法相比,该方法优于基于单词和基于字符的基线,并且具有更高的召回率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号