首页> 外文会议>2018 International Conference on Artificial Intelligence and Big Data >An improved embedding matching model for Chinese word segmentation
【24h】

An improved embedding matching model for Chinese word segmentation

机译:一种改进的中文分词嵌入匹配模型

获取原文
获取原文并翻译 | 示例

摘要

To date, various deep neural network based models have been extensively applied in Chinese word segmentation (CWS) task, however, some of these models either consume so much time to train, or perform weakly without additional dictionary or training corpus. In this paper, we proposed an improved embedding matching model for CWS, which achieved 0.4% and 0.5% improvement of F1-Measure respectively on PKU and MSR dataset compared to the original model. Also, our proposed model achieved 0.77% improvement of F1-Measure and 2.59% improvement of weighted F1-Measure on the NLPCC dataset. The improved model outperforms most of previous models on the F1 measure of segmentation performance, and consumes less time to train and test. After improvement, the model can be a better choice in practical use for its exceedingly fast convergence and excellent performance.
机译:迄今为止,各种基于深度神经网络的模型已广泛应用于中文分词(CWS)任务,但是,其中一些模型要么花费大量时间进行训练,要么在没有附加词典或训练语料库的情况下表现不佳。本文提出了一种改进的CWS嵌入匹配模型,与原始模型相比,在PKU和MSR数据集上F1-Measure的改进分别为0.4 \%和0.5 \%。同样,我们提出的模型在NLPCC数据集上实现了F1-Measure的0.77%的改进和加权F1-Measure的2.59%的改进。改进后的模型在细分效果的F1方面优于大多数以前的模型,并且花费更少的时间进行训练和测试。经过改进后,该模型具有极快的收敛速度和出色的性能,因此在实际使用中可能是更好的选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号