首页> 外文会议>CIPS-SIGHAN joint conference on Chinese language processing >COV Model and its Application in Chinese Part-of-Speech Tagging
【24h】

COV Model and its Application in Chinese Part-of-Speech Tagging

机译:COV模型及其在中文词性标注中的应用

获取原文

摘要

This article presents a new sequence labeling model named Context Overlapping (COV) model, which expands observation from single word to n-gram unit and there is an overlapping part between the neighboring units. Due to the co-occurrence constraint and transition constraint, COV model reduces the search space and improves tagging accuracy. The 2-gram COV is applied to Chinese PoS tagging and the precision rate of the open test is as high as 96.83%, which is higher than the second order HMM, which is 95.73%. The result is also comparable to the discriminative models but COV takes much less training time than them. With symbol decoding COV prunes many nodes before statistics decoding and the search space of COV is about10-20% less than that of HMM.
机译:本文介绍了一个新的序列标记模型,称为上下文重叠(Context Overlapping)模型,该模型将观察范围从单个单词扩展到n-gram单元,并且相邻单元之间存在重叠部分。由于共现约束和过渡约束,COV模型减少了搜索空间并提高了标记准确性。 2克COV应用于中文PoS标签,开放测试的准确率高达96.83%,高于二阶HMM的95.73%。结果也可与判别模型相媲美,但COV所需的训练时间比它们少得多。使用符号解码时,COV在统计解码之前会修剪许多节点,并且COV的搜索空间比HMM小约10-20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号