首页> 外文会议>CIPS-SIGHAN joint conference on Chinese language processing >COV Model and its Application in Chinese Part-of-Speech Tagging
【24h】

COV Model and its Application in Chinese Part-of-Speech Tagging

机译:COV模型及其在中国术语标签中的应用

获取原文

摘要

This article presents a new sequence labeling model named Context Overlapping (COV) model, which expands observation from single word to n-gram unit and there is an overlapping part between the neighboring units. Due to the co-occurrence constraint and transition constraint, COV model reduces the search space and improves tagging accuracy. The 2-gram COV is applied to Chinese PoS tagging and the precision rate of the open test is as high as 96.83%, which is higher than the second order HMM, which is 95.73%. The result is also comparable to the discriminative models but COV takes much less training time than them. With symbol decoding COV prunes many nodes before statistics decoding and the search space of COV is about10-20% less than that of HMM.
机译:本文介绍了一个名为Context Rocking(CoV)模型的新序列标记模型,该模型将从单个单词扩展到N-GRAM单元的观察,并且在相邻单元之间存在重叠部分。由于共发生约束和转换约束,COV模型减少了搜索空间并提高了标记精度。将2克COV应用于中国POS标记,开放式测试的精度率高达96.83%,高于二阶迁移率,为95.73%。结果也与鉴别模型相当,但COV比它们更少的培训时间。使用符号解码COV修剪在统计解码之前的许多节点和COV的搜索空间比HMM的搜索范围大约10-20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号