首页> 外文会议>Conference on empirical methods in natural language processing >A Transition-based Model for Joint Segmentation, POS-tagging and Normalization
【24h】

A Transition-based Model for Joint Segmentation, POS-tagging and Normalization

机译:基于过渡的联合细分,POS标记和规范化模型

获取原文

摘要

We propose a transition-based model for joint word segmentation, POS tagging and text normalization. Different from previous methods, the model can be trained on standard text corpora, overcoming the lack of annotated microblog corpora. To evaluate our model, we develop an annotated corpus based on microblogs. Experimental results show that our joint model can help improve the performance of word segmentation on microblogs, giving an error reduction in segmentation accuracy of 12.02%, compared to the traditional approach.
机译:我们提出了一种基于过渡的模型,用于联合分词,POS标记和文本规范化。与以前的方法不同,该模型可以在标准文本语料库上进行训练,从而克服了带注释的微博语料库的不足。为了评估我们的模型,我们基于微博客开发了一个带注释的语料库。实验结果表明,与传统方法相比,我们的联合模型可以帮助提高微博上的分词性能,使分词准确性降低了12.02%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号