Pacific Asia Conference on Language, Information and Computation; 2006-11-01 to 2006-11-03; Wuhan (CN)

Which Is Essential for Chinese Word Segmentation: Character versus Word

Abstract

This paper presents an empirical comparison between word-based and character-based methods for Chinese word segmentation. Over three Chinese word segmentation Bakeoffs, the character-based method quickly rose to become a mainstream technique in this field. We describe the linguistic background and statistical features behind this observation. We also perform an empirical study comparing the word-based and character-based methods. Our results show that the character-based method alone can work well for Chinese word segmentation, without additional explicit word information from the training corpus.
