
A part of speech estimation method for Japanese unknown words using a statistical model of morphology and context



Abstract

We present a statistical model of Japanese unknown words consisting of a set of length and spelling models classified by the character types that constitute a word. The point is quite simple: different character sets should be treated differently, and the changes between character types are very important because Japanese script has both ideograms like Chinese (kanji) and phonograms like English (katakana). Both word segmentation accuracy and part of speech tagging accuracy are improved by the proposed model. The model can achieve 96.6
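To make the notion of "character types" and "changes between character types" concrete, the following is a minimal sketch in Python. It is not the model from the paper: the function names `char_type` and `type_runs`, the label set, and the Unicode ranges used are assumptions chosen for illustration. It only shows how a string can be labeled by character type and split wherever the type changes, which is the kind of signal the abstract says the length and spelling models are conditioned on.

```python
from typing import List, Tuple


def char_type(ch: str) -> str:
    """Return a coarse character-type label for a single character.

    The ranges are the basic Unicode blocks for hiragana, katakana and
    CJK unified ideographs; this label set is an illustrative assumption.
    """
    code = ord(ch)
    if 0x3040 <= code <= 0x309F:
        return "hiragana"
    if 0x30A0 <= code <= 0x30FF:
        return "katakana"
    if 0x4E00 <= code <= 0x9FFF:
        return "kanji"
    if ch.isascii() and ch.isalpha():
        return "alphabet"
    if ch.isdigit():
        return "digit"
    return "other"


def type_runs(text: str) -> List[Tuple[str, str]]:
    """Split text into maximal runs of characters sharing one type."""
    runs: List[Tuple[str, str]] = []
    for ch in text:
        label = char_type(ch)
        if runs and runs[-1][0] == label:
            # Same type as the previous character: extend the current run.
            runs[-1] = (label, runs[-1][1] + ch)
        else:
            # Character type changed: start a new run.
            runs.append((label, ch))
    return runs


if __name__ == "__main__":
    for label, chunk in type_runs("東京タワーに行く"):
        print(label, chunk)
```

On the sample string, the type changes fall between 東京, タワー, に, 行 and く. A model in the spirit of the abstract would keep separate length and spelling statistics per character type rather than treating these runs as word boundaries directly.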
