首页> 外文会议>2011 International Conference on Asian Language Processing >Error-Driven Adaptive Language Modeling for Chinese Pinyin-to-Character Conversion
【24h】

Error-Driven Adaptive Language Modeling for Chinese Pinyin-to-Character Conversion

机译:用于汉语拼音到字符转换的错误驱动的自适应语言建模

获取原文

摘要

The performance of Chinese Pinyin-to-Character conversion is severely affected when the characteristics of the training and conversion data differ. As natural language is highly variable and uncertain, it is impossible to build a complete and general language model to suit all the tasks. The traditional adaptive MAP models mix the task independent data with task dependent data using a mixture coefficient but we never can predict what style of language users have and what new domain will appear. This paper presents a statistical error-driven adaptive language modeling approach to Chinese Pinyin input system. This model can be incrementally adapted when an error occurs during Pinyin-to-Character converting time. It significantly improves Pinyin-to-Character conversion rate.
机译:当训练和转换数据的特征不同时,中文拼音到字符转换的性能会受到严重影响。由于自然语言具有很大的可变性和不确定性,因此不可能建立适合所有任务的完整而通用的语言模型。传统的自适应MAP模型使用混合系数将与任务无关的数据与与任务相关的数据混合在一起,但是我们永远无法预测用户使用哪种样式的语言以及将出现什么新领域。本文提出了一种统计误差驱动的汉语拼音输入系统自适应语言建模方法。当拼音到字符转换期间发生错误时,可以逐步调整该模型。它显着提高了拼音到字符的转换率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号