【24h】

Ambiguity Solution of Pinyin Segmentation in Continuous Pinyin-to-Character Conversion

机译:连续拼音字符转换中拼音分割的歧义解

获取原文

摘要

Chinese Pinyin-to-Character conversion is a key technology in Chinese Pinyin input system. In sentence based Pinyin-to-Character conversion, segmentation of Pinyin string has important influence on performance of Pinyin-to-Character conversion. There are lots of ambiguities in segmentation of Pinyin string. This paper classifies them into overlap and combinational ambiguities, and proposes disambiguation algorithms for them respectively. We then combine ambiguity resolution with several different language model to implement Pinyin-to-Character conversion task, experiments show a good performance brought by proposed algorithms
机译:中文拼音到字符转换是中文拼音输入系统的一项关键技术。在基于句子的拼音到字符转换中,拼音字符串的分段对拼音到字符转换的性能有重要影响。拼音字符串的分割有很多歧义。本文将它们分为重叠歧义和组合歧义,并分别提出针对它们的歧义消除算法。然后将歧义解析度与几种不同的语言模型结合起来,以实现拼音到字符的转换任务,实验表明所提出的算法具有良好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号