首页> 外文会议> >A Maximum Entropy Approach to Chinese Pin Yin-To-Character Conversion
【24h】

A Maximum Entropy Approach to Chinese Pin Yin-To-Character Conversion

机译:汉语拼音字符转换的最大熵方法

获取原文

摘要

This paper introduces a new approach based upon Maximum Entropy (ME) frame to solve the Pinyin-to-character (PTC) conversation problem. Mostly there is more than one Chinese characters share the same Pinyin. The task of PTC algorithm is to distinguish such kind ambiguity. PTC can be regards as to classify a Pinyin to a special character according the context which is represented as feature in ME. By taking the advantage of ME, the local and non-local information are included, so the conversation performance is improved. Experiments show that 87% hit rate (without tone) is achieved.
机译:本文介绍了一种基于最大熵(ME)框架的新方法来解决拼音转字符(PTC)会话问题。通常,不止一个汉字共享同一拼音。 PTC算法的任务是区分这种类型的歧义。可以将PTC视为根据ME中表示为特征的上下文将拼音分类为特殊字符。通过利用ME的优势,可以将本地和非本地信息都包括在内,从而提高了会话性能。实验表明,达到了87%的命中率(无提示音)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号