【24h】

Disambiguating Effectively Chinese Polyphonic Ambiguity Based on Unify Approach

机译:基于统一方法有效消除汉语复音歧义

获取原文

摘要

One of the difficult tasks on Natural Language Processing (NLP) is to resolve the sense ambiguity of characters or words on text, such as polyphones, homonymy, and homograph. The paper addresses the ambiguity issue of Chinese character polyphones and disambiguity approaches for such issues. Three methods, dictionary matching, language models and voting scheme, are used to disambiguate the prediction of polyphones. The best precision rate for these methods achieves 92.65%. Furthermore we proposed the unify approaches to improve the performance with respect to various threshold value. Comparing with the well-known MS Word 2007, our approach is superior and enhances the final precision rate up to 93.32%.
机译:自然语言处理(NLP)的一项艰巨任务是解决文本上字符或单词(例如多音素,同音和同形异义词)的含糊不清。本文讨论了汉字复音器的歧义问题以及针对此类问题的歧义方法。字典匹配,语言模型和投票方案这三种方法可用来消除对复音的预测的歧义。这些方法的最佳精度达到92.65%。此外,我们提出了统一的方法来提高各种阈值的性能。与著名的MS Word 2007相比,我们的方法更加出色,最终精度高达93.32%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号