首页> 外文期刊>ACM transactions on Asian language information processing >Using Data Mining Techniques and Rough Set Theory for Language Modeling
【24h】

Using Data Mining Techniques and Rough Set Theory for Language Modeling

机译:使用数据挖掘技术和粗糙集理论进行语言建模

获取原文
获取原文并翻译 | 示例
           

摘要

In this article, we propose a new postprocessing strategy, word suggestion, based on a multiple word trigger-pair language model for Chinese character recognizers. With the word suggestion strategy, Chinese character recognizers may even achieve a recognition rate greater than the top-n candidate recognition rate. To construct the multiple word trigger-pair model, data mining techniques are used to alleviate the intensive computation problem. Furthermore, rough set theory is first used in the study to discover negatively correlated relationships between words in order to prevent introducing wrong words in the process of word suggestion.
机译:在本文中,我们基于汉字识别器的多单词触发对语言模型,提出了一种新的后处理策略,即单词建议。通过单词建议策略,汉字识别器甚至可以实现比前n位候选者识别率更高的识别率。为了构建多字触发对模型,使用数据挖掘技术来缓解密集的计算问题。此外,在研究中首先使用粗糙集理论来发现单词之间的负相关关系,以防止在单词建议过程中引入错误的单词。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号