首页> 外文会议>International Conference on Applied Information Technology and Innovation >Prediction of Semantically Correct Bangla Words Using Stupid Backoff and Word-Embedding Model
【24h】

Prediction of Semantically Correct Bangla Words Using Stupid Backoff and Word-Embedding Model

机译:使用愚蠢的退避和词嵌入模型预测语义正确的孟加拉词

获取原文

摘要

Word prediction is an essential technique used in different text entry environment to facilitate error-free writing. It also used as a helping hand for people with different types of disabilities. Word prediction technique is available in different languages. But developing an optimized Bangla word predictor is still a great research challenge. To overcome the challenge we propose a hybrid method to predict Bangla words. The stupid backoff language model is used to detect the most-probable words that may fit into the previously typed sentence by calculating the word sequence frequency. The novelty of this work is that it can provide the semantically correct words as a suggestion. The Word-Embedding model is used to maintain the semantic context of the word. To test this approach, a large corpus is built consisting of almost 0.5 million data. We compared our approach with other well-established methods. The proposed methodology surpasses them by obtaining 83% accuracy. The approach is also computationally efficient as the running time is linear with the prediction length.
机译:Word预测是在不同文本输入环境中使用的基本技术,以便于无差错写入。它还用作具有不同类型残疾人的帮助手。单词预测技术以不同的语言提供。但是开发优化的孟加拉词预测器仍然是一个很好的研究挑战。为了克服挑战,我们提出了一种混合方法来预测孟加拉语言。愚蠢的退避语言模型用于通过计算单词序列频率来检测可能适合先前键入的句子的最可能的单词。这项工作的新颖之处在于它可以提供语义正确的单词作为建议。嵌入式模型用于维护单词的语义上下文。为了测试这种方法,大型语料库由近05万数据组成。我们将我们的方法与其他良好的方法进行了比较。所提出的方法通过获得83%的精度来超越它们。当运行时间与预测长度线性是线性的,该方法也是计算的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号