Prediction of Semantically Correct Bangla Words Using Stupid Backoff and Word-Embedding Model

Abstract

Word prediction is an essential technique used in different text entry environments to facilitate error-free writing. It also serves as an aid for people with different types of disabilities. Word prediction is available in many languages, but developing an optimized Bangla word predictor remains a significant research challenge. To address this challenge, we propose a hybrid method to predict Bangla words. The stupid backoff language model is used to detect the most probable words that may follow the previously typed text by calculating word sequence frequencies. The novelty of this work is that it can suggest semantically correct words: a word-embedding model is used to maintain the semantic context of the word. To test this approach, we built a large corpus consisting of almost 0.5 million data items. We compared our approach with other well-established methods, and the proposed methodology surpasses them, obtaining 83% accuracy. The approach is also computationally efficient, as its running time grows linearly with the prediction length.
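
The abstract does not give the scoring details, so the following is only a minimal sketch of how stupid backoff can rank next-word candidates from word sequence frequencies. It assumes a trigram model, a backoff factor of 0.4 (the value commonly used with stupid backoff, not a figure from this paper), and a tiny romanized toy corpus standing in for Bangla text. All function names are hypothetical, and the word-embedding step of the hybrid method is not shown.

```python
from collections import Counter

# Assumed backoff factor; 0.4 is the value commonly used with stupid backoff,
# not a number taken from this paper.
ALPHA = 0.4


def count_ngrams(sentences, max_order=3):
    """Count every n-gram of length 1..max_order in tokenized sentences."""
    counts = Counter()
    for tokens in sentences:
        for n in range(1, max_order + 1):
            for i in range(len(tokens) - n + 1):
                counts[tuple(tokens[i:i + n])] += 1
    return counts


def stupid_backoff_score(word, context, counts, total_tokens):
    """Return the (unnormalized) stupid backoff score of `word` after `context`."""
    penalty = 1.0
    context = tuple(context)
    while context:
        ngram = context + (word,)
        if counts[ngram] > 0:
            # Relative frequency of the n-gram given its context.
            return penalty * counts[ngram] / counts[context]
        # Unseen n-gram: drop the oldest context word and apply the penalty.
        context = context[1:]
        penalty *= ALPHA
    # No context left: fall back to the unigram relative frequency.
    return penalty * counts[(word,)] / total_tokens


def predict_next(context, counts, vocabulary, total_tokens, top_k=3):
    """Rank vocabulary words by score, using the last two words as context."""
    scored = [
        (stupid_backoff_score(w, context[-2:], counts, total_tokens), w)
        for w in vocabulary
    ]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [w for score, w in scored[:top_k] if score > 0]


# Toy romanized corpus (hypothetical data, for illustration only).
corpus = [
    ["ami", "bhat", "khai"],
    ["ami", "bhat", "khai"],
    ["ami", "boi", "pori"],
    ["tumi", "bhat", "khao"],
]
counts = count_ngrams(corpus)
total_tokens = sum(len(sentence) for sentence in corpus)
vocabulary = sorted({w for sentence in corpus for w in sentence})

suggestions = predict_next(("ami", "bhat"), counts, vocabulary, total_tokens)
print(suggestions)
```

Because stupid backoff returns scores rather than normalized probabilities, the values are only meaningful for ranking candidates against each other. In a hybrid system such as the one the abstract describes, a list like `suggestions` could then be filtered or re-ranked by a word-embedding model.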

Bibliographic record

Source
International Conference on Applied Information Technology and Innovation | 2019 | 1 v. | 5 pages
Authors
Tanni Mittra; Linta Islam; Deepak Chandra Roy
Original format: PDF
Chinese Library Classification: Communications
Keywords
natural language processing; probability; speech recognition; text analysis; word processing
