Natural Language Processing based Text Imputation for Malayalam Corpora

机译：基于自然语言处理的马拉雅拉语语料库的文本插补

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The prediction task in Natural Language Processing intends to figure out the missing characters, letter, word, expression, or sentence that conceivable follows in a given fragment of a book. Since the beginning of NLP, numerous frameworks with various techniques were produced for various dialects. The missing content prediction is one among the significant concerns of Natural Language Processing. Moreover, most of the text prediction related tasks are conducted in different dialects but not in Malayalam. Over these years though having some of the best classics, historical records and many more to keep the world interested with, the Malayalam was lacking many of the advantages that any other language process in the digital world. This is because there are only a few standard models that could fit into the Malayalam. In this paper, an attempt is made to fit the BERT into Malayalam. A Malayalam pre-trained model will be creating and implementing some of its applications with the model and discussing its scope and areas of development to be addressed and future research scope.

机译：自然语言处理中的预测任务旨在找出在给定书本片段中可能出现的缺少的字符，字母，单词，表达方式或句子。自从NLP诞生以来，针对各种方言产生了具有各种技术的众多框架。缺少内容的预测是自然语言处理的重要关注之一。此外，大多数与文本预测相关的任务都是在不同的方言中进行的，而在马拉雅拉姆语中则没有。这些年来，尽管马拉雅拉姆语有一些最好的经典，历史记录以及许多让世界感兴趣的东西，但它却缺乏数字世界中任何其他语言所具有的许多优势。这是因为只有少数标准模型可以放入马拉雅拉姆语中。在本文中，尝试将BERT装入马拉雅拉姆语。马拉雅拉姆语预训练的模型将使用该模型来创建和实现其某些应用程序，并讨论其要解决的范围和发展领域以及未来的研究范围。

著录项

来源
《International Conference on Electronics and Sustainable Communication Systems》|2020年|161-165|共5页
会议地点
作者
Annlin Rojan; Edwin Alias; Georgy M. Rajan; Jithin Mathew; Dhanya Sudarsan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Bit error rate; Task analysis; Predictive models; Training; Natural language processing; Data models; Computational modeling;

机译：误码率;任务分析;预测模型;训练;自然语言处理;数据模型;计算模型;

相似文献

外文文献
中文文献
专利

1. Deep Learning Based Part-of-Speech Tagging for Malayalam Twitter Data (Special Issue: Deep Learning Techniques for Natural Language Processing) [J] . S.Kumar, M. AnandKumar, K.P.Soman Journal of Intelligent Systems . 2019,第3期

机译：基于深入学习的Malayalam Twitter数据的语音标记（特殊问题：自然语言处理的深度学习技巧）
2. Steven Bird, Ewan Klein and Edward Loper:Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit [J] . Wiebke Wagner Language Resources and Evaluation . 2010,第4期

机译：Steven Bird，Ewan Klein和Edward Loper：使用Python进行自然语言处理，使用自然语言工具包分析文本
3. Semantic similarity of short texts in languages with a deficient natural language processing support [J] . Bojan Furlan, Vuk Batanovic, Bosko Nikolic Decision support systems . 2013,第3期

机译：缺乏自然语言处理支持的语言中的短文本的语义相似性
4. Speech database and text corpora for Malayalam language automatic speech recognition technology [C] . Cini Kurian 2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Technique . 2016

机译：用于马拉雅拉姆语语言自动语音识别技术的语音数据库和文本语料库
5. Methods for Improving Natural Language Processing Techniques with Linguistic Regularities Extracted from Large Unlabeled Text Corpora [D] . Lucas, Michael Ryan. 2019

机译：提高了大型未标记文本语料库语言规律的自然语言处理技术的方法
6. Empirical automated vocabulary discovery using large text corpora and advanced natural language processing tools. [O] . W. R. Hersh, E. H. Campbell, D. A. Evans, 1996

机译：使用大型文本语料库和先进的自然语言处理工具进行经验性的自动词汇发现。
7. Deep Learning Based Part-of-Speech Tagging for Malayalam Twitter Data (Special Issue: Deep Learning Techniques for Natural Language Processing) [O] . S. Kumar, M. Anand Kumar, K.P. Soman 2019

机译：基于深度学习的Malayalam Twitter数据的演讲标记（特殊问题：自然语言处理的深度学习技术）

Natural Language Processing based Text Imputation for Malayalam Corpora

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅