Correcting writing errors in Turkish with a character-level neural language model

Abstract

A large part of the written content on the Internet consists of social media posts, articles written for content platforms, and user comments. Unlike content prepared for print media, these texts contain a large number of writing errors. Automating the detection and correction of writing errors in commercially produced content would reduce editing costs considerably. Although word-level language models perform well on analytic languages, they are not ideal for agglutinative languages such as Turkish, for which models built on smaller units such as morphemes or characters are more suitable. In this study, we propose a method that uses a character-level language model to correct writing errors in Turkish. Character-level text generation is used to calculate the probability of each candidate written form, and the most probable form is inferred to be the correct one. The proposed method is applied to errors in writing the conjunction "de", which is written as a separate word, and the suffix "-de", which is attached to the preceding word.
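
The candidate-scoring idea in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's implementation: a toy character n-gram model with add-k smoothing stands in for the neural language model, and the class `CharNGramLM`, the helpers `de_variants` and `most_probable`, the four-sentence corpus, and the parameters `order` and `k` are all hypothetical names and values chosen for the example.

```python
import math
from collections import Counter, defaultdict


class CharNGramLM:
    """Toy character n-gram model with add-k smoothing.

    Assumption: this is a stand-in for the neural character-level
    model mentioned in the abstract, used only to show the scoring step.
    """

    def __init__(self, order=4, k=0.1):
        self.order = order          # number of context characters
        self.k = k                  # smoothing constant
        self.counts = defaultdict(Counter)
        self.vocab = set()

    def _pad(self, text):
        # "^" marks the start of a sentence, "$" marks the end.
        return "^" * self.order + text + "$"

    def fit(self, texts):
        for text in texts:
            padded = self._pad(text)
            self.vocab.update(padded)
            for i in range(self.order, len(padded)):
                context = padded[i - self.order:i]
                self.counts[context][padded[i]] += 1
        return self

    def log_prob(self, text):
        """Sum of log P(char | previous `order` chars) over the sentence."""
        padded = self._pad(text)
        vocab_size = len(self.vocab)
        total = 0.0
        for i in range(self.order, len(padded)):
            seen = self.counts.get(padded[i - self.order:i], {})
            numerator = seen.get(padded[i], 0) + self.k
            denominator = sum(seen.values()) + self.k * vocab_size
            total += math.log(numerator / denominator)
        return total


def de_variants(left, right):
    """Return the attached ("-de") and separate ("de") written forms."""
    return [left + "de" + right, left + " de" + right]


def most_probable(lm, candidates):
    """Pick the candidate to which the model assigns the highest probability."""
    return max(candidates, key=lm.log_prob)


# Hypothetical toy corpus; a real model would need far more text.
TOY_CORPUS = [
    "ben de geldim",
    "sen de geldin",
    "kitap bende kaldı",
    "kalem sende kaldı",
]
lm = CharNGramLM(order=4, k=0.1).fit(TOY_CORPUS)

print(most_probable(lm, de_variants("ben", " geldim")))
print(most_probable(lm, de_variants("kitap ben", " kaldı")))
```

Both written forms are scored as whole sentences, so the characters that follow "de" also influence the decision. The sketch covers only the form "de"; handling the vowel-harmony variants would require generating more candidates.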

Bibliographic record

Source
Signal Processing and Communications Applications Conference, 2018, pp. 1-4 (4 pages)
Author
Burak Benligiray
Original format
PDF
Keywords
Dogs; Writing; Internet; Encyclopedias; Electronic publishing; Speech recognition

Similar documents

1. Correcting Arabic OCR Errors Using Improved Topic-Based Language Models [J]. Safeya Mamish, Mohamed Cheriet. International Journal of Computer Processing of Languages. 2009, No. 4
2. Implication for Second Language Learning and Language Pedagogy by Analyzing Errors in College Students' Writings [J]. Yan Lidong. 中国英语教学：英文版 (Teaching English in China, English edition). 2004, No. 1
3. Automatically correcting adverb placement errors in the writings of French users of English [J]. Marie Garnier. Procedia - Social and Behavioral Sciences. 2012, No. 2
4. Correcting writing errors in Turkish with a character-level neural language model [C]. Burak Benligiray. Signal Processing and Communications Applications Conference. 2018
5. Correcting Europe's error: Venetian cosmopolites on Turkish 'literature' [D]. Mintner, Terrance J. 2017
6. Critical neural substrates for correcting unexpected trajectory errors and learning from them [O]. Pratik K. Mutha, Robert L. Sainburg, Kathleen Y. Haaland
7. Character-Level Language Modeling with Hierarchical Recurrent Neural Networks [O]. Hwang, Kyuyeon; Sung, Wonyong. 2017
