Context-Sensitive Spelling Error Correction Using Inter-Word Semantic Relation Analysis

机译：基于词间语义关系分析的上下文敏感拼写错误纠正

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Error words that appear in Korean texts can be largely categorized into non-word spelling errors and context-sensitive spelling errors. Of the two, context-sensitive spelling errors are shown only when considering the meaning of the word in the given context and its syntactic relation, and they are the most difficult to correct among spelling errors. Context-sensitive spelling errors can be categorized into homophone errors, typographical errors, grammatical errors, and cross-word boundary errors. To correct context-sensitive spelling errors that occur due to typographical errors, this study proposes a statistical context-sensitive spelling check using inter-word semantic relation analysis. With confusion sets created in advance, we can find and correct context-sensitive spelling errors using reliability based on the conditional probability and chi-square statistics between each word of the confusion sets and the context as well as the typing error rate. As a result of applying the proposed method, all 5 confusion sets showed higher precision (92.68%) and recall (83.95%) than the baseline (precision 80%, recall 80%).

机译：韩国文字中出现的错误词可以大致分为非单词拼写错误和上下文相关的拼写错误。在这两种情况中，仅在考虑给定上下文中单词的含义及其句法关系时才会显示上下文相关的拼写错误，并且在拼写错误中，最难纠正的错误是拼写错误。上下文相关的拼写错误可分为同音异义词，印刷错误，语法错误和填字游戏边界错误。为了纠正由于印刷错误而导致的上下文敏感拼写错误，本研究提出了一种使用词间语义关系分析的统计上下文敏感拼写检查方法。借助预先创建的混淆集，我们可以基于混淆集每个单词与上下文之间的条件概率和卡方统计以及键入错误率，使用可靠性来查找和纠正上下文相关的拼写错误。应用该方法的结果是，所有5个混淆集都显示出比基线（精度80％，召回80％）更高的准确性（92.68％）和召回率（83.95％）。

著录项

来源
《International Conference on Information Science and Applications》|2014年|1-4|共4页
会议地点
作者
Kim Minho; Choi Sung-Ki; Kwon Hyuk-Chul;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Statistical semantic and clinician confidence analysis for correcting abbreviations and spelling errors in clinical progress notes [J] . Wilson Wong, David Glance Artificial intelligence in medicine . 2011,第3期

机译：统计语义和临床医生信心分析，用于纠正临床进展记录中的缩写和拼写错误
2. Error-tolerant finite-state recognition with applications to morphological analysis and spelling correction [J] . Kemal Oflazer Computational linguistics . 1996,第1期

机译：容错有限状态识别及其在形态分析和拼写校正中的应用
3. The relation between content and structure in language production: an analysis of speech errors in semantic dementia. [J] . Meteyard L, Patterson K Brain and language . 2009,第3期

机译：语言生产中内容与结构之间的关系：语义痴呆中语音错误的分析。
4. Context-Sensitive Spelling Error Correction Using Inter-Word Semantic Relation Analysis [C] . Kim Minho, Choi Sung-Ki, Kwon Hyuk-Chul International Conference on Information Science and Applications . 2014

机译：使用词语语义关系分析的上下文敏感拼写错误校正
5. Analysis of third- and fifth-grade spelling errors on the Test of Written Spelling - 4: Do error types indicate levels of linguistic knowledge? [D] . Conway, Barbara Tenney 2011

机译：对书面拼写测试的第三和第五年级拼写误差的分析 - 4：误差类型表明语言知识水平？
6. End-of-Kindergarten Spelling Outcomes: How Can Spelling Error Analysis Data Inform Beginning Reading Instruction? [O] . Julia Ai Cheng Lee, Stephanie Al Otaiba -1

机译：幼儿园末期拼写结果：拼写错误分析数据如何通知开始阅读说明？
7. Deep Learning-Based Context-Sensitive Spelling Typing Error Correction [O] . Jung-Hun Lee, Minho Kim, Hyuk-Chul Kwon 2020

机译：基于深度学习的上下文敏感拼写键入纠错

Context-Sensitive Spelling Error Correction Using Inter-Word Semantic Relation Analysis

摘要

著录项

相似文献

相关主题

期刊订阅