A Framework for Indonesian Grammar Error Correction

Lin Nankai; Chen Boyu; Lin Xiaotian; Wattanachote Kanoksak; Jiang Shengyi

首页> 外文期刊>ACM transactions on Asian and low-resource language information processing >A Framework for Indonesian Grammar Error Correction

【24h】

A Framework for Indonesian Grammar Error Correction

机译：印度尼西亚语法纠错的框架

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Grammatical Error Correction (GEC) is a challenge in Natural Language Processing research. Although many researchers have been focusing on GEC in universal languages such as English or Chinese, few studies focus on Indonesian, which is a low-resource language. In this article, we proposed a GEC framework that has the potential to be a baseline method for Indonesian GEC tasks. This framework treats GEC as a multi-classification task. It integrates different language embedding models and deep learning models to correct 10 types of Part of Speech (POS) error in Indonesian text. In addition, we constructed an Indonesian corpus that can be utilized as an evaluation dataset for Indonesian GEC research. Our framework was evaluated on this dataset. Results showed that the Long Short-Term Memory model based on word-embedding achieved the best performance. Its overall macro-average F-0.5 in correcting 10 POS error types reached 0.551. Results also showed that the framework can be trained on a low-resource dataset.

机译：语法纠错（GEC）是自然语言处理研究中的挑战。虽然许多研究人员一直专注于GEC，如英语或中文，如英语或中文，少数研究专注于印度尼西亚，这是一种低资源语言。在本文中，我们提出了一个GEC框架，有可能成为印度尼西亚GEC任务的基线方法。此框架将GEC视为多分类任务。它集成了不同语言嵌入模型和深度学习模型，以纠正印度尼西亚文本中的10种类型的语音（POS）错误。此外，我们构建了一个可用于印度尼西亚GEC研究的评估数据集的印度尼西亚语料库。我们的框架是在此数据集上进行评估。结果表明，基于Word-EmbEdding的长短期内存模型实现了最佳性能。其整体宏观平均F-0.5校正10 POS错误类型达到0.551。结果还表明，框架可以在低资源数据集上培训。

著录项

来源
《ACM transactions on Asian and low-resource language information processing》 |2021年第4期|57.1-57.12|共12页
作者
Lin Nankai; Chen Boyu; Lin Xiaotian; Wattanachote Kanoksak; Jiang Shengyi;
展开▼
作者单位

Guangdong Univ Foreign Studies Sch Comp Sci & Technol Guangzhou Guangdong Peoples R China;

Guangdong Univ Foreign Studies Sch Comp Sci & Technol Guangzhou Guangdong Peoples R China;

Guangdong Univ Foreign Studies Sch Comp Sci & Technol Guangzhou Guangdong Peoples R China;

Guangdong Univ Foreign Studies Sch Comp Sci & Technol Guangzhou Guangdong Peoples R China;

Guangdong Univ Foreign Studies Sch Comp Sci & Technol Guangzhou Guangdong Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Grammatical error correction; indonesian language processing; low-resource language; word-embedding;

机译：语法纠错;印度尼西亚语言处理;低资源语言;单词嵌入;

相似文献

外文文献
中文文献
专利

1. English Grammar Error Correction Algorithm Based on Classification Model [J] . Shanchun Zhou, Wei Liu Complexity . 2021,第a期

机译：基于分类模型的英语语法纠错算法
2. Error Correction of Enumerative Induction of Deterministic Context-free L-system Grammar [J] . Ryohei Nakano IAENG Internaitonal journal of computer science . 2013,第1期

机译：确定性上下文无关L系统语法的枚举归纳的误差校正
3. Foreign language learners’ beliefs about grammar instruction and error correction [J] . Volkan Incecay, Ye?im Ke?li Dollar Procedia - Social and Behavioral Sciences . 2011,第2期

机译：外语学习者对语法教学和错误纠正的信念
4. A Unified Framework for Grammar Error Correction [C] . Longkai Zhang, Houfeng Wang Conference on computational natural language learning . 2014

机译：语法错误校正的统一框架
5. Examining three types of correctional feedback about errors in mechanics and grammar in students with writing difficulties in grades 4--8. [D] . Du, Xiaoqing. 2009

机译：检查4--8年级写作困难学生有关力学和语法错误的三种类型的校正反馈。
6. Linking speech errors and phonological grammars: Insights from Harmonic Grammar networks [O] . Matthew Goldrick, Robert Daland -1

机译：链接语音错误和语音语法：来自谐波语法网络的见解
7. A Unified Framework for Grammar Error Correction [O] . Longkai Zhang, Houfeng Wang 2015

机译：语法错误纠正的统一框架

A Framework for Indonesian Grammar Error Correction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅