Automatic Proofreading in Chinese: Detect and Correct Spelling Errors in Character-Level with Deep Neural Networks

Abstract

The rapid growth in the volume of text makes manual proofreading very costly. Automatic proofreading, by comparison, offers clear savings in time and human resources, and it is attracting a growing number of researchers. In this paper, we propose two attention-based deep neural network models, combined with confusion sets, to detect and correct possible Chinese spelling errors at the character level. Our approaches first model the context of Chinese character embeddings using Long Short-Term Memory (LSTM) networks, then score the probabilities of the candidates from a character's confusion set through an attention mechanism, choosing the highest-scoring candidate as the prediction. We also define a new methodology for obtaining (preceding text, following text, candidates, target) quads and provide a supervised dataset for training and testing (our data has been released to the public at https://github.com/ccit-proofread). Performance evaluation indicates that our models achieve state-of-the-art performance and outperform a set of baselines.
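
The quad format and the "score every candidate, keep the best" step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `Quad`, `build_quads`, `predict`, and `toy_score` are hypothetical, the rule of emitting one quad per character that has a confusion set is an assumption, and the paper's LSTM-with-attention scorer is replaced by an arbitrary scoring function passed in by the caller.

```python
import math
from typing import Callable, Dict, List, NamedTuple, Sequence, Tuple


class Quad(NamedTuple):
    """(preceding text, following text, candidates, target), as named in the abstract."""
    preceding: str
    following: str
    candidates: List[str]
    target: str


def build_quads(sentence: str, confusion_sets: Dict[str, Sequence[str]]) -> List[Quad]:
    """Assumed construction: one quad per character that has a confusion set."""
    quads = []
    for i, ch in enumerate(sentence):
        confusable = confusion_sets.get(ch)
        if not confusable:
            continue
        # Candidates: the correct character plus its confusable characters,
        # de-duplicated while preserving order.
        candidates = list(dict.fromkeys([ch, *confusable]))
        quads.append(Quad(sentence[:i], sentence[i + 1:], candidates, ch))
    return quads


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over raw candidate scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict(
    quad: Quad, score_fn: Callable[[str, str, str], float]
) -> Tuple[str, List[float]]:
    """Score each candidate in context and return the most probable one.

    score_fn stands in for the paper's LSTM-with-attention model,
    which is not reproduced here.
    """
    scores = [score_fn(quad.preceding, quad.following, c) for c in quad.candidates]
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return quad.candidates[best], probs


def toy_score(preceding: str, following: str, candidate: str) -> float:
    """Hypothetical scorer: favours one hard-coded bigram, for illustration only."""
    return 2.0 if preceding[-1:] + candidate == "我在" else 0.0


if __name__ == "__main__":
    # Tiny made-up confusion set; real confusion sets are much larger.
    sets = {"在": ["再"], "再": ["在"]}
    for quad in build_quads("我在家", sets):
        print(quad, predict(quad, toy_score))
```

Keeping the scorer as a plain callable means a trained neural model could be dropped in later without changing how quads are built or how the prediction is selected.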

Bibliographic record

Source
CCF International Conference on Natural Language Processing and Chinese Computing | 2019 | pp. 349-359 | 11 pages
Conference location: Dunhuang (CN)
Authors
Qiufeng Wang; Minghuan Liu; Weijia Zhang; Yuhang Guo; Tianrui Li
Author affiliation

School of Information Science and Technology, Southwest Jiaotong University, 999 Xi'an Road, Chengdu, China

Original format: PDF
Keywords
Error detection of Chinese text; Error correction of Chinese text; LSTM model; Attention mechanism;
