Exploiting Unlabeled Data for Neural Grammatical Error Detection

Zhuo-Ran Liu; Yang Liu

Abstract

Identifying and correcting grammatical errors in text written by non-native writers has received increasing attention in recent years. Although a number of annotated corpora have been established to facilitate data-driven grammatical error detection and correction approaches, they are still limited in quantity and coverage because human annotation is labor-intensive, time-consuming, and expensive. In this work, we propose to utilize unlabeled data to train neural-network-based grammatical error detection models. The basic idea is to cast error detection as a binary classification problem and derive positive and negative training examples from unlabeled data. We introduce an attention-based neural network to capture long-distance dependencies that influence the word being detected. Experiments show that the proposed approach significantly outperforms SVM and convolutional networks with a fixed-size context window.
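The idea of deriving positive and negative training examples from unlabeled data can be illustrated with a minimal sketch. The abstract does not say how the examples are constructed, so everything below is an assumption made for illustration: words in unlabeled text are taken to be correct as written (label 1), and an erroneous counterpart (label 0) is produced by swapping the word for a confusable alternative. The `CONFUSION_SETS` table, the `derive_examples` function, and the label convention are hypothetical names and choices, not the authors' method.

```python
import random
from typing import Dict, List, Tuple

# Hypothetical confusion sets, for illustration only (not taken from the paper).
CONFUSION_SETS: Dict[str, List[str]] = {
    "in": ["on", "at"],
    "on": ["in", "at"],
    "at": ["in", "on"],
    "a": ["an", "the"],
    "an": ["a", "the"],
    "the": ["a", "an"],
}

# One example: (tokens, index of the word being classified, label).
Example = Tuple[List[str], int, int]


def derive_examples(sentence: str, rng: random.Random) -> List[Example]:
    """Derive labeled examples from one unlabeled sentence.

    Assumption: label 1 means the target word is left as written,
    label 0 means the target word was replaced by a confusable word.
    """
    tokens = sentence.split()
    examples: List[Example] = []
    for i, word in enumerate(tokens):
        candidates = CONFUSION_SETS.get(word.lower())
        if not candidates:
            continue  # no known confusable alternative for this word
        # The original word is treated as a correct usage.
        examples.append((tokens, i, 1))
        # A corrupted copy of the sentence is treated as an error.
        corrupted = list(tokens)
        corrupted[i] = rng.choice(candidates)
        examples.append((corrupted, i, 0))
    return examples


if __name__ == "__main__":
    rng = random.Random(0)
    for tokens, idx, label in derive_examples("She lives in a small town", rng):
        print(label, idx, " ".join(tokens))
```

Each resulting triple pairs a full sentence with the position of the word under inspection, so a binary classifier of the kind the abstract describes could, under these assumptions, look at the whole sentence rather than a fixed-size window around the target word.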

Bibliographic Record

Source
Journal of Computer Science and Technology (English edition) | 2017, Issue 4 | pp. 758-767 | 10 pages
Authors
Zhuo-Ran Liu; Yang Liu
Author affiliations

School of Software, Beihang University, Beijing 100191, China
State Key Laboratory of Intelligent Technology and Systems, Tsinghua University, Beijing 100084, China
Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China
Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Jiangsu Collaborative Innovation Center for Language Competence, Xuzhou 221009, China

Original format: PDF
Text language: chi
Chinese Library Classification: Automation technology; computer technology
Keywords
unlabeled data; grammatical error detection; neural network
