首页> 中文期刊> 《计算机科学技术学报:英文版》 >Exploiting Unlabeled Data for Neural Grammatical Error Detection

Exploiting Unlabeled Data for Neural Grammatical Error Detection

         

摘要

Identifying and correcting grammatical errors in the text written by non-native writers have received increasingattention in recent years. Although a number of annotated corpora have been established to facilitate data-driven gram-matical error detection and correction approaches, they are still limited in terms of quantity and coverage because lmmanannotation is labor-intensive, time-consuming, and expensive. In this work, we propose to utilize unlabeled data to trainneural network based graminatical error detection models. The basic idea is to cast error detection ms a binary classificationproblem and derive positive and negative training examples from unlabeled data. We introduce an attention-based neuralnetwork to capture long-distance dependencies that influence the word being detected. Experiments show that the proposedapproach significantly outperforms SVM and convolutional networks with fixed-size context window.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号