【24h】

Cross-Sentence Grammatical Error Correction

机译:跨句语法错误校正

获取原文

摘要

Automatic grammatical error correction (GEC) research has made remarkable progress in the past decade. However, all existing approaches to GEC correct errors by considering a single sentence alone and ignoring crucial cross-sentence context. Some errors can only be corrected reliably using cross-sentence context and models can also benefit from the additional contextual information in correcting other errors. In this paper, we address this serious limitation of existing approaches and improve strong neural encoder-decoder models by appropriately modeling wider contexts. We employ an auxiliary encoder that encodes previous sentences and incorporate the encoding in the decoder via attention and gating mechanisms. Our approach results in statistically significant improvements in overall GEC performance over strong baselines across multiple test sets. Analysis of our cross-sentence GEC model on a synthetic dataset shows high performance in verb tense corrections that require cross-sentence context.
机译:在过去的十年中,自动语法错误校正(GEC)研究取得了显着进展。但是,所有现有的GEC方法都通过仅考虑单个句子并忽略关键的跨句子上下文来纠正错误。某些错误只能使用跨句上下文可靠地纠正,并且模型还可以从其他上下文信息中受益,以纠正其他错误。在本文中,我们解决了现有方法的严重局限性,并通过适当地建模更广泛的上下文来改进强大的神经编码器-解码器模型。我们采用辅助编码器,该辅助编码器对先前的句子进行编码,并通过注意力和门控机制将编码合并到解码器中。我们的方法在多个测试集的强基准上导致GEC整体性能在统计上有显着改善。对我们在综合数据集上的跨句GEC模型进行的分析显示,在需要跨句上下文的动词时态校正中具有很高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号