【24h】

Candidate re-ranking for SMT-based grammatical error correction

机译:基于SMT的语法错误纠正的候选者重新排名

获取原文
获取外文期刊封面目录资料

摘要

We develop a supervised ranking model to re-rank candidates generated from an SMT-based grammatical error correction (GEC) system. A range of novel features with respect to GEC are investigated and implemented in our re-ranker. We train a rank preference SVM model and demonstrate that this outperforms both Minimum Bayes-Risk and Multi-Engine Machine Translation based re-ranking for the GEC task. Our best system yields a significant improvement in I-measure when testing on the publicly available FCE test set (from 2.87% to 9.78%). It also achieves an F_(0.5) score of 38.08% on the CoNLL-2014 shared task test set, which is higher than the best original result. The oracle score (upper bound) for the re-ranker achieves over 40% I-measure performance, demonstrating that there is considerable room for improvement in the re-ranking component developed here, such as incorporating features able to capture long-distance dependencies.
机译:我们开发了一种监督排序模型来对从基于SMT的语法错误纠正(GEC)系统生成的候选者重新排序。在我们的重新排名中,研究和实现了与GEC相关的一系列新颖功能。我们训练了等级偏好的SVM模型,并证明了它优于GEC任务基于最小贝叶斯风险和基于多引擎机器翻译的重新排名。当在公开的FCE测试仪上进行测试时,我们最好的系统可以显着改善I-measure(从2.87%到9.78%)。 CoNLL-2014共享任务测试集的F_(0.5)分数也达到38.08%,高于最佳原始结果。重新排名的预言分数(上限)达到了40%的I-measure性能,表明此处开发的重新排名组件有很大的改进空间,例如合并了能够捕获长距离依赖性的功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号