Conference: LREC-2012

A Framework for Evaluating Text Correction

Abstract

Computer-based aids for writing assistance have been around since at least the early 1980s, focussing primarily on aspects such as spelling, grammar and style. The potential audience for such tools is very large indeed, and this is a clear case where we might expect to see language processing applications having a significant real-world impact. However, existing comparative evaluations of applications in this space are often no more than impressionistic and anecdotal reviews of commercial offerings as found in software magazines, making it hard to determine which approaches are superior. More rigorous evaluation in the scholarly literature has been held back in particular by the absence of shared datasets of texts marked up with errors, and the lack of an agreed evaluation framework. Significant collections of publicly available data are now appearing; this paper describes a complementary evaluation framework, which has been piloted in the Helping Our Own shared task. The approach, which uses stand-off annotations for representing edits to text, can be used in a wide variety of text-correction tasks, and easily accommodates different error tagsets.
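
To make the idea of stand-off edit annotations concrete, here is a minimal Python sketch. It is not the format or scoring scheme used in the paper or in the Helping Our Own shared task; the `Edit` class, the `apply_edits` and `score` functions, the character-offset convention and the exact-match scoring rule are all assumptions chosen for illustration. Each edit is kept separate from the source text and points into it by offsets, and a system's edits are compared against gold-standard edits with precision, recall and F-score.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Edit:
    """Hypothetical stand-off edit: points into the source text by offsets."""
    start: int        # character offset where the erroneous span begins
    end: int          # character offset just past the erroneous span
    correction: str   # replacement text ("" deletes the span)
    tag: str = ""     # optional error type from any tagset; ignored in scoring


def apply_edits(text, edits):
    """Return the corrected text; the source text itself is never modified."""
    # Apply from right to left so that earlier offsets remain valid.
    for e in sorted(edits, key=lambda e: e.start, reverse=True):
        text = text[:e.start] + e.correction + text[e.end:]
    return text


def score(gold, system):
    """Precision, recall and F1 under exact match of span and correction.

    Exact matching is an assumption of this sketch, not the paper's scheme.
    """
    g = {(e.start, e.end, e.correction) for e in gold}
    s = {(e.start, e.end, e.correction) for e in system}
    tp = len(g & s)
    p = tp / len(s) if s else 0.0
    r = tp / len(g) if g else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f


source = "He go to the school yesterday."
gold = [Edit(3, 5, "went", "verb-form"), Edit(9, 13, "", "article")]
system = [Edit(3, 5, "went", "V"), Edit(6, 8, "into", "prep")]

print(apply_edits(source, gold))   # He went to school yesterday.
print(score(gold, system))         # (0.5, 0.5, 0.5)
```

Because the error tag is carried alongside each edit but left out of the comparison, the same scoring code works whichever tagset the gold data and the system happen to use, which is one way to read the abstract's claim that the approach accommodates different error tagsets.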