International Conference on Computational Linguistics

Don't take 'nswvtnvakgxpm' for an answer - The surprising vulnerability of automatic content scoring systems to adversarial input


Abstract

Automatic content scoring systems are widely used on short answer tasks to save human effort. However, the use of these systems can invite cheating strategies, such as students writing irrelevant answers in the hopes of gaining at least partial credit. We generate adversarial answers for benchmark content scoring datasets based on different methods of increasing sophistication and show that even simple methods lead to a surprising decrease in content scoring performance. As an extreme example, up to 60% of adversarial answers generated from random shuffling of words in real answers are accepted by a state-of-the-art scoring system. In addition to analyzing the vulnerabilities of content scoring systems, we examine countermeasures such as adversarial training and show that these measures improve system robustness against adversarial answers considerably but do not suffice to completely solve the problem.
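The two simplest attacks the abstract and title allude to, random character strings and random shuffling of the words in a real answer, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the acceptance threshold, and `toy_keyword_scorer` (a stand-in for a real content scoring system) are all hypothetical, and the example answer is invented.

```python
import random
import string
from typing import Callable, List


def random_char_answer(length: int, rng: random.Random) -> str:
    """Return a string of random lowercase letters (e.g. 'nswvtnvakgxpm')."""
    return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))


def shuffled_answer(real_answer: str, rng: random.Random) -> str:
    """Return the words of a real answer in random order."""
    words = real_answer.split()
    rng.shuffle(words)
    return " ".join(words)


def acceptance_rate(
    answers: List[str], scorer: Callable[[str], int], threshold: int = 1
) -> float:
    """Fraction of answers to which the scorer gives at least `threshold` points."""
    if not answers:
        return 0.0
    accepted = sum(1 for answer in answers if scorer(answer) >= threshold)
    return accepted / len(answers)


def toy_keyword_scorer(answer: str) -> int:
    """Hypothetical stand-in scorer: 1 point if any keyword occurs, word order ignored."""
    keywords = {"evaporation", "condensation"}
    return int(any(w.strip(".,").lower() in keywords for w in answer.split()))


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    real_answers = ["Water rises by evaporation and falls after condensation."]

    shuffled = [shuffled_answer(a, rng) for a in real_answers]
    gibberish = [random_char_answer(13, rng) for _ in real_answers]

    print("shuffled accepted:", acceptance_rate(shuffled, toy_keyword_scorer))
    print("gibberish accepted:", acceptance_rate(gibberish, toy_keyword_scorer))
```

Because the toy scorer ignores word order, every shuffled answer keeps its score, which mirrors in miniature the kind of weakness the abstract reports for shuffled input. A real evaluation would replace `toy_keyword_scorer` with the trained scoring system under test and use answers from the benchmark datasets.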
