【24h】

Detection of Aberrant Responses in OMR Documents

机译:检测OMR文件中的异常响应

获取原文

摘要

Optical Mark Recognition (OMR) is a well-known process of capturing human-marked data in documents. It is an extremely accurate and rapid form of data capture especially when each response can be entered as a single mark and for the same reason this process has been employed in various examinations/tests across the world. It is clearly understood that conducting these tests in an equitable manner is of the utmost importance. Sadly, in the past few years, there have been several cases in which teachers/administrators of elementary and high schools across the United States were identified for fraudulently correcting the answers written by their students in order to improve the success rate of their respective schools. In order to identify this format of cheating, a procedure was developed to autonomously determine if cheating has occurred by detecting the presence of aberrant responses in scanned OMR test books. The challenges introduced by the significant imbalance in the numbers of typical and aberrant bubbles were identified. The aberrant bubble detection problem was formulated as an outlier detection problem. A pool of features is initially selected by examining bubbles that are penciled by a group of individuals and analyzing the differences between them. Several possible outlier detection methods were considered and a feature based procedure in conjunction with a one-class SVM classifier was developed. A multi-criteria rank-of-rank-sum technique was introduced to rank and select a subset of features from a pool of candidate features. Using the data set of 11 individuals, it was shown that a detection accuracy of over 90% is possible.
机译:光学标记识别(OMR)是捕获文档中的人为标记数据的众所周知的过程。它是一种极其准确的数据捕获形式,特别是当每个响应都可以作为单个标记输入时,同样原因,这一过程已在全球各种检查/测试中使用。清楚地理解,以公平的方式进行这些测试是最重要的。遗憾的是,在过去的几年里,有几个案例,其中既有几个案例,美国小学和高中的教师/管理员被确定为欺诈性地纠正学生撰写的答案,以提高各自学校的成功率。为了识别这种作弊格式,开发了一种程序,以自主地确定是否通过检测扫描OMR测试书中的异常响应而发生作弊。鉴定了典型和异常气泡数量的显着不平衡引入的挑战。异常气泡检测问题被配制为异常检测问题。最初通过检查由一群人铅笔的气泡和分析它们之间的差异来选择的功能池。考虑了几种可能的异常检测方法,并且开发了与单级SVM分类器结合的基于特征的过程。引入了一个多标准秩和技术来等级,从候选功能池中选择一个功能的子集。使用11个个人的数据集,显示出超过90%的检测精度是可能的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号