【24h】

Detection of Aberrant Responses in OMR Documents

机译:检测OMR文档中的异常响应

获取原文

摘要

Optical Mark Recognition (OMR) is a well-known process of capturing human-marked data in documents. It is an extremely accurate and rapid form of data capture especially when each response can be entered as a single mark and for the same reason this process has been employed in various examinations/tests across the world. It is clearly understood that conducting these tests in an equitable manner is of the utmost importance. Sadly, in the past few years, there have been several cases in which teachers/administrators of elementary and high schools across the United States were identified for fraudulently correcting the answers written by their students in order to improve the success rate of their respective schools. In order to identify this format of cheating, a procedure was developed to autonomously determine if cheating has occurred by detecting the presence of aberrant responses in scanned OMR test books. The challenges introduced by the significant imbalance in the numbers of typical and aberrant bubbles were identified. The aberrant bubble detection problem was formulated as an outlier detection problem. A pool of features is initially selected by examining bubbles that are penciled by a group of individuals and analyzing the differences between them. Several possible outlier detection methods were considered and a feature based procedure in conjunction with a one-class SVM classifier was developed. A multi-criteria rank-ofrank-sum technique was introduced to rank and select a subset of features from a pool of candidate features. Using the data set of 11 individuals, it was shown that a detection accuracy of over 90% is possible.
机译:光学标记识别(OMR)是在文档中捕获人类标记数据的众所周知的过程。这是一种非常准确和快速的数据捕获形式,尤其是当每个响应都可以作为单个标记输入时,并且出于相同的原因,此过程已在全球范围内的各种检查/测试中采用。显然,以公平的方式进行这些测试是至关重要的。可悲的是,在过去的几年中,有很多情况下,美国各地的小学和中学的老师/行政人员被认定为欺诈性地纠正他们的学生所写的答案,以提高各自学校的成功率。为了识别这种作弊形式,开发了一种程序来通过检测扫描的OMR测试书中是否存在异常响应来自主确定是否发生了作弊行为。确定了典型气泡和异常气泡的数量显着不平衡所带来的挑战。将异常气泡检测问题表述为异常检测问题。首先,通过检查由一组人绘制的气泡并分析它们之间的差异来选择特征池。考虑了几种可能的异常检测方法,并结合一类SVM分类器开发了基于特征的过程。引入了多标准秩和求和技术,以从候选特征池中对特征子集进行排名和选择。使用11个人的数据集,显示出90%以上的检测精度是可能的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号