【24h】

Finding errors in the Enron spreadsheet corpus

机译:在Enron电子表格语料库中查找错误

获取原文

摘要

Spreadsheet environments like MS Excel are the most widespread type of end-user software development tools and spreadsheet-based applications can be found almost everywhere in organizations. Since spreadsheets are prone to error, several approaches were proposed in the research literature to help users locate formula errors. However, the proposed methods were often designed based on assumptions about the nature of errors and were evaluated with mutations of correct spreadsheets. In this work we propose a method and tool to identify realworld formula errors within the Enron spreadsheet corpus. Our approach is based on heuristics that help us identify versions of the same spreadsheet and our software helps the user identify spreadsheets of which we assume that they contain error corrections. An initial manual inspection of a subset of such candidates led to the identification of more than two dozen formula errors. We publicly share the new collection of real-world spreadsheet errors.
机译:像MS Excel这样的电子表格环境是最终用户软件开发工具中最广泛的类型,基于电子表格的应用程序几乎可以在组织中的任何地方找到。由于电子表格容易出错,因此在研究文献中提出了几种方法来帮助用户定位公式错误。但是,建议的方法通常是基于对错误性质的假设而设计的,并通过正确电子表格的突变进行评估。在这项工作中,我们提出了一种方法和工具来识别Enron电子表格语料库中的现实世界公式错误。我们的方法基于启发式技术,可以帮助我们识别同一电子表格的版本,而我们的软件可以帮助用户识别我们认为其中包含错误更正的电子表格。最初对此类候选物的子集进行手动检查导致发现了两个以上的公式错误。我们公开分享了现实世界中电子表格错误的新集合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号