Finding errors in the Enron spreadsheet corpus

Abstract

Spreadsheet environments like MS Excel are the most widespread type of end-user software development tool, and spreadsheet-based applications can be found almost everywhere in organizations. Since spreadsheets are prone to error, several approaches have been proposed in the research literature to help users locate formula errors. However, the proposed methods were often designed based on assumptions about the nature of errors and were evaluated with mutations of correct spreadsheets. In this work we propose a method and tool to identify real-world formula errors within the Enron spreadsheet corpus. Our approach is based on heuristics that help us identify versions of the same spreadsheet, and our software helps the user identify spreadsheets that we assume contain error corrections. An initial manual inspection of a subset of such candidates led to the identification of more than two dozen formula errors. We publicly share the new collection of real-world spreadsheet errors.
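
The abstract does not spell out the heuristics, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes that versions of a spreadsheet can be grouped by file name after stripping version-like suffixes, and that a formula which differs between two versions is a candidate error correction. The names `version_key`, `group_versions`, and `changed_formulas`, the suffix pattern, and the cell-to-formula dictionaries are all hypothetical.

```python
import re
from collections import defaultdict

# Assumed heuristic: trailing tokens such as "_v2", " final", "(1)" or an
# ISO date mark a version and can be stripped to obtain a shared key.
_VERSION_SUFFIX = re.compile(
    r"([\s_\-]*(v\d+|rev\d+|final|copy|\d{4}-\d{2}-\d{2}|\(\d+\)))+$",
    flags=re.IGNORECASE,
)


def version_key(filename: str) -> str:
    """Return a normalized key shared by likely versions of one spreadsheet."""
    stem = re.sub(r"\.xlsx?$", "", filename, flags=re.IGNORECASE)
    return _VERSION_SUFFIX.sub("", stem).strip().lower()


def group_versions(filenames: list[str]) -> dict[str, list[str]]:
    """Group file names by key and keep only groups with several versions."""
    groups: dict[str, list[str]] = defaultdict(list)
    for name in filenames:
        groups[version_key(name)].append(name)
    return {key: names for key, names in groups.items() if len(names) > 1}


def changed_formulas(
    old: dict[str, str], new: dict[str, str]
) -> dict[str, tuple[str, str]]:
    """Return cells holding a formula in both versions whose text differs.

    `old` and `new` map cell addresses to cell contents; a content string
    starting with "=" is treated as a formula.
    """
    return {
        cell: (old[cell], new[cell])
        for cell in sorted(old.keys() & new.keys())
        if old[cell].startswith("=")
        and new[cell].startswith("=")
        and old[cell] != new[cell]
    }
```

A changed formula found this way is only a candidate: as the abstract notes, manual inspection is still needed to decide whether the change corrects an error.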