首页> 外文会议>IEEE International Conference on Software Analysis, Evolution, and Reengineering >Automated quality assessment for crowdsourced test reports of mobile applications
【24h】

Automated quality assessment for crowdsourced test reports of mobile applications

机译:自动化质量评估,用于移动应用众包测试报告

获取原文

摘要

In crowdsourced mobile application testing, crowd workers help developers perform testing and submit test reports for unexpected behaviors. These submitted test reports usually provide critical information for developers to understand and reproduce the bugs. However, due to the poor performance of workers and the inconvenience of editing on mobile devices, the quality of test reports may vary sharply. At times developers have to spend a significant portion of their available resources to handle the low-quality test reports, thus heavily decreasing their efficiency. In this paper, to help developers predict whether a test report should be selected for inspection within limited resources, we propose a new framework named TERQAF to automatically model the quality of test reports. TERQAF defines a series of quantifiable indicators to measure the desirable properties of test reports and aggregates the numerical values of all indicators to determine the quality of test reports by using step transformation functions. Experiments conducted over five crowdsourced test report datasets of mobile applications show that TERQAF can correctly predict the quality of test reports with accuracy of up to 88.06% and outperform baselines by up to 23.06%. Meanwhile, the experimental results also demonstrate that the four categories of measurable indicators have positive impacts on TERQAF in evaluating the quality of test reports.
机译:在众包的移动应用程序测试中,人群工作者可以帮助开发人员执行测试并提交针对意外行为的测试报告。这些提交的测试报告通常为开发人员提供重要信息,以帮助他们理解和重现这些错误。但是,由于工作人员表现不佳以及在移动设备上进行编辑带来的不便,测试报告的质量可能会发生很大的变化。有时,开发人员不得不花费大量可用资源来处理低质量的测试报告,从而大大降低了效率。在本文中,为了帮助开发人员预测是否应在有限的资源范围内选择测试报告进行检查,我们提出了一个名为TERQAF的新框架来自动对测试报告的质量进行建模。 TERQAF定义了一系列可量化的指标,以测量测试报告的理想属性,并汇总所有指标的数值,以使用逐步转换函数来确定测试报告的质量。对五个移动应用程序的众包测试报告数据集进行的实验表明,TERQAF可以正确预测测试报告的质量,准确度高达88.06 \%,并且比基准性能高出23.6%\%。同时,实验结果还表明,这四类可衡量的指标对TERQAF评估测试报告的质量具有积极的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号