【24h】

Statistical Debugging Using Latent Topic Models

机译:使用潜在主题模型进行统计调试

获取原文
获取原文并翻译 | 示例

摘要

Statistical debugging uses machine learning to model program failures and help identify root causes of bugs. We approach this task using a novel Delta-Latent-Dirichlet-Allocation model. We model execution traces attributed to failed runs of a program as being generated by two types of latent topics: normal usage topics and bug topics. Execution traces attributed to successful runs of the same program, however, are modeled by usage topics only. Joint modeling of both kinds of traces allows us to identify weak bug topics that would otherwise remain undetected. We perform model inference with collapsed Gibbs sampling. In quantitative evaluations on four real programs, our model produces bug topics highly correlated to the true bugs, as measured by the Rand index. Qualitative evaluation by domain experts suggests that our model outperforms existing statistical methods for bug cause identification, and may help support other software tasks not addressed by earlier models.
机译:统计调试使用机器学习对程序故障进行建模,并帮助确定错误的根本原因。我们使用新颖的Delta-Latent-Dirichlet-分配模型来完成此任务。我们将归因于程序失败运行的执行跟踪建模为由两种类型的潜在主题生成的:正常使用主题和错误主题。但是,归因于同一程序成功运行的执行跟踪仅由使用主题建模。两种踪迹的联合建模使我们能够识别弱的bug主题,否则这些主题将不会被发现。我们使用折叠的吉布斯采样执行模型推断。在对四个真实程序的定量评估中,我们的模型会生成与真实错误高度相关的错误主题(以兰德指数衡量)。领域专家进行的定性评估表明,我们的模型优于现有的统计方法来识别错误原因,并且可能有助于支持早期模型无法解决的其他软件任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号