...
首页> 外文期刊>Information and software technology >Bug localization using latent Dirichlet allocation
【24h】

Bug localization using latent Dirichlet allocation

机译:使用潜在的Dirichlet分配进行错误本地化

获取原文
获取原文并翻译 | 示例

摘要

Context: Some recent static techniques for automatic bug localization have been built around modern information retrieval (IR) models such as latent semantic indexing (LSI). Latent Dirichlet allocation (LDA) is a generative statistical model that has significant advantages, in modularity and extensibility, over both LSI and probabilistic LSI (pLSI). Moreover, LDA has been shown effective in topic model based information retrieval. In this paper, we present a static LDA-based technique for automatic bug localization and evaluate its effectiveness.rnObjective: We evaluate the accuracy and scalability of the LDA-based technique and investigate whether it is suitable for use with open-source software systems of varying size, including those developed using agile methods.rnMethod: We present five case studies designed to determine the accuracy and scalability of the LDA-based technique, as well as its relationships to software system size and to source code stability. The studies examine over 300 bugs across more than 25 iterations of three software systems. Results: The results of the studies show that the LDA-based technique maintains sufficient accuracy across all bugs in a single iteration of a software system and is scalable to a large number of bugs across multiple revisions of two software systems. The results of the studies also indicate that the accuracy of the LDA-based technique is not affected by the size of the subject software system or by the stability of its source code base.rnConclusion: We conclude that an effective static technique for automatic bug localization can be built around LDA. We also conclude that there is no significant relationship between the accuracy of the LDA-based technique and the size of the subject software system or the stability of its source code base. Thus, the LDA-based technique is widely applicable.
机译:上下文:围绕现代信息检索(IR)模型(例如潜在语义索引(LSI))建立了一些用于自动错误定位的最新静态技术。潜在狄利克雷分配(LDA)是一种生成统计模型,与LSI和概率LSI(pLSI)相比,在模块化和可扩展性方面具有显着优势。此外,LDA已显示在基于主题模型的信息检索中有效。在本文中,我们提出了一种基于静态LDA的自动bug定位技术,并对其有效性进行了评估。rn目的:我们评估基于LDA的技术的准确性和可伸缩性,并研究其是否适合与开源软件系统配合使用。方法:我们提供了五个案例研究,旨在确定基于LDA的技术的准确性和可扩展性,以及与软件系统大小和源代码稳定性之间的关系。这些研究在三个软件系统的25个以上迭代中检查了300多个错误。结果:研究结果表明,基于LDA的技术可在软件系统的一次迭代中对所有错误保持足够的准确性,并且可扩展到两个软件系统的多个修订版之间的大量错误。研究结果还表明,基于LDA的技术的准确性不受主题软件系统的大小或其源代码库的稳定性的影响。rn结论:我们得出结论:有效的静态技术可用于自动错误定位可以围绕LDA构建。我们还得出结论,基于LDA的技术的准确性与主题软件系统的大小或其源代码库的稳定性之间没有显着关系。因此,基于LDA的技术是广泛适用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号