Conference: International Conference on Software Engineering

Detection of Duplicate Defect Reports Using Natural Language Processing

Abstract

Defect reports are generated from various testing and development activities in software engineering. Sometimes two reports are submitted that describe the same problem, leading to duplicate reports. These reports are mostly written in structured natural language, and as such, it is hard to compare two reports for similarity with formal methods. In order to identify duplicates, we investigate using Natural Language Processing (NLP) techniques to support the identification. A prototype tool is developed and evaluated in a case study analyzing defect reports at Sony Ericsson Mobile Communications. The evaluation shows that about 2/3 of the duplicates can possibly be found using the NLP techniques. Different variants of the techniques provide only minor result differences, indicating a robust technology. User testing shows that the overall attitude towards the technique is positive and that it has a growth potential.
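
The abstract does not say which NLP techniques the prototype uses. As an illustration only, one common way to compare free-text reports is to turn each report into a bag of words and rank existing reports by cosine similarity to a new one. The sketch below assumes that approach; the stop-word list, the function names (`tokenize`, `cosine_similarity`, `rank_candidates`), and the sample reports are all hypothetical and are not taken from the paper or from the Sony Ericsson data.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "is", "in", "on", "when", "to", "of", "and"}


def tokenize(text):
    """Lowercase the text, split on non-alphanumerics, drop stop words."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower())
            if t not in STOP_WORDS]


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(new_report, existing_reports, top_n=3):
    """Return up to top_n (report_id, score) pairs, most similar first."""
    query = Counter(tokenize(new_report))
    scored = [
        (report_id, cosine_similarity(query, Counter(tokenize(text))))
        for report_id, text in existing_reports.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Made-up defect reports, not real data.
reports = {
    "DR-1": "Phone crashes when opening the camera application",
    "DR-2": "Battery drains quickly in standby mode",
    "DR-3": "Camera application crashes on startup",
}
ranking = rank_candidates("Crash when the camera application is opened", reports)
for report_id, score in ranking:
    print(report_id, round(score, 3))
```

In this sketch "crash" and "crashes" count as different terms because no stemming is applied; a variant with stemming or a different stop-word list would be one way to explore the kind of technique variants the abstract mentions. Presenting a short ranked list to a human, rather than deciding automatically, matches the abstract's framing of the tool as support for identification.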
