Detection of Duplicate Defect Reports Using Natural Language Processing


Abstract

Defect reports are generated from various testing and development activities in software engineering. Sometimes two reports are submitted that describe the same problem, leading to duplicate reports. These reports are mostly written in structured natural language, and as such, it is hard to compare two reports for similarity with formal methods. In order to identify duplicates, we investigate using Natural Language Processing (NLP) techniques to support the identification. A prototype tool is developed and evaluated in a case study analyzing defect reports at Sony Ericsson Mobile Communications. The evaluation shows that about 2/3 of the duplicates can possibly be found using the NLP techniques. Different variants of the techniques provide only minor result differences, indicating a robust technology. User testing shows that the overall attitude towards the technique is positive and that it has a growth potential.
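The abstract does not name the specific NLP techniques used. As a minimal illustrative sketch only, assuming a bag-of-words representation, a small hypothetical stop-word list, and cosine similarity as the ranking measure (none of which are confirmed by the text above), a new report could be compared against existing ones as follows. The report IDs and texts are invented for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "is", "in", "on", "to", "when", "and", "of", "after"}


def tokenize(text):
    """Lowercase the text, split on non-alphanumerics, and drop stop words."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOP_WORDS]


def cosine_similarity(text_a, text_b):
    """Cosine similarity between the term-frequency vectors of two texts."""
    va, vb = Counter(tokenize(text_a)), Counter(tokenize(text_b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def rank_candidates(new_report, existing, top_n=3):
    """Return the top_n (report_id, score) pairs most similar to new_report."""
    scored = [(cosine_similarity(new_report, text), rid) for rid, text in existing.items()]
    scored.sort(reverse=True)
    return [(rid, score) for score, rid in scored[:top_n]]


# Invented example data.
existing_reports = {
    "DR-1": "Phone crashes when opening the camera application",
    "DR-2": "Battery drains quickly in standby mode",
    "DR-3": "Camera application crashes the phone after startup",
}
new_report = "Crash in camera application on phone"

for report_id, score in rank_candidates(new_report, existing_reports, top_n=2):
    print(report_id, round(score, 3))
```

This sketch does no stemming, so "crash" and "crashes" count as different terms; a fuller pipeline would likely normalize word forms before comparison. Presenting a ranked short list rather than a single yes/no answer leaves the final duplicate decision to the person handling the report.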


Bibliographic details

Source
International Conference on Software Engineering | 2007 | pp. 499-510 | 12 pages
Authors
Per Runeson; Magnus Alexandersson; Oskar Nyholm
Original format: PDF
Chinese Library Classification: Computing technology, computer technology

Similar literature


1. Agreement between neuroimages and reports for natural language processing-based detection of silent brain infarcts and white matter disease [J] . Lester Y. Leung, Sunyang Fu, Patrick H. Luetmer, BMC Neurology . 2021, Issue 1

2. Automated Detection of Radiology Reports that Require Follow-up Imaging Using Natural Language Processing Feature Engineering and Machine Learning Classification [J] . Journal of digital imaging: the official journal of the Society for Computer Applications in Radiology . 2020, Issue 1

3. MalDy: Portable, data-driven malware detection using natural language processing and machine learning techniques on behavioral analysis reports [J] . Karbab ElMouatez Billah, Debbabi Mourad, Digital investigation . 2019, Issue APRa

4. Detection of Duplicate Defect Reports Using Natural Language Processing [C] . Runeson, Per, Alexandersson, . 2007

5. Qualitative information in annual reports & the detection of corporate fraud: A natural language processing perspective. [D] . Goel, Sunita. 2009

6. Controlled Vocabularies Indexing and Medical Language Processing. Medical Language Processing: Database Capture of Natural Language Echocardiographic Reports: A Unified Medical Language System Approach [O] . K. Canfield, B. Bray, S. Huff, 1989

7. Automated Detection of Measurements and Their Descriptors in Radiology Reports Using a Hybrid Natural Language Processing Algorithm [O] . Selen Bozkurt, Emel Alkim, Imon Banerjee, 2019
