Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering


Abstract

Tracking user-reported bugs requires considerable engineering effort to go through many repetitive reports and assign them to the correct teams. This paper proposes a neural architecture that can jointly (1) detect whether two bug reports are duplicates, and (2) aggregate them into latent topics. Leveraging the assumption that learning the topic of a bug is a sub-task of detecting duplicates, we design a loss function that can jointly perform both tasks but needs supervision only for duplicate classification, achieving topic clustering in an unsupervised fashion. We use a two-step attention module that uses self-attention for topic clustering and conditional attention for duplicate detection. We study the characteristics of two types of real-world datasets that have been marked for duplicate bugs by engineers and by nontechnical annotators. The results demonstrate that our model not only can outperform state-of-the-art methods for duplicate classification in both cases, but can also learn meaningful latent clusters without additional supervision.
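The abstract gives no equations, so the following is only a rough sketch of how a two-step attention forward pass with a single supervised loss might be wired together. It is not the authors' architecture. Every name here (`self_attention_pool`, `topic_distribution`, `conditional_attention_pool`, `duplicate_probability`, `duplicate_loss`), the dot-product scoring, and the way the two scores are summed are assumptions made for illustration. The point it tries to show is that the latent topic distribution sits on the path to the duplicate prediction, so a loss defined only on duplicate labels could still shape the topics during training. Training itself is not shown.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def weighted_sum(weights, vectors):
    dim = len(vectors[0])
    return [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]


def self_attention_pool(tokens, query):
    """Step 1 (assumed form): pool token vectors using a report-independent query."""
    return weighted_sum(softmax([dot(t, query) for t in tokens]), tokens)


def topic_distribution(pooled, topic_vectors):
    """Soft assignment of one report to K latent topics (hypothetical parameterization)."""
    return softmax([dot(pooled, t) for t in topic_vectors])


def conditional_attention_pool(tokens, condition):
    """Step 2 (assumed form): re-pool one report's tokens, conditioned on the other report."""
    return weighted_sum(softmax([dot(t, condition) for t in tokens]), tokens)


def duplicate_probability(tokens_a, tokens_b, query, topic_vectors):
    """Forward pass for one pair; returns (probability, topics_a, topics_b)."""
    pooled_a = self_attention_pool(tokens_a, query)
    pooled_b = self_attention_pool(tokens_b, query)
    topics_a = topic_distribution(pooled_a, topic_vectors)
    topics_b = topic_distribution(pooled_b, topic_vectors)
    cond_a = conditional_attention_pool(tokens_a, pooled_b)
    cond_b = conditional_attention_pool(tokens_b, pooled_a)
    # Assumption: topic agreement and conditional similarity are simply added.
    score = dot(topics_a, topics_b) + dot(cond_a, cond_b)
    return sigmoid(score), topics_a, topics_b


def duplicate_loss(prob, label):
    """Binary cross-entropy on the duplicate label, the only supervised signal here."""
    eps = 1e-12
    return -(label * math.log(prob + eps) + (1 - label) * math.log(1 - prob + eps))


if __name__ == "__main__":
    random.seed(0)

    def rand_vecs(n, d):
        return [[random.uniform(-1, 1) for _ in range(d)] for _ in range(n)]

    report_a, report_b = rand_vecs(5, 4), rand_vecs(6, 4)  # toy token embeddings
    query, topics = rand_vecs(1, 4)[0], rand_vecs(3, 4)    # toy parameters, K = 3
    prob, t_a, t_b = duplicate_probability(report_a, report_b, query, topics)
    print(prob, t_a, t_b, duplicate_loss(prob, 1))
```

In this sketch the topic vectors receive no labels of their own; under a gradient-based optimizer they would be updated only through `duplicate_loss`, which is one possible reading of "supervision only for duplicate classification".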


Bibliographic details

Source: Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019, pp. 157-165 (9 pages)
Authors: Lahari Poddar; Leonardo Neves; William Brendel; Luis Marujo; Sergey Tulyakov; Pradeep Karuturi
Original format: PDF

Similar literature

1. Lung cancer detection from CT image using improved profuse clustering and deep learning instantaneously trained neural networks [J]. Shakeel P. Mohamed, Burhanuddin M. A., Desa Mohamad Ishak. Measurement, 2019.
2. Efficient feature extraction model for validation performance improvement of duplicate bug report detection in software bug triage systems [J]. Neysiani Behzad Soleimani, Babamir Seyed Morteza, Aritsugi Masayoshi. Information and Software Technology, 2020, issue Octa.
3. Supervised Extended ART - A Fast Neural Network Classifier Trained by Combining Supervised and Unsupervised Learning [J]. Lee HM., Lai CS. Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies, 1996, issue 2.
4. Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering [C]. Lahari Poddar, Leonardo Neves, William Brendel. Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019.
5. A contextual approach towards more accurate duplicate bug report detection [D]. Alipour, Anahita. 2013.
6. Speech reconstruction using a deep partially supervised neural network [O]. Ian McLoughlin, Jingjie Li, Yan Song. 2017.
7. Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering [O]. Lahari Poddar, Leonardo Neves, William Brendel. 2019.
8. Ischemia Detection Using Supervised Learning for Hierarchical Neural Networks Based on Kohonen-Maps [R]. Vladutu, L., Papadimitriou, S., Mauroudi, S. 2001.
