Better Data Labelling With EMBLEM (and how that Impacts Defect Prediction)

Huy Tu; Zhe Yu; Tim Menzies

首页> 外文期刊>IEEE Transactions on Software Engineering >Better Data Labelling With EMBLEM (and how that Impacts Defect Prediction)

【24h】

Better Data Labelling With EMBLEM (and how that Impacts Defect Prediction)

机译：Better Data Labelling With EMBLEM (and how that Impacts Defect Prediction)

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

Standard automatic methods for recognizing problematic development commits can be greatly improved via the incremental application of human+artificial expertise. In this approach, called EMBLEM, an AI tool first explore the software development process to label commits that are most problematic. Humans then apply their expertise to check those labels (perhaps resulting in the AI updating the support vectors within their SVM learner). We recommend this human+AI partnership, for several reasons. When a new domain is encountered, EMBLEM can learn better ways to label which comments refer to real problems. Also, in studies with 9 open source software projects, labelling via EMBLEM's incremental application of human+AI is at least an order of magnitude cheaper than existing methods ($approx$≈ eight times). Further, EMBLEM is very effective. For the data sets explored here, EMBLEM better labelling methods significantly improved $P_{opt}20$Popt20 and G-scores performance in nearly all the projects studied here.

著录项

来源
《IEEE Transactions on Software Engineering》 |2022年第1期|278-294|共17页
作者
Huy Tu; Zhe Yu; Tim Menzies;
展开▼
作者单位

Computer Science, North Carolina State University College of Engineering, Raleigh, NC, USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类程序设计;计算机软件;
关键词
Labeling; Computer bugs; Data models; Software; Support vector machines; Standards; Task analysis; Human-in-the-loop AI; data labelling; defect prediction; software analytics;
入库时间 2024-01-25 00:09:25

Better Data Labelling With EMBLEM (and how that Impacts Defect Prediction)

摘要

著录项

相关主题

期刊订阅