Predicting Code Hotspots in Open-Source Software from Object-Oriented Metrics Using Machine Learning

Rod Hilton; Ellen Gethner

首页> 外文期刊>International journal of software engineering and knowledge engineering >Predicting Code Hotspots in Open-Source Software from Object-Oriented Metrics Using Machine Learning

【24h】

Predicting Code Hotspots in Open-Source Software from Object-Oriented Metrics Using Machine Learning

机译：使用机器学习从面向对象的指标预测开源软件中的代码热点

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Software engineers are able to measure the quality of their code using a variety of metrics that can be derived directly from analyzing the source code. These internal quality metrics are valuable to engineers, but the organizations funding the software development effort find external quality metrics such as defect rates and time to develop features more valuable. Unfortunately, external quality metrics can only be calculated after costly software has been developed and deployed for end-users to utilize. Here, we present a method for mining data from freely available open source codebases written in Java to train a Random Forest classifier to predict which files are likely to be external quality hotspots based on their internal quality metrics with over 75% accuracy. We also used the trained model to predict hotspots for a Java project whose data was not used to train the classifier and achieved over 75% accuracy again, demonstrating the method's general applicability to different projects.

机译：软件工程师能够使用可以直接从源代码分析中得出的各种指标来衡量其代码的质量。这些内部质量指标对工程师来说很有价值，但是资助软件开发工作的组织发现外部质量指标，例如缺陷率和开发功能的时间更有价值。不幸的是，只有在开发并部署了昂贵的软件供最终用户使用之后，才能计算外部质量指标。在这里，我们提出了一种方法，该方法可从Java编写的免费开放源代码库中挖掘数据，以训练随机森林分类器根据其内部质量指标（其准确性超过75％）预测哪些文件可能是外部质量热点。我们还使用训练有素的模型来预测Java项目的热点，该Java项目的数据未用于训练分类器，并且再次达到了75％以上的准确性，这证明了该方法在不同项目中的普遍适用性。

著录项

来源
《International journal of software engineering and knowledge engineering》 |2018年第3期|311-331|共21页
作者
Rod Hilton; Ellen Gethner;
展开▼
作者单位

Department of Computer Science and Engineering University of Colorado Denver Denver, CO 80217, USA;

Department of Computer Science and Engineering University of Colorado Denver Denver, CO 80217, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Software engineering; software quality; quality metrics; machine learning; open source; object-oriented metrics; hotspot prediction; data mining; java; classification; random forest;

机译：软件工程;软件质量;质量指标;机器学习开源;面向对象的指标;热点预测;数据挖掘;java;分类;随机森林;

相似文献

外文文献
中文文献
专利

1. Predicting Code Smells and Analysis of Predictions: Using Machine Learning Techniques and Software Metrics [J] . Mohammad Y.Mhawish, Manjari Gupta 计算机科学技术学报（英文版） . 2020,第006期

机译：预测代码气味并进行预测分析：使用机器学习技术和软件指标
2. Predicting different levels of the unit testing effort of classes using source code metrics: a multiple case study on open-source software [J] . Fadel Toure, Mourad Badri, Luc Lamontagne Innovations in Systems and Software Engineering . 2018,第1期

机译：使用源代码指标预测类的不同层次测试努力：开源软件的多个案例研究
3. Detecting Design Patterns in Object-Oriented Program Source Code by Using Metrics and Machine Learning [J] . Satoru Uchiyama, Atsuto Kubo, Hironori Washizaki, Journal of Software Engineering and Applications . 2014,第12期

机译：通过使用度量和机器学习在面向对象的程序源代码中检测设计模式
4. Applying Machine Learning to Predict Software Fault Proneness Using Change Metrics, Static Code Metrics, and a Combination of Them [C] . Yasser Ali Alshehri, Katerina Goseva-Popstojanova, Dale G. Dzielski, SoutheastCon . 2018

机译：应用机器学习使用变更指标，静态代码指标以及它们的组合来预测软件故障倾向
5. Predicting open-source software quality using statistical and machine learning techniques. [D] . Phadke, Amit Ashok. 2004

机译：使用统计和机器学习技术预测开源软件的质量。
6. Glycemic-aware metrics and oversampling techniques for predicting blood glucose levels using machine learning [O] . Michael Mayo, Lynne Chepulis, Ryan G. Paul 2019

机译：使用机器学习预测血糖水平的血糖感知指标和过采样技术
7. Detecting Design Patterns in Object-Oriented Program Source Code by Using Metrics and Machine Learning [O] . Satoru Uchiyama, Atsuto Kubo, Hironori Washizaki, 2014

机译：通过使用指标和机器学习检测面向对象的程序源代码的设计模式

Predicting Code Hotspots in Open-Source Software from Object-Oriented Metrics Using Machine Learning

摘要

著录项

相似文献

相关主题

期刊订阅