首页> 外文会议>Hellenic Conference on Artificial Intelligence >Combining Clustering and Classification for Software Quality Evaluation
【24h】

Combining Clustering and Classification for Software Quality Evaluation

机译:组合集群和分类软件质量评估

获取原文

摘要

Source code and metric mining have been used to successfully assist with software quality evaluation. This paper presents a data mining approach which incorporates clustering Java classes, as well as classifying extracted clusters, in order to assess internal software quality. We use Java classes as entities and static metrics as attributes for data mining. We identify outliers and apply K-means clustering in order to establish clusters of classes. Outliers indicate potentially fault prone classes, whilst clusters are examined so that we can establish common characteristics. Subsequently, we apply C4.5 to build classification trees for identifying metrics which determine cluster membership. We evaluate the proposed approach with two well known open source software systems, Jedit and Apache Geronimo. Results have consolidated key findings from previous work and indicated that combining clustering with classification produces better results than stand alone clustering.
机译:源代码和公制挖掘已被用于成功协助软件质量评估。本文介绍了一种包含群集Java类的数据挖掘方法,以及分类提取的集群,以便评估内部软件质量。我们将Java类作为实体和静态度量作为数据挖掘的属性。我们识别异常值并应用K-Means Clustering以建立课堂集群。异常值表明潜在的易于容易阶层,而群体被检查,以便我们建立共同的特征。随后,我们应用C4.5来构建用于识别确定群集成员资格的度量的分类树。我们评估了两个众所周知的开源软件系统,JEDIT和Apache Geronimo的提出方法。结果已从以前的工作中巩固了关键结果,并指出将聚类与分类组合产生比单独聚类更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号