首页> 外文会议>International conference on advanced data mining and applications >Software Clustering Using Automated Feature Subset Selection
【24h】

Software Clustering Using Automated Feature Subset Selection

机译:使用自动特征子集选择的软件聚类

获取原文

摘要

This paper proposes a feature selection technique for software clustering which can be used in the architecture recovery of software systems. The recovered architecture can then be used in the subsequent phases of software maintenance, reuse and re-engineering. A number of diverse features could be extracted from the source code of software systems, however, some of the extracted features may have less information to use for calculating the entities, which result in dropping the quality of software clusters. Therefore, further research is required to select those features which have high relevancy in finding associations between entities. In this article first we propose a supervised feature selection technique for unlabeled data, and then we apply this technique for software clustering. A number of feature subset selection techniques in software architecture recovery have been proposed. However none of them focus on automated feature selection in this domain. Experimental results on three software test systems reveal that our proposed approach produces results which are closer to the decompositions prepared by human experts, as compared to those discovered by the well-known K-Means algorithm.
机译:本文提出了一种用于软件群集的特征选择技术,可用于软件系统的架构恢复。然后可以在软件维护,重用和重新设计的后续阶段中使用恢复的架构。可以从软件系统的源代码中提取许多不同的特征,然而,一些提取的特征可以具有用于计算实体的信息较少的信息,从而导致丢弃软件集群的质量。因此,需要进一步的研究来选择在发现实体之间的关联方面具有高相关性的这些特征。在本文中,首先我们提出了一个用于未标记数据的监督功能选择技术,然后我们应用此技术进行软件群集。已经提出了许多软件架构恢复中的特征子集选择技术。但是,它们都不关注该域中的自动功能选择。三种软件测试系统的实验结果表明,与通过众所周知的K-Mean算法发现的那些相比,我们所提出的方法产生更接近人类专家编制的分解的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号