Software Clustering Using Automated Feature Subset Selection

机译：使用自动特征子集选择的软件聚类

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper proposes a feature selection technique for software clustering which can be used in the architecture recovery of software systems. The recovered architecture can then be used in the subsequent phases of software maintenance, reuse and re-engineering. A number of diverse features could be extracted from the source code of software systems, however, some of the extracted features may have less information to use for calculating the entities, which result in dropping the quality of software clusters. Therefore, further research is required to select those features which have high relevancy in finding associations between entities. In this article first we propose a supervised feature selection technique for unlabeled data, and then we apply this technique for software clustering. A number of feature subset selection techniques in software architecture recovery have been proposed. However none of them focus on automated feature selection in this domain. Experimental results on three software test systems reveal that our proposed approach produces results which are closer to the decompositions prepared by human experts, as compared to those discovered by the well-known K-Means algorithm.

机译：本文提出了一种用于软件群集的特征选择技术，可用于软件系统的架构恢复。然后可以在软件维护，重用和重新设计的后续阶段中使用恢复的架构。可以从软件系统的源代码中提取许多不同的特征，然而，一些提取的特征可以具有用于计算实体的信息较少的信息，从而导致丢弃软件集群的质量。因此，需要进一步的研究来选择在发现实体之间的关联方面具有高相关性的这些特征。在本文中，首先我们提出了一个用于未标记数据的监督功能选择技术，然后我们应用此技术进行软件群集。已经提出了许多软件架构恢复中的特征子集选择技术。但是，它们都不关注该域中的自动功能选择。三种软件测试系统的实验结果表明，与通过众所周知的K-Mean算法发现的那些相比，我们所提出的方法产生更接近人类专家编制的分解的结果。

著录项

来源
《International conference on advanced data mining and applications》|2013年||共12页
会议地点
作者
Zubair Shah; Rashid Naseem; Mehmet A. Orgun; Abdun Mahmood; Sara Shahzad;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词
Software Clustering; Feature Selection; K-Means;

机译：软件聚类;特征选择;k均值;

相似文献

外文文献
中文文献
专利

1. An Automated Parameter Selection Approach for Simultaneous Clustering and Feature Selection [J] . Dinesh Kumar, Jitender Kumar Chhabra, Vijay kumar Journal of Engineering Research . 2016,第2期

机译：同时进行聚类和特征选择的自动参数选择方法
2. An Empirical Investigation of Combining Filter-Based Feature Subset Selection and Data Sampling for Software Defect Prediction [J] . Kehan Gao, Taghi M. Khoshgoftaar, Amri Napolitano International Journal of Reliability, Quality and Safety Engineering . 2015,第6期

机译：基于滤波器的特征子集选择和数据采样相结合进行软件缺陷预测的实证研究
3. Aggregating Data Sampling with Feature Subset Selection to Address Skewed Software Defect Data [J] . Kehan Gao, Taghi M. Khoshgoftaar, Amri Napolitano International journal of software engineering and knowledge engineering . 2015,第9a10期

机译：聚合具有特征子集选择的数据采样以解决歪斜的软件缺陷数据
4. Software Clustering Using Automated Feature Subset Selection [C] . Zubair Shah, Rashid Naseem, Mehmet A. Orgun, International conference on advanced data mining and applications . 2013

机译：使用自动特征子集选择的软件集群
5. Feature Selection Via Random Subsets of Uncorrelated Features [D] . Long, Dang Kim. 2020

机译：通过无相关的功能的随机子集选择功能选择
6. Application of feature selection methods for automated clustering analysis: a review on synthetic datasets [O] . Aliyu Usman Ahmad, Andrew Starkey -1

机译：特征选择方法在自动聚类分析中的应用：综述综合数据集
7. Software Defect Prediction Based on Feature Subset Selection and Ensemble Classification [O] . Ahmad A Saifan, Lina Abu-wardih 2020

机译：基于特征子集选择和集合分类的软件缺陷预测

Software Clustering Using Automated Feature Subset Selection

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅