首页> 外文会议>Southeastcon, 2012 Proceedings of IEEE >Aspect mining using model-based clustering
【24h】

Aspect mining using model-based clustering

机译:使用基于模型的聚类进行方面挖掘

获取原文
获取原文并翻译 | 示例

摘要

Legacy systems contain critical and complex business code that has been in use for a long time. This code is difficult to understand, maintain, and evolve, in large part due to crosscutting concerns: software system features, such as persistence, logging, and error handling, whose implementation is spread across multiple modules. Aspect-oriented techniques separate crosscutting concerns from the base code, using separate modules called aspects and, thus, simplify the legacy code. Aspect mining techniques identify aspect candidates so that the legacy code can then be refactored into aspects. This study shows that model-based clustering using a carefully selected vector-space of features can be more effective than extant aspect mining methods based on heuristic methods as such hierarchical or partitional clustering. Three model-based algorithms were experimentally compared against existing heuristic methods, such as k-means clustering and agglomerative hierarchical clustering, using six different vector-space models. Model-based algorithms performed better in not spreading the methods of the concerns across the multiple clusters and were significantly better at partitioning the data such that, given an ordered list of clusters, fewer clusters and methods were needed to be analyzed to find all the concerns. In addition, model-based algorithms automatically determined the optimal number of clusters, a great advantage over the heuristic-based algorithms. Lastly, the newly defined vector-space models performed better, relative to aspect mining, than the previously defined vector-space models
机译:旧版系统包含已使用了很长时间的关键和复杂的业务代码。该代码很难理解,维护和发展,这在很大程度上是由于横切关注点:软件系统功能(例如持久性,日志记录和错误处理),其实现分布在多个模块中。面向方面的技术使用称为方面的单独模块将横切关注点与基本代码分开,从而简化了遗留代码。方面挖掘技术可以识别方面候选者,以便可以将遗留代码重构为方面。这项研究表明,使用经过精心选择的特征向量空间进行的基于模型的聚类比基于启发式方法(例如分层或分区聚类)的现有方面挖掘方法更为有效。使用六个不同的向量空间模型,将三种基于模型的算法与现有的启发式方法(例如k均值聚类和聚集层次聚类)进行了实验比较。基于模型的算法在不将关注点方法分散到多个聚类中的情况下表现更好,并且在划分数据方面也显着更好,因此,在给出聚类的有序列表的情况下,需要分析较少的聚类和方法来查找所有关注点。此外,基于模型的算法会自动确定最佳的聚类数量,这比基于启发式的算法有很大优势。最后,相对于方面挖掘,新定义的向量空间模型比先前定义的向量空间模型表现更好

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号