首页> 外文会议>Mining Complex Data >Using Text Mining and Link Analysis for Software Mining
【24h】

Using Text Mining and Link Analysis for Software Mining

机译:使用文本挖掘和链接分析进行软件挖掘

获取原文
获取原文并翻译 | 示例

摘要

Many data mining techniques are these days in use for ontology learning - text mining, Web mining, graph mining, link analysis, relational data mining, and so on. In the current state-of-the-art bundle there is a lack of "software mining" techniques. This term denotes the process of extracting knowledge out of source code. In this paper we approach the software mining task with a combination of text mining and link analysis techniques. We discuss how each instance (i.e. a programming construct such as a class or a method) can be converted into a feature vector that combines the information about how the instance is interlinked with other instances, and the information about its (textual) content. The so-obtained feature vectors serve as the basis for the construction of the domain ontology with OntoGen, an existing system for semi-automatic data-driven ontology construction.
机译:如今,许多数据挖掘技术已用于本体学习-文本挖掘,Web挖掘,图形挖掘,链接分析,关系数据挖掘等等。在当前最新的捆绑软件中,缺少“软件挖掘”技术。该术语表示从源代码中提取知识的过程。在本文中,我们结合了文本挖掘和链接分析技术来处理软件挖掘任务。我们讨论了如何将每个实例(即诸如类或方法之类的编程构造)转换为特征向量,该特征向量将有关该实例如何与其他实例互连的信息及其有关(文本)内容的信息组合在一起。如此获得的特征向量作为使用OntoGen构建领域本体的基础,该系统是现有的半自动数据驱动的本体构建系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号