【24h】

EMF Patterns of Usage on GitHub

机译:GitHub上的EMF使用模式

获取原文

摘要

Mining software repositories is a common activity in software engineering with diverse use cases such as understanding project quality, technology usage, and developer profiles. Such mining activities involve, more often than not, a phase for data extraction from the source code in the repository with recurring tasks such as processing the folder structure (possibly on the timeline), classifying repository artifacts (e.g., in terms of the languages or technologies used), and extracting facts from the artifacts by parsing or otherwise. We describe a new approach for such data extraction; its key pillar is a declarative rule-based language for the uniform, inference-based extraction of facts from the repository (the file system), the artifacts in the repository (their content), and previously extracted facts. All inferred facts are maintained in a triple store. We describe a case study for the purpose of understanding the usage of EMF. To this end, we describe an emerging catalog of patterns of using EMF in repositories and we detect these patterns on GitHub. In our implementation, we use Apache Jena for which we provide dedicated language support tailored towards mining software repositories.
机译:挖掘软件存储库是软件工程中的常见活动,具有多种用例,例如了解项目质量,技术使用情况和开发人员资料。此类挖掘活动通常涉及一个阶段,该阶段用于从存储库中的源代码中提取数据,并具有重复性任务,例如处理文件夹结构(可能在时间轴上),对存储库工件进行分类(例如,根据语言或技术),并通过解析或其他方式从工件中提取事实。我们描述了一种用于这种数据提取的新方法。它的关键支柱是一种基于声明的基于规则的语言,用于从存储库(文件系统),存储库中的工件(其内容)以及以前提取的事实中统一,基于推理的事实提取。所有推断的事实都保存在三元组中。我们描述了一个案例研究,旨在了解EMF的用法。为此,我们描述了在存储库中使用EMF的新兴模式目录,并在GitHub上检测到了这些模式。在我们的实现中,我们使用Apache Jena,为此我们提供了专门针对挖掘软件存储库的语言支持。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号