首页> 外文会议>IEEE/ACM International Conference on Mining Software Repositories >git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories
【24h】

git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories

机译:git2net-从大型git仓库中挖掘带有时间戳的联合编辑网络

获取原文

摘要

Data from software repositories have become an important foundation for the empirical study of software engineering processes. A recurring theme in the repository mining literature is the inference of developer networks capturing e.g. collaboration, coordination, or communication, from the commit history of projects. Most of the studied networks are based on the co-authorship of software artefacts defined at the level of files, modules, or packages. While this approach has led to insights into the social aspects of software development, it neglects detailed information on code changes and code ownership, e.g. which exact lines of code have been authored by which developers, that is contained in the commit log of software projects. Addressing this issue, we introduce git2net, a scalable python software that facilitates the extraction of fine-grained co-editing networks in large git repositories. It uses text mining techniques to analyse the detailed history of textual modifications within files. This information allows us to construct directed, weighted, and time-stamped networks, where a link signifies that one developer has edited a block of source code originally written by another developer. Our tool is applied in case studies of an Open Source and a commercial software project. We argue that it opens up a massive new source of high-resolution data on human collaboration patterns.
机译:来自软件存储库的数据已成为对软件工程过程进行实证研究的重要基础。存储库挖掘文献中反复出现的主题是推断开发人员网络捕获例如项目提交历史中的协作,协调或沟通。大多数研究的网络都基于在文件,模块或软件包级别定义的软件伪像的共同著作。虽然这种方法导致了对软件开发的社会方面的见解,但它忽略了有关代码更改和代码所有权的详细信息,例如:哪些开发人员编写了哪些确切的代码行,这些代码行包​​含在软件项目的提交日志中。为了解决这个问题,我们引入了git2net,这是一个可扩展的python软件,可帮助提取大型git存储库中的细粒度共同编辑网络。它使用文本挖掘技术来分析文件中文本修改的详细历史记录。该信息使我们能够构建定向的,加权的和带时间戳的网络,其中的链接表示一个开发人员已经编辑了另一位开发人员最初编写的源代码块。我们的工具用于开源和商业软件项目的案例研究。我们认为,这为人类协作模式开辟了高分辨率数据的巨大新来源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号