首页> 外文期刊>Empirical Software Engineering >Why and how developers fork what from whom in GitHub
【24h】

Why and how developers fork what from whom in GitHub

机译:开发人员为什么以及如何从GitHub中从谁那里得到什么

获取原文
获取原文并翻译 | 示例
           

摘要

Forking is the creation of a new software repository by copying another repository. Though forking is controversial in traditional open source software (OSS) community, it is encouraged and is a built-in feature in GitHub. Developers freely fork repositories, use codes as their own and make changes. A deep understanding of repository forking can provide important insights for OSS community and GitHub. In this paper, we explore why and how developers fork what from whom in GitHub. We collect a dataset containing 236,344 developers and 1,841,324 forks. We make surveys, and analyze programming languages and owners of forked repositories. Our main observations are: (1) Developers fork repositories to submit pull requests, fix bugs, add new features and keep copies etc. Developers find repositories to fork from various sources: search engines, external sites (e.g., Twitter, Reddit), social relationships, etc. More than 42 % of developers that we have surveyed agree that an automated recommendation tool is useful to help them pick repositories to fork, while more than 44.4 % of developers do not value a recommendation tool. Developers care about repository owners when they fork repositories. (2) A repository written in a developer's preferred programming language is more likely to be forked. (3) Developers mostly fork repositories from creators. In comparison with unattractive repository owners, attractive repository owners have higher percentage of organizations, more followers and earlier registration in GitHub. Our results show that forking is mainly used for making contributions of original repositories, and it is beneficial for OSS community. Moreover, our results show the value of recommendation and provide important insights for GitHub to recommend repositories.
机译:分叉是通过复制另一个存储库来创建新的软件存储库。尽管分叉在传统的开源软件(OSS)社区中引起争议,但它受到鼓励,并且是GitHub中的内置功能。开发人员可以自由地存储库,使用代码作为自己的代码并进行更改。对存储库分支的深入了解可以为OSS社区和GitHub提供重要的见解。在本文中,我们探讨了为什么以及如何开发人员在GitHub中从谁那里得到什么。我们收集了一个包含236,344个开发人员和1,841,324个fork的数据集。我们进行调查,并分析编程语言和分支存储库的所有者。我们的主要观察结果是:(1)开发人员分叉存储库以提交请求,修复错误,添加新功能并保留副本等。开发人员从各种来源找到存储库以分叉:搜索引擎,外部站点(例如Twitter,Reddit),社交我们调查过的开发人员中,有42%以上的开发人员同意自动推荐工具可帮助他们选择要存储的存储库,而超过44.4%的开发人员不喜欢推荐工具。开发人员在分叉存储库时会关心存储库所有者。 (2)用开发人员首选的编程语言编写的存储库更有可能被分叉。 (3)开发人员通常会从创建者那里分叉存储库。与没有吸引力的存储库所有者相比,有吸引力的存储库所有者拥有更高的组织百分比,更多的关注者以及在GitHub中的早期注册。我们的结果表明,分叉主要用于原始存储库的贡献,并且对OSS社区有利。此外,我们的结果显示了推荐的价值,并为GitHub推荐存储库提供了重要的见解。

著录项

  • 来源
    《Empirical Software Engineering》 |2017年第1期|547-578|共32页
  • 作者单位

    Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China;

    Singapore Management Univ, Sch Informat Syst, Singapore, Singapore;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China;

    Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China;

    Singapore Management Univ, Sch Informat Syst, Singapore, Singapore;

    Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Fork; Open source software; GitHub;

    机译:Fork;开源软件;GitHub;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号