首页> 外文会议>International Conference on Information and Communication Technology >Big data analytics on large-scale socio-technical software engineering archives
【24h】

Big data analytics on large-scale socio-technical software engineering archives

机译:大型社会技术软件工程档案中的大数据分析

获取原文

摘要

Given the fast growing nature of software engineering data in online software repositories and open source communities, it would be helpful to analyse these assets to discover valuable information about the software engineering development process and other related data. Big Data Analytics (BDA) techniques and frameworks can be applied on these data resources to achieve a high-performance and relevant data collection and analysis. Software engineering is a socio-technical process which needs development team collaboration and technical knowledge to develop a high-quality application. GitHub, as an online social coding foundation, contains valuable information about the software engineers' communications and project life cycles. In this paper, unsupervised data mining techniques are applied on the data collected by general Big Data approaches to analyse GitHub projects, source codes and interactions. Source codes and projects are clustered using features and metrics derived from historical data in repositories, object oriented programming metrics and the influences of developers on source codes.
机译:鉴于在线软件存储库和开放源代码社区中软件工程数据的快速增长性质,分析这些资产以发现有关软件工程开发过程和其他相关数据的有价值的信息将是有帮助的。大数据分析(BDA)技术和框架可以应用于这些数据资源,以实现高性能和相关的数据收集和分析。软件工程是一个社会技术过程,需要开发团队的协作和技术知识才能开发出高质量的应用程序。 GitHub作为在线社交编码基础,包含有关软件工程师的交流和项目生命周期的宝贵信息。在本文中,将无监督数据挖掘技术应用于通过常规大数据方法收集的数据,以分析GitHub项目,源代码和交互。源代码和项目使用从存储库中的历史数据得出的功能和指标,面向对象的编程指标以及开发人员对源代码的影响进行聚类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号