首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >Automatic classification of scientific groups as productive: An approach based on motif analysis
【24h】

Automatic classification of scientific groups as productive: An approach based on motif analysis

机译:将科学群体自动分类为生产性:基于主题分析的方法

获取原文

摘要

One of the key aspects instrumental in the advancement of science relates to “team science,” or in other words “group” collaborations. There have been extensive studies analyzing various statistical properties of collaborations of individual or pairs of authors. However, the number of studies pertaining to groups/teams of scientists working together is limited in number. In this paper, we set an objective to study the productivity of group collaborations where groups are represented as small substructures usually termed as network motifs in the literature. A preliminary observation is that star-like motifs have the largest productivity (defined as a function of citation count) followed by 4-cliques. We then introduce a bunch of features and study their individual relations with the productivity of a team. Building on these observations, we develop a supervised classification model that can automatically distinguish the highly productive teams from the low productive ones based on the set of identified features. The accuracy of the classification is 82% on an average for all the motifs with the accuracy reaching as high as 95% for 4-cliques. Finally, we present a detailed analysis of the time-transition behavior of different motifs along with some of the real world highly productive motifs found in our dataset. This empirical study is a first step toward the development of a full-fledged recommendation system that can predict how productive a team would be in the future.
机译:促进科学进步的关键方面之一与“团队科学”有关,换句话说就是“小组”合作。已经进行了广泛的研究,分析了个人或成对作者合作的各种统计属性。但是,与一起工作的科学家小组/团队有关的研究数量是有限的。在本文中,我们设定了一个目标,以研究群体协作的生产力,其中群体以小的子结构表示,在文献中通常被称为网络主题。初步观察发现,星形图案的生产力最高(定义为引用次数的函数),其次是4位数。然后,我们介绍一些功能,并研究它们与团队生产力之间的个人关系。基于这些观察,我们开发了一种监督分类模型,该模型可以基于已识别的特征集自动将高生产率的团队与低生产率的团队区分开。所有主题的分类准确度平均为82%,而4个字型的准确度高达95%。最后,我们提供了对不同图案的时变行为的详细分析,以及在我们的数据集中发现的一些现实世界中高产的图案。这项经验研究是开发成熟的推荐系统的第一步,该系统可以预测团队未来的生产力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号