首页> 外文OA文献 >Analysis and Modeling of Social In uence in High Performance Computing Workloads
【2h】

Analysis and Modeling of Social In uence in High Performance Computing Workloads

机译:高性能计算工作量中的社交影响力分析和建模

摘要

High Performance Computing (HPC) is becoming a common tool in many researchareas. Social influence (e.g., project collaboration) among increasing users of HPCsystems creates bursty behavior in underlying workloads. This bursty behavior isincreasingly common with the advent of grid computing and cloud computing. Miningthe user bursty behavior is important for HPC workloads prediction and scheduling,which has direct impact on overall HPC computing performance.A representative work in this area is the Mixed User Group Model (MUGM),which clusters users according to the resource demand features of their submissions,such as duration time and parallelism. However, MUGM has some difficulties whenimplemented in real-world system. First, representing user behaviors by the featuresof their resource demand is usually difficult. Second, these features are not alwaysavailable. Third, measuring the similarities among users is not a well-defined problem.In this work, we propose a Social Influence Model (SIM) to identify, analyze,and quantify the level of social influence across HPC users. The advantage of theSIM model is that it finds HPC communities by analyzing user job submission time, thereby avoiding the difficulties of MUGM. An offline algorithm and a fast-converging,computationally-efficient online learning algorithm for identifying social groups areproposed. Both offline and online algorithms are applied on several HPC and gridworkloads, including Grid 5000, EGEE 2005 and 2007, and KAUST SupercomputingLab (KSL) BGP data. From the experimental results, we show the existence of a socialgraph, which is characterized by a pattern of dominant users and followers. In orderto evaluate the effectiveness of identified user groups, we show the pattern discoveredby the offline algorithm follows a power-law distribution, which is consistent withthose observed in mainstream social networks. We finally conclude the thesis anddiscuss future directions of our work.
机译:高性能计算(HPC)成为许多研究领域中的通用工具。越来越多的HPC系统用户之间的社会影响力(例如项目协作)会在基础工作负载中产生突发性行为。随着网格计算和云计算的出现,这种突发行为越来越普遍。挖掘用户突发行为对于HPC工作负载的预测和调度很重要,这直接影响整个HPC计算性能。该领域的代表工作是混合用户组模型(MUGM),它根据用户的资源需求特征对其进行聚类提交,例如持续时间和并行性。但是,MUGM在现实系统中实施时存在一些困难。首先,通常很难用用户的资源需求特征来表现他们的行为。其次,这些功能并非始终可用。第三,衡量用户之间的相似性并不是一个明确的问题。在这项工作中,我们提出了一种社会影响力模型(SIM),以识别,分析和量化整个HPC用户的社会影响力水平。 SIM模型的优点是它可以通过分析用户作业提交时间来找到HPC社区,从而避免了MUGM的麻烦。提出了一种用于识别社会群体的离线算法和一种快速收敛,计算效率高的在线学习算法。离线算法和在线算法都适用于多种HPC和网格工作负载,包括Grid 5000,EGEE 2005和2007以及KAUST SupercomputingLab(KSL)BGP数据。从实验结果中,我们显示了社交图的存在,该社交图的特征在于主导用户和关注者的模式。为了评估确定的用户组的有效性,我们展示了离线算法发现的模式遵循幂律分布,这与主流社交网络中观察到的模式一致。最后我们总结了论文并讨论了我们的工作未来的方向。

著录项

  • 作者

    Zheng Shuai;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号