【24h】

Object Level Physics Data Replication in the Grid

机译:对象级别物理数据复制网格中的数据复制

获取原文

摘要

To support distributed physics analysis on a scale as foreseen by the LHC experiments, 'Grid' systems are needed that manage and streamline data distribution, replication, and synchronization. We report on the development of a tool that allows large physics datasets to be managed and replicated at the granularity level of single objects. Efficient and convenient support for data extraction and replication at the level of individual objects and events will enable for types of interactive data analysis that would be too inconvenient or costly to perform with tools that work on a file level only. Our tool development effort is intended as both a demonstrator project for various types of existing Grid technology, and as a research effort to develop Grid technology further. The basic use case supported by our tool is one in which a physicist repeatedly selects some physics objects located at a central repository, and replicates them to a local site. The selection can be done using 'tag' or 'ntuple' analysis at the local site. The tool replicates the selected objects, and merges all replicated objects into a single single coherent 'virtual' dataset. This allows all objects to be used together seamlessly, even if they were replicated at different times or from different locations. The version of the tool that is reported on in this paper replicates ORCA based physics data created by CMS in its ongoing high level trigger design studies. The basic capabilities and limitations of the tool are discussed, together with some performance results. Some tool internals are also presented. Finally we will report on experiences so far and on future plans.
机译:为了支持LHC实验预见的分布式物理分析,需要管理和简化数据分发,复制和同步的“网格”系统。我们报告了一个允许在单个对象的粒度级别管理和复制大量物理数据集的工具的开发。在各个对象和事件级别的数据提取和复制的高效和方便的支持将为交互式数据分析的类型提供,这对于仅在文件级别工作的工具来执行太不方便或昂贵。我们的工具开发工作旨在作为各种类型的现有网格技术的演示项目,并作为进一步开发网格技术的研究努力。我们工具支持的基本用例是物理学家重复选择位于中央存储库的某些物理对象,并将其复制到本地站点。可以使用本地站点的“标签”或“NTUPLE”分析来完成选择。该工具复制所选对象,并将所有复制的对象合并到单个连贯的“虚拟”数据集中。这允许无缝地一起使用的所有对象,即使它们在不同时间或来自不同位置复制。本文报告的工具的版本在其持续的高级触发设计研究中重复了CMS创建的基于ORCA的物理数据。讨论工具的基本功能和限制以及一些性能结果。还提供了一些工具内部。最后,我们将报告到目前为止和未来计划的经验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号