首页> 外文会议>Annual petascale data storage workshop >Fusing Data Management Services with File Systems
【24h】

Fusing Data Management Services with File Systems

机译:使用文件系统融合数据管理服务

获取原文

摘要

File systems are the backbone of large-scale data processing for scientific applications. Motivated by the need to provide an extensible and flexible framework beyond the ions provided by API libraries for files to manage and analyze large-scale data, we are developing Damasc, an enhanced file system where rich data management services for scientific computing are provided as a native part of the file system. This paper presents our vision for Damasc, a performant file system that would allow scientists or even casual users to pose declarative queries and updates over views of underlying files that are stored in their native bytestream format. In Damasc, a configurable layer is added on top of the file system to expose the contents of files in a logical data model through which views can be defined and used for queries and updates. The logical data model and views are leveraged to optimize access to files through caching and self-organizing indexing. In addition, provenance capture and analysis to file access is also built into Damasc. We describe the salient features of our proposal and discuss how it can benefit the development of scientific code.
机译:文件系统是科学应用的大规模数据处理的骨干。有必要提供超出API库提供的API库提供的可扩展和灵活的框架,用于管理和分析大规模数据,我们正在开发Damasc,这是一个增强的文件系统,提供了用于科学计算的丰富数据管理服务文件系统的本机部分。本文介绍了我们对Damasc的愿景,这是一个表演文件系统,允许科学家们甚至临时用户造成声明性查询并更新底层文件的视图,这些文件存储在其本机ByteSteam格式中。在大凡士体中,在文件系统的顶部添加可配置层,以暴露逻辑数据模型中的文件内容,通过该应用程序可以通过该视图来定义和用于查询和更新。利用逻辑数据模型和视图,通过缓存和自组织索引来优化对文件的访问。此外,还构建了对文件访问的出处捕获和分析也被建立在达摩集中。我们描述了我们提案的显着特征,并讨论了如何将其有益于科学规范的发展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号