首页> 美国卫生研究院文献>Springer Open Choice >DCMS: A data analytics and management system for molecular simulation
【2h】

DCMS: A data analytics and management system for molecular simulation

机译:DCMS:用于分子模拟的数据分析和管理系统

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Molecular Simulation (MS) is a powerful tool for studying physical/chemical features of large systems and has seen applications in many scientific and engineering domains. During the simulation process, the experiments generate a very large number of atoms and intend to observe their spatial and temporal relationships for scientific analysis. The sheer data volumes and their intensive interactions impose significant challenges for data accessing, managing, and analysis. To date, existing MS software systems fall short on storage and handling of MS data, mainly because of the missing of a platform to support applications that involve intensive data access and analytical process. In this paper, we present the database-centric molecular simulation (DCMS) system our team developed in the past few years. The main idea behind DCMS is to store MS data in a relational database management system (DBMS) to take advantage of the declarative query interface (i.e., SQL), data access methods, query processing, and optimization mechanisms of modern DBMSs. A unique challenge is to handle the analytical queries that are often compute-intensive. For that, we developed novel indexing and query processing strategies (including algorithms running on modern co-processors) as integrated components of the DBMS. As a result, researchers can upload and analyze their data using efficient functions implemented inside the DBMS. Index structures are generated to store analysis results that may be interesting to other users, so that the results are readily available without duplicating the analysis. We have developed a prototype of DCMS based on the PostgreSQL system and experiments using real MS data and workload show that DCMS significantly outperforms existing MS software systems. We also used it as a platform to test other data management issues such as security and compression.
机译:分子模拟(MS)是研究大型系统的物理/化学特征的强大工具,并且已在许多科学和工程领域中得到了应用。在模拟过程中,实验会生成大量原子,并打算观察它们的时空关系以进行科学分析。庞大的数据量及其密集的交互作用对数据访问,管理和分析提出了重大挑战。迄今为止,现有的MS软件系统在MS数据的存储和处理方面还很欠缺,这主要是因为缺少一个平台来支持涉及大量数据访问和分析过程的应用程序。在本文中,我们介绍了我们团队在过去几年中开发的以数据库为中心的分子模拟(DCMS)系统。 DCMS的主要思想是将MS数据存储在关系数据库管理系统(DBMS)中,以利用声明性查询接口(即SQL),数据访问方法,查询处理和现代DBMS的优化机制。一个独特的挑战是处理通常需要大量计算的分析查询。为此,我们开发了新颖的索引和查询处理策略(包括在现代协处理器上运行的算法)作为DBMS的集成组件。结果,研究人员可以使用DBMS内部实现的高效功能上传和分析数据。生成索引结构来存储其他用户可能感兴趣的分析结果,以便可以在不重复分析的情况下轻松获得结果。我们已经开发了基于PostgreSQL系统的DCMS原型,并且使用实际MS数据和工作量进行的实验表明DCMS明显优于现有的MS软件系统。我们还将它用作测试其他数据管理问题(例如安全性和压缩)的平台。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号