PLoS Computational Biology

Ten Simple Rules for Digital Data Storage

Abstract

Data is the central currency of science, but the nature of scientific data has changed dramatically with the rapid pace of technology. This change has led to the development of a wide variety of data formats, dataset sizes, data complexity, data use cases, and data sharing practices. Improvements in high-throughput DNA sequencing, sustained institutional support for large sensor networks [1,2], and sky surveys with large-format digital cameras [3] have created massive quantities of data. At the same time, the combination of increasingly diverse research teams [4] and data aggregation in portals (e.g., for biodiversity data, GBIF.org or iDigBio) necessitates increased coordination among data collectors and institutions [5,6]. As a consequence, “data” can now mean anything from petabytes of information stored in professionally maintained databases, to spreadsheets on a single computer, to handwritten tables in lab notebooks on shelves. All remain important, but data curation practices must continue to keep pace with the changes brought about by new forms of data and new data collection and storage practices.
