首页> 美国卫生研究院文献>GigaScience >Datastorr: a workflow and package for delivering successive versions of evolving data directly into R

【2h】

Datastorr: a workflow and package for delivering successive versions of evolving data directly into R

机译：Datastorr：一个工作流和软件包用于将连续的不断发展的数据版本直接传递到R中

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

The sharing and re-use of data has become a cornerstone of modern science. Multiple platforms now allow easy publication of datasets. So far, however, platforms for data sharing offer limited functions for distributing and interacting with evolving datasets— those that continue to grow with time as more records are added, errors fixed, and new data structures are created. In this article, we describe a workflow for maintaining and distributing successive versions of an evolving dataset, allowing users to retrieve and load different versions directly into the R platform. Our workflow utilizes tools and platforms used for development and distribution of successive versions of an open source software program, including version control, GitHub, and semantic versioning, and applies these to the analogous process of developing successive versions of an open source dataset. Moreover, we argue that this model allows for individual research groups to achieve a dynamic and versioned model of data delivery at no cost.

机译：数据的共享和重用已经成为现代科学的基石。多个平台现在允许轻松发布数据集。但是，到目前为止，用于数据共享的平台提供了有限的功能来分配和与不断发展的数据集进行交互-随着添加更多记录，修复错误和创建新数据结构，随着时间的推移，这些功能会继续增长。在本文中，我们描述了用于维护和分发不断发展的数据集的连续版本的工作流，允许用户检索不同版本并将其直接加载到R平台中。我们的工作流程利用了用于开发和分发开源软件程序的后续版本（包括版本控制，GitHub和语义版本控制）的工具和平台，并将这些工具和平台应用于开发开源数据集的后续版本的类似过程。此外，我们认为该模型允许各个研究小组免费获得动态的版本化数据交付模型。

著录项

期刊名称 GigaScience
作者
Daniel S Falster; Richard G FitzJohn; Matthew W Pennell; William K Cornwell;
展开▼
作者单位

展开▼
年(卷),期 2019(8),5
年度 2019
页码 giz035
总页数 8
原文格式 PDF
正文语种
中图分类
关键词
data sharing version control semantic versioning;

机译：数据共享;版本控制;语义版本控制;

相似文献

外文文献
中文文献
专利

1. Datastorr: a workflow and package for delivering successive versions of 'evolving data' directly into R [J] . Falster Daniel S, FitzJohn Richard G, Pennell Matthew W, GigaScience . 2019,第5期

机译：Datastorr：一个工作流程和软件包，用于将“不断发展的数据”的后续版本直接传递到R中
2. Light- and medium-duty focus: PACKAGE FLEETS - Productive packages: Evolving UPS, FedEx package fleets deliver productivity gains, fuel economy [J] . JOHN G. SMITH Commercial Carrier Journal . 2012,第9期

机译：专注于中型和轻型：包装机-生产性包装：不断发展的UPS，FedEx包装机队可提高生产率，节省燃油
3. Caldera Flow+ 2.0: Revamped version of the wide-form at workflow package [J] . Simon Eccles Printweek . 2015,第Nova9期

机译：Caldera Flow + 2.0：工作流程包中的宽版面的修订版
4. Datatrack: An R package for managing data in a multi-stage experimental workflow data versioning and provenance considerations in interactive scripting [C] . Philip Eichinski, Paul Roe IEEE International Conference on e-Science . 2016

机译：Datatrack：R包，用于管理多阶段实验工作流中的数据，交互式脚本中的数据版本控制和出处注意事项
5. A Locality-Aware Scientific Workflow Engine for Fast-Evolving Spatiotemporal Sensor Data. [D] . Kachikaran Arulswamy, Johnson Charles. 2017

机译：本地感知科学工作流引擎，用于快速发展的时空传感器数据。
6. Impact framework: A python package for writing data analysis workflows to interpret microbial physiology [O] . Naveen Venayak, Kaushik Raj, Radhakrishnan Mahadevan 2019

机译：Impact框架：用于编写数据分析工作流程以解释微生物生理的python包
7. Datastorr: a workflow and package for delivering successive versions of 'evolving data' directly into R [O] . Daniel S Falster, Richard G FitzJohn, Matthew W Pennell, 2019

机译：DataStorR：用于将连续版本的“不断发展的数据”直接传送到R的工作流程和包

Datastorr: a workflow and package for delivering successive versions of evolving data directly into R

摘要

著录项

相似文献

相关主题

期刊订阅