Conference: International Conference on Advances in Information Mining and Management

An Extensible Conceptual Model for Tabular Scientific Datasets


Abstract

Scientists across many disciplines are generating a growing number of datasets. This creates a need for platforms that enable scientists to capture, exchange, process, and interpret data for immediate use, and to store and manage data to support future reuse. Modeling and organizing data within such platforms are key challenges. In this paper, we introduce the dataset model of the BExIS 2 platform and show how data can be organized inside the model. In particular, we describe the anatomy of a general-purpose tabular dataset, which consists of data tuples that represent the table rows and of data cells, which are compound objects holding the obtained values and their auxiliary information. The structure of datasets is defined and applied separately in order to factor out shared concepts such as unit of measurement, methodology, data type, valid and missing values, and processing functions. Datasets are extensible in multiple ways and can be annotated at various levels using taxonomies, ontologies, and custom metadata structures.
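To make the anatomy described in the abstract more concrete, the following is a minimal sketch in Python. It is not the BExIS 2 implementation or its API: every class, field, and method name here (Variable, DataStructure, DataCell, DataTuple, Dataset, add_row, note) is a hypothetical choice made for illustration. It only assumes what the abstract states: a structure that is defined separately and holds shared concepts such as unit, data type, and missing values, rows represented as data tuples, and cells that are compound objects carrying a value plus auxiliary information.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass(frozen=True)
class Variable:
    """Hypothetical column definition holding the shared concepts."""
    name: str
    data_type: type                    # e.g. str, int, float
    unit: Optional[str] = None         # unit of measurement, if any
    missing_values: tuple = ()         # raw markers that mean "no value"

    def is_missing(self, raw: Any) -> bool:
        return raw in self.missing_values


@dataclass(frozen=True)
class DataStructure:
    """Ordered variables, defined once and applied to any number of datasets."""
    variables: tuple


@dataclass
class DataCell:
    """Compound object: the obtained value plus auxiliary information."""
    variable: Variable
    value: Any
    note: Optional[str] = None         # hypothetical auxiliary field


@dataclass
class DataTuple:
    """One table row."""
    cells: list


@dataclass
class Dataset:
    structure: DataStructure
    tuples: list = field(default_factory=list)
    annotations: dict = field(default_factory=dict)  # dataset-level annotations

    def add_row(self, raw_values: list) -> DataTuple:
        variables = self.structure.variables
        if len(raw_values) != len(variables):
            raise ValueError("row length does not match the data structure")
        cells = []
        for variable, raw in zip(variables, raw_values):
            if variable.is_missing(raw):
                cells.append(DataCell(variable, None, note="missing"))
            else:
                # Convert the raw input using the variable's declared type.
                cells.append(DataCell(variable, variable.data_type(raw)))
        row = DataTuple(cells)
        self.tuples.append(row)
        return row


# Usage: one structure, applied to a dataset with two rows.
structure = DataStructure((
    Variable("plot_id", str),
    Variable("temperature", float, unit="degC", missing_values=("NA",)),
))
ds = Dataset(structure, annotations={"discipline": "ecology"})
ds.add_row(["P1", "21.5"])
ds.add_row(["P2", "NA"])
```

Because the unit, data type, and missing-value markers live on the variable rather than on each cell, every cell in a column shares one definition, which is one plausible reading of what the abstract means by factoring out shared concepts.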
