...
首页> 外文期刊>International Journal of Spatial Data Infrastructures Research >Analysis of quality metadata in the GEOSS Clearinghouse
【24h】

Analysis of quality metadata in the GEOSS Clearinghouse

机译:GEOSS信息交换所中的质量元数据分析

获取原文
           

摘要

The Global Earth Observation System of Systems (GEOSS) Clearinghouse is part of the GEOSS Common Infrastructure (GCI) that supports the discovery of the data made available by the Group on Earth Observations (GEO) members and participant organizations in GEOSS. It also acts as a unified metadata catalogue that stores complete metadata records, not only about datasets but also for other kinds of components and services. By exploring these records, users often try to find the fit-for-use data. Quality indicators and provenance are included in the metadata and are potentially useful variables that allow users to make an informed decision avoiding to download and to assess the data themselves. However, no previous studies have been made on the completeness and correctness of the metadata records in the Clearinghouse. The objective of this paper is to analyze the data quality information distributed by the GEOSS Clearinghouse. The aim is to quantify its completeness and to provide clues on how the current status of the Clearinghouse could be improved and how useful quality aware tools could be. The methodology used in the current analysis consists in first harvesting of the Clearinghouse and then quantify the quality information found in 97203 metadata records, by using a semi-automatic approach. The results reveal that the inclusion of quality information on metadata records is not rare: 19.66% of the metadata records contain some quality element. However, this is not general enough and several aspects could be improved. For instance, 77.78% of quantitative measures lack measure units. When quality indicators are not sufficient, the lineage metadata information could be used to mitigate this situation by analysing the process steps and sources used to create a dataset. However, even though lineage is reported in 15.55% of the records, only 1.27% of the cases return a complete list of process steps with sources. This paper also provides indications on what is lacking in the current producer metadata model and, detected a gap in usage or user feedback metadata in GEOSS. Moreover, information extracted from GeoViQua interviews with users indicates that they value informal comments and user feedback on datasets as a complement of the more formal producer-oriented metadata description of the data. Although, many efforts within the scientific community and the Quality Assurance Framework for Earth Observation (QA4EO) group have been invested in describing how to parameterize data quality and uncertainty, we conclude that still extra work can be done to provide complete quality information in the metadata catalogues. In brief, since the GEOSS Clearinghouse references data from the most important agencies and research organizations, the results presented in this paper provide a perspective on how well quality is disseminated in the Earth observation community in general.
机译:全球地球观测系统系统(GEOSS)信息交换所是GEOSS通用基础设施(GCI)的一部分,该基础设施支持发现地球观测组(GEO)成员和GEOSS参与组织提供的数据。它还充当统一的元数据目录,该目录不仅存储有关数据集的完整元数据记录,还存储其他种类的组件和服务的完整元数据记录。通过浏览这些记录,用户经常尝试查找适合使用的数据。质量指标和出处包括在元数据中,它们是潜在有用的变量,可让用户做出明智的决定,而无需自己下载和评估数据。但是,以前尚未对信息交换所中元数据记录的完整性和正确性进行过研究。本文的目的是分析GEOSS信息交换所分发的数据质量信息。目的是量化其完整性,并提供有关如何改善信息交换所当前状态以及如何使用质量意识工具的线索。当前分析中使用的方法包括:首先收集信息交换所,然后使用半自动方法量化在97203元数据记录中找到的质量信息。结果表明,在元数据记录中包含质量信息的情况并不罕见:19.66%的元数据记录中包含一些质量元素。但是,这还不够普遍,可以改进几个方面。例如,定量指标中有77.78%缺乏指标单位。当质量指标不足时,沿袭元数据信息可用于通过分析用于创建数据集的过程步骤和来源来缓解这种情况。但是,即使在记录的15.55%中报告了沿袭,也只有1.27%的案例返回了带有来源的处理步骤的完整列表。本文还提供了有关当前生产者元数据模型中缺少哪些内容的指示,并发现了GEOSS中使用或用户反馈元数据方面的差距。此外,从与用户的GeoViQua访谈中提取的信息表明,他们重视对数据集的非正式评论和用户反馈,作为对数据的更为正式的面向生产者的元数据描述的补充。尽管在科学界和“地球观测质量保证框架”(QA4EO)小组内进行了许多努力,以描述如何参数化数据质量和不确定性,但我们得出结论,仍可以做更多的工作来在元数据中提供完整的质量信息目录。简而言之,由于GEOSS信息交换所引用了最重要的机构和研究组织的数据,因此本文提供的结果为人们对如何在地球观测社区中普遍传播质量进行了展望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号