首页> 外文会议>European semantic web conference on satellite events >Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets
【24h】

Merging and Enriching DCAT Feeds to Improve Discoverability of Datasets

机译:合并和丰富DCAT提要以提高数据集的可发现性

获取原文

摘要

Data Catalog Vocabulary (DCAT) is a W3C specification to describe datasets published on the Web. However, these catalogs are not easily discoverable based on a user's needs. In this paper, we introduce the Node.js module 'dcat-merger' which allows a user agent to download and semantically merge different DCAT feeds from the Web into one DCAT feed, which can be republished. Merging the input feeds is followed by enriching them. Besides determining the subjects of the datasets, using DBpedia Spotlight, two extensions were built: one categorizes the datasets according to a taxonomy, and the other adds spatial properties to the datasets. These extensions require the use of information available in DBpedia's SPARQL endpoint. However, public SPARQL endpoints often suffer from low availability, its Triple Pattern Fragments alternative is used. However, the need for DCAT Merger sparks the discussion for more high level functionality to improve a catalog's discoverability.
机译:数据目录词汇表(DCAT)是W3C规范,用于描述在Web上发布的数据集。但是,根据用户的需求不容易发现这些目录。在本文中,我们介绍了Node.js模块“ dcat-merger”,该模块允许用户代理从Web下载不同的DCAT提要并将其语义合并到一个DCAT提要中,然后可以将其重新发布。合并输入提要之后,再进行丰富。除了确定数据集的主题外,还使用DBpedia Spotlight构建了两个扩展:一个扩展是根据分类法对数据集进行分类,另一个扩展功能是向数据集添加空间属性。这些扩展要求使用DBpedia的SPARQL端点中可用的信息。但是,公共SPARQL端点通常遭受低可用性的困扰,因此使用了“三重模式片段”替代方案。但是,对DCAT合并的需求引发了关于更多高级功能以提高目录的可发现性的讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号