首页> 外文会议>IEEE international conference on data engineering >Profiling and mining RDF data with ProLOD++
【24h】

Profiling and mining RDF data with ProLOD++

机译:使用ProLOD ++分析和挖掘RDF数据

获取原文

摘要

Before reaping the benefits of open data to add value to an organizations internal data, such new, external datasets must be analyzed and understood already at the basic level of data types, constraints, value patterns etc. Such data profiling, already difficult for large relational data sources, is even more challenging for RDF datasets, the preferred data model for linked open data. We present ProLod++, a novel tool for various profiling and mining tasks to understand and ultimately improve open RDF data. ProLod++ comprises various traditional data profiling tasks, adapted to the RDF data model. In addition, it features many specific profiling results for open data, such as schema discovery for user-generated attributes, association rule discovery to uncover synonymous predicates, and uniqueness discovery along ontology hierarchies. ProLod++ is highly efficient, allowing interactive profiling for users interested in exploring the properties and structure of yet unknown datasets.
机译:在利用开放数据为组织内部数据增加价值之前,必须在数据类型,约束,价值模式等基本层面上分析和理解此类新的外部数据集。数据源,对于RDF数据集(链接的开放数据的首选数据模型)更具挑战性。我们介绍了ProLod ++,这是一种用于各种分析和挖掘任务的新颖工具,可以理解并最终改善开放的RDF数据。 ProLod ++包含各种适合RDF数据模型的传统数据分析任务。此外,它还具有许多针对开放数据的特定分析结果,例如用于用户生成的属性的架构发现,用于发现同义谓词的关联规则发现以及沿着本体层次结构的唯一性发现。 ProLod ++效率很高,允许对有兴趣探索未知数据集的属性和结构的用户进行交互式配置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号