首页> 外文会议>Knowledge engineering and management by the masses >Using Semantic Web Resources for Data Quality Management
【24h】

Using Semantic Web Resources for Data Quality Management

机译:使用语义Web资源进行数据质量管理

获取原文
获取原文并翻译 | 示例

摘要

The quality of data is a critical factor for all kinds of decision-making and transaction processing. While there has been a lot of research on data quality in the past two decades, the topic has not yet received sufficient attention from the Semantic Web community. In this paper, we discuss (1) the data quality issues related to the growing amount of data available on the Semantic Web, (2) how data quality problems can be handled within the Semantic Web technology framework, namely using SPARQL on RDF representations, and (3) how Semantic Web reference data, e.g. from DBPedia, can be used to spot incorrect literal values and functional dependency violations. We show how this approach can be used for data quality management of public Semantic Web data and data stored in relational databases in closed settings alike. As part of our work, we developed generic SPARQL queries to identify (1) missing datatype properties or literal values, (2) illegal values, and (3) functional dependency violations. We argue that using Semantic Web datasets reduces the effort for data quality management substantially. As a use-case, we employ Geonames, a publicly available Semantic Web resource for geographical data, as a trusted reference for managing the quality of other data sources.
机译:数据质量是各种决策和交易处理的关键因素。尽管在过去的二十年中对数据质量进行了大量研究,但该主题尚未得到语义Web社区的足够重视。在本文中,我们讨论(1)与语义Web上可用数据的增长有关的数据质量问题,(2)如何在语义Web技术框架内处理数据质量问题,即在RDF表示形式上使用SPARQL, (3)语义网如何引用数据,例如DBPedia的产品可用于发现不正确的文字值和功能依赖关系违规。我们将展示如何将此方法用于公共语义Web数据以及在封闭环境中存储在关系数据库中的数据的数据质量管理。作为我们工作的一部分,我们开发了通用SPARQL查询以识别(1)缺少的数据类型属性或文字值,(2)非法值和(3)功能依赖关系违规。我们认为使用语义Web数据集会大大减少数据质量管理的工作量。作为一个用例,我们采用了Geonames(地理数据的公开语义Web资源)作为管理其他数据源质量的可信赖参考。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号