首页> 外文期刊>Heritage >LODsyndesis: Global Scale Knowledge Services
【24h】

LODsyndesis: Global Scale Knowledge Services

机译:LODsyndesis:全球规模的知识服务

获取原文
       

摘要

In this paper, we present LODsyndesis, a suite of services over the datasets of the entire Linked Open Data Cloud, which offers fast, content-based dataset discovery and object co-reference. Emphasis is given on supporting scalable cross-dataset reasoning for finding all information about any entity and its provenance. Other tasks that can be benefited from these services are those related to the quality and veracity of data since the collection of all information about an entity, and the cross-dataset inference that is feasible, allows spotting the contradictions that exist, and also provides information for data cleaning or for estimating and suggesting which data are probably correct or more accurate. In addition, we will show how these services can assist the enrichment of existing datasets with more features for obtaining better predictions in machine learning tasks. Finally, we report measurements that reveal the sparsity of the current datasets, as regards their connectivity, which in turn justifies the need for advancing the current methods for data integration. Measurements focusing on the cultural domain are also included, specifically measurements over datasets using CIDOC CRM (Conceptual Reference Model), and connectivity measurements of British Museum data. The services of LODsyndesis are based on special indexes and algorithms and allow the indexing of 2 billion triples in around 80 min using a cluster of 96 computers.
机译:在本文中,我们介绍了LODsyndesis,这是整个链接开放数据云的数据集上的一组服务,可提供基于内容的快速数据集发现和对象共引用。重点是支持可伸缩的跨数据集推理,以查找有关任何实体及其来源的所有信息。可以从这些服务中受益的其他任务是与数据的质量和准确性有关的任务,因为收集有关实体的所有信息,以及可行的跨数据集推断,可以发现存在的矛盾并提供信息用于数据清理或估计和建议哪些数据可能是正确的或更准确的。此外,我们将展示这些服务如何通过更多功能帮助丰富现有数据集,从而在机器学习任务中获得更好的预测。最后,我们报告了一些度量标准,这些度量标准揭示了当前数据集的稀疏性以及它们之间的连通性,这反过来又证明了推进当前数据集成方法的必要性。还包括针对文化领域的度量,特别是使用CIDOC CRM(概念参考模型)对数据集的度量,以及大英博物馆数据的连通性度量。 LODsyndesis的服务基于特殊的索引和算法,并允许使用96台计算机的群集在大约80分钟内为20亿个三元组建立索引。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号