首页> 外文期刊>Journal of web semantics: >Enhancing answer completeness of SPARQL queries via crowdsourcing
【24h】

Enhancing answer completeness of SPARQL queries via crowdsourcing

机译:通过众包提高SPARQL查询的答案完整性

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Linked Open Data initiatives have encouraged the publication of large RDF datasets into the Linking Open Data (LOD) cloud, including DBpedia, YAGO, and Geo-Names. Despite the size of LOD datasets and the development of (semi-) automatic methods to create and link LOD data, these datasets may be still incomplete, negatively affecting thus accuracy of Linked Data processing techniques. We acquire query answer completeness by capturing knowledge collected from the crowd, and propose a novel hybrid query processing engine that brings together machine and human computation to execute SPARQL queries. Our system, HARE, implements these hybrid query processing techniques. HARE encompasses several features: (1) a completeness model for RDF that exploits the characteristics of RDF in order to estimate the completeness of an RDF dataset; (2) a crowd knowledge base that captures crowd answers about missing values in the RDF dataset; (3) a query engine that combines on-the-fly crowd knowledge and estimates provided by the RDF completeness model, to decide upon the sub-queries of a SPARQL query that should be executed against the dataset or via crowd computing to enhance query answer completeness; and (4) a microtask manager that exploits the semantics encoded in the dataset RDF properties, to crowdsource SPARQL sub-queries as microtasks and update the crowd knowledge base with the results from the crowd. Effectiveness and efficiency of HARE are empirically studied on a collection of 50 SPARQL queries against the DBpedia dataset. Experimental results clearly show that our solution accurately enhances answer completeness. (C) 2017 Published by Elsevier B.V.
机译:链接开放数据计划鼓励将大型RDF数据集发布到链接开放数据(LOD)云中,包括DBpedia,YAGO和地理名称。尽管LOD数据集的大小以及建立和链接LOD数据的(半)自动方法的发展,这些数据集可能仍然不完整,从而对链接数据处理技术的准确性产生了负面影响。我们通过捕获从人群中收集的知识来获取查询答案的完整性,并提出一种新颖的混合查询处理引擎,该引擎将机器和人工计算结合在一起以执行SPARQL查询。我们的系统HARE实现了这些混合查询处理技术。 HARE包含以下几个特征:(1)RDF的完整性模型,该模型利用RDF的特征来估计RDF数据集的完整性。 (2)人群知识库,用于收集有关RDF数据集中缺失值的人群答案; (3)查询引擎,它结合了实时人群知识和RDF完整性模型提供的估计,以确定应针对数据集或通过人群计算执行的SPARQL查询的子查询,以增强查询答案完整性(4)一个微任务管理器,它利用在数据集RDF属性中编码的语义,将SPARQL子查询作为微任务众包,并使用众生的结果更新众生知识库。在针对DBpedia数据集的50个SPARQL查询集合上,对HARE的有效性和效率进行了经验研究。实验结果清楚地表明,我们的解决方案可以准确地提高答案的完整性。 (C)2017由Elsevier B.V.发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号