An Investigation of HTTP Header Information for Detecting Changes of Linked Open Data Sources

Abstract

Data on the Linked Open Data (LOD) cloud changes frequently. Applications that operate on local caches of Linked Data need to be aware of these changes so that they can update their caches and operate on the most recent version of the data. Given the HTTP basis recommended in the Linked Data guidelines, the native way of detecting changes would be to use HTTP header information, such as the Last-Modified field. However, it is uncertain to what degree this field is currently supported on the LOD cloud and how reliable the provided information is. In this paper, we analyse a large-scale dataset obtained from the LOD cloud by weekly crawls over almost two years. On these weekly snapshots, we observed that the HTTP header field Last-Modified is actually available for only 15 % of the Linked Data resources, and that the date provided for the last modification aligns with the observed changes of the data itself in only 8 % of cases.
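
The kind of check described above can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not define how "aligns" is measured, so the sketch assumes that a header is consistent when the Last-Modified date advances between two consecutive snapshots exactly when the content changed. The names `Snapshot`, `parse_last_modified`, and `classify` are hypothetical, and comparing raw bytes rather than parsed RDF triples is a simplification.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime
from email.utils import parsedate_to_datetime
from typing import Optional


@dataclass
class Snapshot:
    """One crawl of a resource (hypothetical structure, for illustration only)."""
    last_modified: Optional[str]  # raw Last-Modified header value, or None if absent
    body: bytes                   # response body as retrieved


def parse_last_modified(value: Optional[str]) -> Optional[datetime]:
    """Parse an HTTP date; return None if the header is missing or malformed."""
    if not value:
        return None
    try:
        return parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return None


def classify(prev: Snapshot, curr: Snapshot) -> str:
    """Compare two consecutive snapshots of the same resource.

    Assumed definition: the header is "consistent" when the Last-Modified
    date advanced if and only if the content changed.
    """
    prev_ts = parse_last_modified(prev.last_modified)
    curr_ts = parse_last_modified(curr.last_modified)
    if prev_ts is None or curr_ts is None:
        return "unavailable"

    # Digests stand in for the stored content; a crawler would typically
    # keep only the digest of each snapshot rather than the full body.
    prev_digest = hashlib.sha256(prev.body).hexdigest()
    curr_digest = hashlib.sha256(curr.body).hexdigest()
    content_changed = prev_digest != curr_digest
    header_advanced = curr_ts > prev_ts

    return "consistent" if content_changed == header_advanced else "inconsistent"
```

Counting the three outcomes over all resources and snapshot pairs would give availability and consistency ratios comparable in spirit to the 15 % and 8 % figures, although the paper's exact measurement may differ.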
