Journal of Web Semantics
Characterising dataset search: An analysis of search logs and data requests

Abstract

Large amounts of data are becoming increasingly available online. To benefit from them, we need tools to retrieve the most relevant datasets that match one's data needs. Several vocabularies have been developed to describe datasets in order to increase their discoverability, but it is costly and cumbersome for data publishers to annotate datasets using all of them, which raises the question of which properties are most important. In this work we contribute a systematic study of the patterns and specific attributes that data consumers use to search for data, and of how dataset search compares with general web search. We performed a query log analysis based on logs from four national open data portals and conducted a qualitative analysis of user data requests issued to one of them. Search queries issued on data portals differ from those issued to web search engines in their length, topic, and structure. Based on our findings, we hypothesise that portals' search functionalities are currently used in an exploratory manner rather than to retrieve a specific resource. In our study of data requests we found that geospatial and temporal attributes, as well as information on the required granularity of the data, are the most common features. The findings of both analyses suggest that these features are of higher importance in dataset retrieval than in general web search, and that dataset publishers should therefore focus on generating dataset descriptions that include them. (C) 2018 Elsevier B.V. All rights reserved.
