首页> 外文期刊>Information >Blind Queries Applied to JSON Document Stores
【24h】

Blind Queries Applied to JSON Document Stores

机译:盲查询应用于JSON文档存储

获取原文
           

摘要

Social Media, Web Portals and, in general, information systems offer their own Application Programming Interfaces (APIs), used to provide large data sets concerning every aspect of day-by-day life. APIs usually provide data sets as collections of JSON documents. The heterogeneous structure of JSON documents returned by different APIs constitutes a barrier to effectively query and analyze these data sets. The adoption of NoSQL document stores, such as MongoDB , is useful for gathering these data sets, but does not solve the problem of querying the final heterogeneous repository. The aim of this paper is to provide analysts with a tool, named HammerJDB , that allows for blind querying collections of JSON documents within a NoSQL document database. The idea below is that users may know the application domain but it may be that they are not aware of the real structures of the documents stored in the database—the tool for blind querying tries to bridge the gap, by adopting a query rewriting mechanism. This paper is an evolution of a technique for blind querying Open Data portals and of its implementation within the Hammer framework, presented in some previous work. In this paper, we evolve that approach in order to query a NoSQL document database by evolving the Hammer framework into the HammerJDB framework, which is able to work on MongoDB databases. The effectiveness of the new approach is evaluated on a data set (derived from a real-life one), containing job-vacancy ads collected from European job portals.
机译:社交媒体,Web门户和一般的信息系统提供自己的应用程序编程接口(API),用于提供有关日常生活各个方面的大数据集。 API通常提供数据集作为JSON文档的集合。不同API返回的JSON文档的异构结构构成了有效查询和分析这些数据集的障碍。 NoSQL文档存储库(例如MongoDB)的采用对于收集这些数据集很有用,但不能解决查询最终异构存储库的问题。本文的目的是为分析人员提供一个名为HammerJDB的工具,该工具允许在NoSQL文档数据库中盲查询JSON文档的集合。下面的想法是,用户可能知道应用程序领域,但可能是他们不知道存储在数据库中的文档的真实结构-盲查询工具试图通过采用查询重写机制来弥合差距。本文是对盲查询开放数据门户网站技术的改进,并在先前的工作中介绍了在Hammer框架中的实现。在本文中,我们通过将Hammer框架演化为HammerJDB框架(可以在MongoDB数据库上运行)来发展该方法,以查询NoSQL文档数据库。新方法的有效性是在一个数据集(来自现实生活中)上进行评估的,该数据集包含从欧洲工作门户网站收集的职位空缺广告。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号