【24h】

Benefit and Cost of Query Answering in PDMS

机译:PDMS中查询答复的收益和成本

获取原文
获取原文并翻译 | 示例

摘要

Peer data management systems (PDMS) are a natural extension to integrated information systems. They consist of a dynamic set of autonomous peers, each of which can mediate between heterogenous schemas of other peers. A new data source joins a PDMS by defining a semantic mapping to one or more other peers, thus forming a network of peers. Queries submitted to a peer are answered with data residing at that peer and by data that is reached along paths of mappings through the network of peers. However, without optimization methods query reformulation in PDMS is very inefficient due to redundancy in mapping paths. We present a decentral strategy that guides peers in their decision along which further mappings the query should be sent. The strategy uses statistics of the peers own data and statistics of mappings to neighboring peers to predict whether it is worthwhile to send the query to that neighbor— or whether the query plan should be pruned at this point. These decisions are guided by a benefit and cost model, trading off the amount of data a neighbor will pass back, and the execution cost of that step. Thus, we allow a high scale-up of PDMS in the number of participating peers.
机译:对等数据管理系统(PDMS)是集成信息系统的自然扩展。它们由一组动态的自治对等体组成,每个对等体可以在其他对等体的异构模式之间进行调解。一种新的数据源通过定义到一个或多个其他对等方的语义映射来加入PDMS,从而形成一个对等方网络。提交给对等方的查询将通过该对等方中的数据以及通过对等方网络沿映射路径到达的数据进行回答。但是,如果没有优化方法,由于映射路径中的冗余,PDMS中的查询重新编制效率非常低。我们提出了一种分散的策略,该策略指导同级做出决策,并根据该策略进一步发送查询。该策略使用对等方自身数据的统计信息以及到相邻对等方的映射统计信息来预测是否值得向该邻居发送查询,或者此时是否应修剪查询计划。这些决策受制于收益和成本模型,权衡邻居将传回的数据量以及该步骤的执行成本。因此,我们允许参与的同行人数大量增加PDMS。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号