【24h】

Using Probabilistic Information in Data Integration

机译:在数据集成中使用概率信息

获取原文
获取原文并翻译 | 示例

摘要

The goal of a mediator system is to provide users a uniform interface to the multitude of information sources. To translate user queries, given in a mediated schema, to queries on the data sources, mediators rely on explicit mappings between the contents of the data sources and the meanings of the relations in the mediated schema.rnThus far, contents of data sources were described qualitatively. In this paper we describe the use of quantitative information in the form of probabilistic knowledge in mediator systems. We consider several kinds of probabilistic information: information about overlap between collections in the mediated schema, coverage of the information sources, and degrees of overlap between information sources. We address the problem of ordering accesses to multiple information sources, in order to maximize the likelihood of obtaining answers as early as possible. We describe a declarative formalism for specifying these kinds of probabilistic information, and we propose algorithms for ordering the information sources. Finally, we discuss a preliminary experimental evaluation of these algorithms on the domain of bibliographic sources available on the WWW.
机译:中介系统的目标是为用户提供与众多信息源的统一界面。为了将在中介模式中给出的用户查询转换为对数据源的查询,介体依赖于数据源的内容与中介模式中关系的含义之间的显式映射。到目前为止,已描述了数据源的内容定性地。在本文中,我们以概率知识的形式描述了调解员系统中定量信息的使用。我们考虑几种概率信息:有关中介模式中集合之间重叠的信息,信息源的覆盖范围以及信息源之间的重叠程度。我们解决了对访问多个信息源进行排序的问题,以便最大程度地尽早获得答案。我们描述了一种声明形式形式,用于指定这些概率信息,并提出了用于对信息源进行排序的算法。最后,我们在WWW上提供的书目资源领域讨论了这些算法的初步实验评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号