【24h】

Partial Answers in Information Integration Systems

机译:信息集成系统中的部分答案

获取原文
获取原文并翻译 | 示例

摘要

In the context of local-as-view information integration system a source collection defines a set of possible databases, therefore, querying a source collection, at least conceptually, means applying the query to each possible database, obtaining a set of possible answers. With this in mind we introduced the notion of the exact answer, which can be represented as a relation containing null values, and we gave two methods for computing the exact answer. The first method involves explicitly computing T(S) ― a syntactic representation of the set of global databases implicitly defined by the sources. The φ evaluation is then used to compute the exact answer. The other method reformulates a query in terms of the source relations, and, thus, avoids inverting the entire source collection. In order to achieve this we generalized classical notion of query containment to p-containment and used this notion to reformulate the query as a union of p-contained conjunctive queries. Then we defined the φ-evaluation which uses p-contained rewritings to compute the exact answer on the source collection. We have developed an experimental system, which enables us to evaluate the performance of the proposed methods to compute the exact answer in practice. As the result of our experiments we have established that computation of the exact answer can be done efficiently for all practical situations including the large-scale systems. Moreover, by comparing our results to the results presented in [9], we have confirmed that shifting from the computation of the certain answer to the computation of the exact answer does not change the average running time of query rewriting algorithms.
机译:在局部视点信息集成系统的上下文中,源集合定义了一组可能的数据库,因此,至少在概念上查询源集合意味着将查询应用于每个可能的数据库,以获得一组可能的答案。考虑到这一点,我们介绍了精确答案的概念,可以将其表示为包含空值的关系,并给出了两种计算精确答案的方法。第一种方法涉及显式计算T(S)-源隐式定义的全局数据库集的语法表示。然后使用φ评估来计算确切答案。另一种方法根据源关系重新构造了查询,因此避免了反转整个源集合。为了实现此目的,我们将查询包含的经典概念推广到p包含,并使用此概念将查询重新表述为p包含的联合查询的并集。然后,我们定义了φ评估,该评估使用p包含的重写来计算源集合上的确切答案。我们已经开发了一个实验系统,该系统使我们能够评估所提出方法的性能,以在实践中计算出准确答案。作为我们实验的结果,我们已经确定,可以针对包括大型系统在内的所有实际情况有效地计算出准确答案。此外,通过将我们的结果与[9]中给出的结果进行比较,我们已经确认,从某个答案的计算向准确答案的计算转移不会改变查询重写算法的平均运行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号