Subsumption and Complementation as Data Fusion Operators

机译：包含和补充作为数据融合运算符

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The goal of data fusion is to combine several representations of one real world object into a single, consistent representation, e.g., in data integration. A very popular operator to perform data fusion is the minimum union operator. It is defined as the outer union and the subsequent removal of subsumed tuples. Minimum union is used in other applications as well, for instance in database query optimization to rewrite outer join queries, in the semantic web community in implementing Sparql's optional operator, etc. Despite its wide applicability, there are only few efficient implementations, and until now, minimum union is not a relational database primitive.rnThis paper fills this gap as we present implementations of sub-sumption that serve as a building block for minimum union. Furthermore, we consider this operator as database primitive and show how to perform optimization of query plans in presence of sub-sumption and minimum union through rule-based plan transformations. Experiments on both artificial and real world data show that our algorithms outperform existing algorithms used for subsumption in terms of runtime and they scale to large volumes of data.rnIn the context of data integration, we observe that performing data fusion calls for more than subsumption and minimum union. Therefore, another contribution of this paper is the definition of the complementation and complement union operators. Intuitively, these allow to merge tuples that have complementing values and thus eliminate unnecessary null-values.

机译：数据融合的目的是例如在数据集成中将一个现实世界对象的几种表示组合成单个一致的表示。最小联合运算符是一种非常流行的执行数据融合的运算符。它定义为外部联合和随后删除的已包含的元组。最小并集还用于其他应用程序中，例如在数据库查询优化中重写外部联接查询，在语义Web社区中实现Sparql的可选运算符等。尽管它具有广泛的适用性，但只有很少的有效实现，直到现在，最小工会不是关系数据库的原始语言。本文填补了这一空白，因为我们介绍了作为最小工会构建模块的子消费实现。此外，我们将此运算符视为数据库基元，并说明如何通过基于规则的计划转换在存在子包含量和最小并集的情况下执行查询计划的优化。在人工和现实数据上的实验表明，我们的算法在运行时方面优于现有的包含算法，并且可以扩展到大量数据。在数据集成的上下文中，我们发现执行数据融合不仅需要包含和最低工会。因此，本文的另一贡献是补码和补码联合算子的定义。直观地讲，这些允许合并具有互补值的元组，从而消除不必要的空值。

著录项

来源
《13th international conference on extending database technology 2010》|2010年|P.501-512|共12页
会议地点 Lausanne(CH);Lausanne(CH)
作者
Jens Bleiholder; Sascha Szott; Mejanie Herschel; Frank Kaufer; Felix Naumann;
展开▼
作者单位

Hasso-Plattner-Institut Potsdam, Germany;

rnKonrad-Zuse-Zentrum fuer Informationstechnik Berlin Berlin, Germany;

rnUniversitaet Tubingen Tuebingen, Germany;

rnHasso-Plattner-lnstitut Potsdam, Germany;

rnHasso-Plattner-lnstitut Potsdam, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP311.13;
关键词
minimum union; complement union; data integration; data quality;

机译：最低工会补工会数据整合；数据质量;
入库时间 2022-08-26 13:47:09

相似文献

外文文献
中文文献
专利

1. A novel fusion approach based on induced ordered weighted averaging operators for chemometric data analysis [J] . Haikel AlHichri, Yakoub Bazi, Naif Alajlan Journal of Chemometrics . 2013,第12期

机译：基于诱导有序加权平均算子的化学数据分析融合新方法
2. On data fusion in information retrieval using different aggregation operators [J] . Julien Ah-Pine Web Intelligence and Agent Systems . 2011,第1期

机译：使用不同聚合运算符的信息检索中的数据融合
3. Information combination operators for data fusion: a comparative review with classification [J] . Bloch I. IEEE transactions on systems, man, and cybernetics. Part A . 1996,第1期

机译：用于数据融合的信息组合算子：分类比较研究
4. Generalized Complement Operators And Applications In Some Semirings [C] . G. Bijev International Conference "Applications of Mathematics in Engineering and Economics" . 2013

机译：在一些半杂项中的广义补充运营商和应用
5. Complemented subspaces of bounded linear operators. [D] . Bahreini Esfahani, Manijeh. 2003

机译：有界线性算子的互补子空间。
6. Time Series Data Fusion Based on Evidence Theory and OWA Operator [O] . Gang Liu, Fuyuan Xiao 2019

机译：基于证据理论和OWA算子的时间序列数据融合
7. Hybrid tracking of human operators using IMU/UWB data fusion by a Kalman filter [O] . Corrales Ramón, Juan Antonio, Candelas-Herías, Francisco A., Torres Medina, Fernando 2008

机译：通过卡尔曼滤波器使用IMU / UWB数据融合对操作员进行混合跟踪
8. Exploratory Study of the Interpretation of Logical Operators in Database Querying(Verkennende Studie van de Interpretatie van Logische Operatoren bijhet Bevragen van een Database) [R] . Essens, P. J., McCann, C. A., Hartevelt, M. A. 1991

机译：数据库查询中逻辑运算符解释的探索性研究（Verkennende studie van de Interpretatie van Logische Operatoren bijhet Bevragen van een Database）

Subsumption and Complementation as Data Fusion Operators

摘要

著录项

相似文献

相关主题

期刊订阅