首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery >Data Mapper: An Operator for Expressing One-to-Many Data Transformations
【24h】

Data Mapper: An Operator for Expressing One-to-Many Data Transformations

机译:数据映射器:用于表达一对多数据转换的操作员

获取原文

摘要

Transforming data is a fundamental operation in application scenarios involving data integration, legacy data migration, data cleaning, and extract-transform-load processes. Data transformations are often implemented as relational queries that aim at leveraging the optimization capabilities of most RDBMSs. However, relational query languages like SQL are not expressive enough to specify an important class of data transformations that produce several output tuples for a single input tuple. This class of data transformations is required for solving the data heterogeneities that occur when source data represents an aggregation of target data. In this paper, we propose and formally define the data mapper operator as an extension of the relational algebra to address one-to-many data transformations. We supply an algebraic rewriting technique that enables the optimization of data transformation expressions that combine filters expressed as standard relational operators with mappers. Furthermore, we identify the two main factors that influence the expected optimization gains.
机译:转换数据是涉及数据集成,传统数据迁移,数据清洁和提取转换加载过程的应用方案中的基本操作。数据转换通常被实现为关系查询,其目的在利用大多数RDBMSS的优化功能。但是,像SQL这样的关系查询语言不具有足够态度,以指定为单个输入元组产生多个输出元组的重要数据转换。求解源数据表示目标数据的聚合时,需要该类的数据变换来求解发生的数据异质性。在本文中,我们建议并正式将数据映射器运营商定义为关系代数的扩展,以解决一对多数据转换。我们提供了一种代数重写技术,可以实现将滤波器与映射器组合为标准关系运算符的滤波器的数据转换表达式。此外,我们确定影响预期优化收益的两个主要因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号