首页> 外国专利> System, method, and computer-readable medium for partial redistribution, partial duplication of rows of parallel join operation on skewed data

System, method, and computer-readable medium for partial redistribution, partial duplication of rows of parallel join operation on skewed data

机译:用于对偏斜数据进行部分重新分布,并行连接操作的行的部分复制的系统,方法和计算机可读介质

摘要

A system, method, and computer-readable medium that facilitate management of data skew during a parallel join operation are provided. Portions of tables involved in the join operation are distributed among a plurality of processing modules, and each of the processing modules is provided with a list of skewed values of a join column of a larger table involved in the join operation. Each of the processing modules scans the rows of the tables distributed to the processing modules and compares values of the join columns of both tables with the list of skewed values. Rows of the larger table having non-skewed values in the join column are redistributed, and rows of the larger table having skewed values in the join column are maintained locally at the processing modules. Rows of the smaller table that have non-skewed values in the join column are redistributed, and rows of the smaller table that have skewed values in the join column are duplicated among the processing modules.
机译:提供了一种在并行连接操作期间促进数据偏斜的管理的系统,方法和计算机可读介质。联接操作中涉及的表的部分分布在多个处理模块之间,并且每个处理模块都提供有联接操作中所涉及的较大表的联接列的偏斜值的列表。每个处理模块都扫描分配给处理模块的表的行,并将两个表的连接列的值与偏斜值列表进行比较。重新分配连接列中具有不偏斜值的较大表的行,并且连接列中具有不偏斜值的较大表的行在本地维护在处理模块中。在联结列中具有不偏斜值的较小表的行将被重新分配,并且在处理模块之间复制在联结列中具有偏斜值的较小表的行。

著录项

  • 公开/公告号US8131711B2

    专利类型

  • 公开/公告日2012-03-06

    原文格式PDF

  • 申请/专利权人 YU XU;PEKKA KOSTAMAA;

    申请/专利号US20080125299

  • 发明设计人 PEKKA KOSTAMAA;YU XU;

    申请日2008-05-22

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 17:25:56

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号