首页>
外国专利>
System, method, and computer-readable medium for partial redistribution, partial duplication of rows of parallel join operation on skewed data
System, method, and computer-readable medium for partial redistribution, partial duplication of rows of parallel join operation on skewed data
展开▼
机译:用于对偏斜数据进行部分重新分布,并行连接操作的行的部分复制的系统,方法和计算机可读介质
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system, method, and computer-readable medium that facilitate management of data skew during a parallel join operation are provided. Portions of tables involved in the join operation are distributed among a plurality of processing modules, and each of the processing modules is provided with a list of skewed values of a join column of a larger table involved in the join operation. Each of the processing modules scans the rows of the tables distributed to the processing modules and compares values of the join columns of both tables with the list of skewed values. Rows of the larger table having non-skewed values in the join column are redistributed, and rows of the larger table having skewed values in the join column are maintained locally at the processing modules. Rows of the smaller table that have non-skewed values in the join column are redistributed, and rows of the smaller table that have skewed values in the join column are duplicated among the processing modules.
展开▼