首页> 外国专利> JOINING TABLES IN A MAPREDUCE PROCEDURE

JOINING TABLES IN A MAPREDUCE PROCEDURE

机译：映射表中的联接表

页面导航

摘要
著录项
相似文献

摘要

Systems and techniques by which tables can be joined in a mapreduce procedure. In some implementations, when a large table of business data (e.g., having one billion transaction records or more) is to be joined with a large table of customer data (e.g., having hundreds of millions of customer records), then these two tables can be organized before the mapreduce procedure to speed up the table join. For example, the business data and the customer data can both be hash partitioned, based on the same key, into shards of business data and shards of customer data, respectively. The number of shards in these two groups has an integer relationship with each other: for example such that there are two business data shards for every customer data shard, or vice versa.

机译：可以在mapreduce过程中联接表的系统和技术。在一些实现中，当将大型业务数据表（例如，具有十亿个交易记录或更多）与大型客户数据表（例如，具有数亿个客户记录）结合在一起时，这两个表可以在mapreduce程序之前进行组织以加快表连接的速度。例如，可以基于相同的密钥将业务数据和客户数据都分别哈希分区为业务数据的碎片和客户数据的碎片。这两组中的分片数量彼此之间具有整数关系：例如，每个客户数据分片都有两个业务数据分片，反之亦然。

著录项

公开/公告号EP2702510B1

专利类型
公开/公告日2019-09-18

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号EP20120715491
发明设计人 CHATTOPADHYAY BISWAPESH;LIN LIANG;
展开▼

申请日2012-03-28
分类号G06F16/2455;
国家 EP
入库时间 2022-08-21 12:31:56

相似文献

专利
外文文献
中文文献