首页> 外国专利> MapReduce for distributed database processing

MapReduce for distributed database processing

机译:MapReduce用于分布式数据库处理

摘要

An input data set is treated as a plurality of grouped sets of key/value pairs, which enhances the utility of the MapReduce programming methodology. By utilizing such a grouping, map processing can be carried out independently on two or more related but possibly heterogeneous datasets (e.g., related by being characterized by a common primary key). The intermediate results of the map processing (key/value pairs) for a particular key can be processed together in a single reduce function by applying a different iterator to intermediate values for each group. Different iterators can be arranged inside reduce functions in ways however desired.
机译:输入数据集被视为键/值对的多个分组集,这增强了MapReduce编程方法的实用性。通过利用这样的分组,可以对两个或更多个相关但可能是异构的数据集(例如,通过以共同的主键来表征而相关)独立地进行地图处理。通过将不同的迭代器应用于每个组的中间值,可以在单个归约函数中一起处理特定键的映射处理(键/值对)的中间结果。可以按所需方式在reduce函数内部安排不同的迭代器。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号