首页> 外国专利> RECORD PROCESSING METHOD USING INDEX DATA STRUCTURE IN DISTRIBUTED PROCESSING SYSTEM BASED ON MAPREDUCE

RECORD PROCESSING METHOD USING INDEX DATA STRUCTURE IN DISTRIBUTED PROCESSING SYSTEM BASED ON MAPREDUCE

机译:基于映射的分布式处理系统中基于索引数据结构的记录处理方法

摘要

A method for processing a record by using an index in a MapReduce-based distribution processing system comprises: a step in which a distribution node of a distribution processing system classifies records as analysis targets in an input file; a step in which the distribution node generates a data structure having a plurality of indexes indicating a key and a storage location of each record; a step in which the distribution node generates a new index data structure having a new index indicating a key of a record, a storage location of a record, and an identifier of a data structure to which a record belongs while approaching the plurality of indexes in order of the keys in the data structure; and a step in which the distribution node applies a reduce function to a record stored on the basis of an identifier of a data structure and a storage location of a record indicated by an index having the same key while approaching indexes in order of keys in the new data structure.
机译:一种在基于MapReduce的分发处理系统中通过使用索引来处理记录的方法,包括:步骤,分发处理系统的分发节点将记录分类为输入文件中的分析目标;分发节点生成具有多个索引的数据结构的索引的步骤,所述索引指示每个记录的关键字和存储位置;步骤,其中分发节点在接近多个索引时生成具有新索引的新索引数据结构,该新索引数据结构指示记录的关键字,记录的存储位置以及记录所属的数据结构的标识符。键在数据结构中的顺序;步骤,其中分配节点根据数据结构的标识符和由具有相同键的索引表示的记录的存储位置,对存储的记录应用归约功能,同时按索引中的键顺序接近索引。新的数据结构。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号