首页>
外国专利>
METHOD FOR CONSTRUCTING AND UTILIZING INDEX TO IMPROVE DATA PROCESSING PERFORMANCE BASED ON MAPREDUCE IN HADOOP ENVIRONMENT
METHOD FOR CONSTRUCTING AND UTILIZING INDEX TO IMPROVE DATA PROCESSING PERFORMANCE BASED ON MAPREDUCE IN HADOOP ENVIRONMENT
展开▼
机译:HADOOP环境中基于映射的构造和利用指标以提高数据处理性能的方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention relates to a method for processing data based on MapReduce. Specifically, the present invention relates to a method for constructing and utilizing an index to improve the data processing performance based on MapReduce in a Hadoop environment, which constructs a secondary index to effectively process big data with a MapReduce method in the Hadoop environment, and utilizes the secondary index for MapReduce-based data processing. The method for constructing an index comprises the following steps. Each mapper for processing file splits outputs an intermediate result value to be transmitted to a reducer by using an offset, a length, and a key value (K). Then, the reducer calculates the total offset and the total length by finding the smallest offset section and the largest offset section in a list of each of record sections, and stores the total offset and the total length in a split-level Hadoop index file.;COPYRIGHT KIPO 2017
展开▼