首页> 外国专利> System and method for tracking flow of data during map-reduce job execution in hadoop

System and method for tracking flow of data during map-reduce job execution in hadoop

机译：在hadoop中执行map-reduce作业时跟踪数据流的系统和方法

页面导航

摘要
著录项
相似文献

摘要

Disclosed is a method and system for tracking flow of data in a distributed file system. The system may receive a MapReduce job application at first. Subsequently, the system may identify relevant locations of code of the MapReduce job application. The system may instrument the code by adding one or more program statements at the relevant locations of the code. The instrumented code may be executed at each node to process big data. The system may receive processing details of the big data from each node. The system may aggregate the processing details to generate a hierarchical dataflow map to be used for tracking flow of the data in the distributed file system.

机译：公开了一种用于跟踪分布式文件系统中的数据流的方法和系统。系统可能首先会收到一个MapReduce作业应用程序。随后，系统可以标识MapReduce作业应用程序的代码的相关位置。系统可以通过在代码的相关位置添加一个或多个程序语句来检测代码。检测到的代码可以在每个节点上执行以处理大数据。系统可以从每个节点接收大数据的处理细节。该系统可以聚合处理细节以生成分层数据流图，该分层数据流图用于跟踪分布式文件系统中的数据流。

著录项

公开/公告号IN2014MU01857A

专利类型
公开/公告日2015-12-11

原文格式PDF
申请/专利权人
展开▼

申请/专利号IN1857/MUM/2014
发明设计人 MISHRA MAYANK;
展开▼

申请日2014-06-05
分类号H04L12/26;
国家 IN
入库时间 2022-08-21 14:25:21

相似文献

专利
外文文献
中文文献