首页>
外国专利>
System and method for data management in an open source distributed computing platform
System and method for data management in an open source distributed computing platform
展开▼
机译:开源分布式计算平台中的数据管理系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
Disclosed is a method and system for data management in an open source distributed computing platform. The system comprises an input module, a data uploading module, an extraction module, an analytical module and a processing module. The input module is constructed to configure one or more parameters to be used for performing one or more operations on one or more document, the parameters are further mapped with the document. The data uploading module selects the document to be uploaded. The extraction module is configured to extract document content by performing a search in the document based on the parameters configured. The analytical module analyzes the document content so extracted by applying one or more logic rules while storing the document content in a distributed file system. The processing module performs operations in a parallel mode on the document content stored in the distributed file system based on the parameters configured.
展开▼