首页> 外国专利> System and method for data management in an open source distributed computing platform

System and method for data management in an open source distributed computing platform

机译:开源分布式计算平台中的数据管理系统和方法

摘要

Disclosed is a method and system for data management in an open source distributed computing platform. The system comprises an input module, a data uploading module, an extraction module, an analytical module and a processing module. The input module is constructed to configure one or more parameters to be used for performing one or more operations on one or more document, the parameters are further mapped with the document. The data uploading module selects the document to be uploaded. The extraction module is configured to extract document content by performing a search in the document based on the parameters configured. The analytical module analyzes the document content so extracted by applying one or more logic rules while storing the document content in a distributed file system. The processing module performs operations in a parallel mode on the document content stored in the distributed file system based on the parameters configured.
机译:公开了一种用于开源分布式计算平台中的数据管理的方法和系统。该系统包括输入模块,数据上传模块,提取模块,分析模块和处理模块。输入模块被构造成配置用于在一个或多个文档上执行一个或多个操作的一个或多个参数,该参数进一步与文档映射。数据上传模块选择要上传的文档。提取模块被配置为通过基于所配置的参数在文档中执行搜索来提取文档内容。分析模块通过在将文档内容存储在分布式文件系统中的同时应用一个或多个逻辑规则来分析如此提取的文档内容。处理模块基于配置的参数以并行模式对存储在分布式文件系统中的文档内容执行操作。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号