首页> 外文会议>2015 International Conference on Cloud Technologies and Applications >Log-based change data capture from schema-free document stores using MapReduce
【24h】

Log-based change data capture from schema-free document stores using MapReduce

机译:使用MapReduce从无模式文档存储中捕获基于日志的更改数据

获取原文
获取原文并翻译 | 示例

摘要

Change data capture (CDC) is an approach to data integration that is used to determine and track the data that has changed so that action can be taken using the change data. However, the state of art of change data capture (CDC) in the context of document-oriented NoSQL databases is not mature. Therefore, it is urgent to require a NoSQL CDC solution. Although some manufacturers of NoSQL databases start to research on CDC for NoSQL, these approaches are just for the specific product. In our paper, we propose a log-based CDC approach from abstract schema-free document stores using MapReduce. The process is divided into map and reduce procedures, benefited from MapReduce framework, to generate cell state models (CSMs). In order to infinitely look back to any revision, we enable our proposed CSM to support copy-modify-merge model to manage the revisions of change data. Finally, experimental results show that this approach is independent and appropriate for document stores, with high performance and throughput capacity.
机译:更改数据捕获(CDC)是一种数据集成方法,用于确定和跟踪已更改的数据,以便可以使用更改数据来采取措施。但是,在面向文档的NoSQL数据库的上下文中,变更数据捕获(CDC)的技术水平还不成熟。因此,迫切需要NoSQL CDC解决方案。尽管一些NoSQL数据库制造商开始研究CDC for NoSQL,但是这些方法仅适用于特定产品。在本文中,我们使用MapReduce从抽象的无模式文档存储中提出了一种基于日志的CDC方法。得益于MapReduce框架,该过程分为map和reduce过程,以生成单元状态模型(CSM)。为了无限地回顾任何修订,我们使建议的CSM支持复制-修改-合并模型来管理变更数据的修订。最后,实验结果表明该方法是独立的,适用于文档存储,具有高性能和吞吐能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号