首页> 外文会议>Conference on Computing in High Energy and Nuclear Physics >Sequential data access with Oracle and Hadoop:a performance comparison
【24h】

Sequential data access with Oracle and Hadoop:a performance comparison

机译:与Oracle和Hadoop的顺序数据访问:性能比较

获取原文

摘要

The Hadoop framework has proven to be an effective and popular approach for dealing with "Big Data" and, thanks to its scaling ability and optimised storage access, Hadoop Distributed File System-based projects such as MapReduce or HBase are seen as candidates to replace traditional relational database management systems whenever scalable speed of data processing is a priority.But do these projects deliver in practice? Does migrating to Hadoop's "shared nothing" architecture really improve data access throughput? And, if so, at what cost? Authors answer these questions-addressing cost/performance as well as raw performance- based on a performance comparison between an Oracle-based relational database and Hadoop's distributed solutions like MapReduce or HBase for sequential data access.A key feature of our approach is the use of an unbiased data model as certain data models can significantly favour one of the technologies tested.
机译:Hadoop框架已被证明是处理“大数据”的有效和流行的方法,并且由于其缩放能力和优化的存储访问,Hadoop分布式文件系统如MapReduce或HBase的项目被视为替换传统的候选者关系数据库管理系统随时可扩展数据处理速度是优先级。但是这些项目在实践中提供吗?迁移到Hadoop的“共享”架构确实改善了数据访问吞吐量?而且,如果是的,在什么费用?作者回答了这些问题 - 解决成本/性能以及基于oracle的关系数据库和Hadoop的分布式解决方案的性能比较,如MapReduce或HBase,用于顺序数据访问。我们方法的主要特征是使用当某些数据模型时,一个无偏的数据模型可以显着支持测试的一项技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号