首页> 外文期刊>Big Data, IEEE Transactions on >A Distributed Stream Library for Java 8
【24h】

A Distributed Stream Library for Java 8

机译:Java 8的分布式流库

获取原文
获取原文并翻译 | 示例

摘要

Java 8 has introduced new capabilities such as lambda expressions and streams which simplify data-parallel computing. However, as a base language for Big Data systems, it still lacks a number of important capabilities such as processing very large datasets and distributing the computation over multiple machines. This paper gives an overview of the Java 8 Streams API and proposes extensions to allow its use in Big Data systems. It also shows how the API can be used to implement a range of standard Big Data paradigms. Finally, it compares performance with that of Hadoop and Spark. Despite being a proof-of-concept implementation, results indicate that it is a lightweight and efficient framework, comparable in performance to Hadoop and Spark, and is up to 5 times faster for the largest input sizes tested.
机译:Java 8引入了新功能,例如lambda表达式和流,这些功能简化了数据并行计算。但是,作为大数据系统的基本语言,它仍然缺少许多重要功能,例如处理非常大的数据集并将计算分布在多台计算机上。本文概述了Java 8 Streams API,并提出了扩展使其可以在大数据系统中使用。它还显示了如何使用API​​来实现一系列标准的大数据范例。最后,它将性能与Hadoop和Spark的性能进行比较。尽管是概念验证的实现,结果表明它是一个轻量级且高效的框架,其性能可与Hadoop和Spark媲美,并且对于测试的最大输入大小,其速度最高可提高5倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号