Analyzing Big Data Streams with Apache SAMOA

Abstract

Apache SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams. Big data is defined as datasets whose size is beyond the ability of typical software tools to capture, store, manage, and analyze, due to the time and memory complexity involved. Velocity is one of the main properties of big data. Apache SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks, such as classification, clustering, and regression, as well as programming abstractions for developing new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines, such as Apache Flink, Apache Storm, Apache Samza, and Apache Apex. Apache SAMOA is written in Java and is available at https://samoa.incubator.apache.org/ under the Apache Software License version 2.0.