Journal: Advances in Engineering Software

Cloud-agnostic architectures for machine learning based on Apache Spark

Abstract

Reference architectures for Big Data, machine learning, and stream processing include not only recommended practices and interconnected building blocks but also considerations for scalability, availability, manageability, and security. However, the automated deployment of multi-VM platforms on various clouds based on such reference architectures may raise several issues. The paper focuses on the widespread Apache Spark Big Data platform as the baseline and on the Occopus cloud-agnostic orchestrator tool. The new-generation reference architectures are configurable through human-readable descriptors according to the available resources and cloud providers, and they offer various components such as Jupyter Notebook, RStudio, HDFS, and Kafka. Even data scientists can deploy these pre-configured reference architectures automatically and on demand, using a multi-cloud approach that covers a wide range of cloud systems such as Amazon AWS, Microsoft Azure, OpenStack, OpenNebula, and CloudSigma. Occopus enables the scaling of cluster-oriented components (such as Spark) of the instantiated reference architectures. The presented solution was successfully used in the Hungarian Comparative Agendas Project (CAP) by the Institute for Political Science to classify newspaper articles.
