首页> 外文会议>International Conference on Smart Systems and Data Science >Adaptation of Classical Machine Learning Algorithms to Big Data Context: Problems and Challenges : Case Study: Hidden Markov Models Under Spark
【24h】

Adaptation of Classical Machine Learning Algorithms to Big Data Context: Problems and Challenges : Case Study: Hidden Markov Models Under Spark

机译:古典机器学习算法对大数据背景的适应:问题与挑战:案例研究:火花下的隐马尔可夫模型

获取原文

摘要

Big Data Analytics presents a great opportunity for scientists and businesses. It changed the methods of managing and analyzing the huge amount of data. To make big data valuable, we often use Machine Learning algorithms. Indeed, these algorithms have shown, in the past, their processing speed, efficiency and accuracy. But today, with the complex characteristics of big data, new problems have emerged and we are facing new challenges when developing and designing a new Machine Learning algorithm for Big Data Analytics. Therefore, it is essential to review the classical algorithms to adapt them to this new context. One of the methods of adaptation is the coupling between new technologies (i.e., distributed computing by GPU, Hadoop, Spark) and the Machine Learning algorithms to reduce the computational cost of data analysis. This paper highlights main challenges of adaptation of Machine Learning algorithms to the Big Data context and describes a novel method to make these algorithms efficient and fast in Big Data processing by taking as a case study the Hidden Markov Models using Spark framework. The results of complexity comparison of classical algorithms and those adapted to the Big Data context using Spark show a great improvement.
机译:大数据分析介绍了科学家和企业一个很好的机会。它改变了管理和分析的数据量庞大的方法。为了让大数据有价值,我们经常使用的机器学习算法。事实上,这些算法已示出,在过去,它们的处理速度,效率和准确性。但今天,随着大数据的复杂特点,新问题不断出现,开发和设计新的机器学习算法用于大数据分析的时候,我们面临新的挑战。因此,有必要回顾经典算法,使其适应这种新形势。一个适应的方法是新的技术(即,分布式计算通过GPU,Hadoop的,火花)和机器学习算法,以减少数据分析的计算成本之间的耦合。本文强调的机器学习算法对大数据背景下适应的主要挑战,并介绍了一种新方法,利用星火框架以作为个案研究的隐马尔可夫模型,使这些算法在大数据处理高效,快速。经典算法的复杂度比较那些使用星火适应大数据背景下的结果显示了很大的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号