【24h】

Big Data Analytic Paradigms -From PCA to Deep Learning

机译:大数据分析范式 - 从PCA到深度学习

获取原文

摘要

Large sets of data (numerical, textural and image) have been accumulating in all aspects of our lives for a long time. Advances in sensor technology, the Internet, social networks, wireless communication, and inexpensive memory have all contributed to an explosion of "Big Data". Big data is created in many ways in today's highly inter-connected world. Social networks, system of systems (SoS: complex interoperable) systems and wireless systems are only some of the platforms creating big data. Recent efforts have developed a promising approach, called "Data Analytics", which uses statistical and computational intelligence (CI) tools such as principal component analysis (PCA), clustering, fuzzy logic, neuro-computing, evolutionary computation, Bayesian networks, etc. to reduce the size of "Big Data" to a manageable size and apply these tools to a) extract information, b) build a knowledge base using the derived data, and c) eventually develop a non-parametric model for the “Big Data”. This paper attempts to construct a bridge between SoS and Data Analytic to develop reliable models for such systems. One of the recent most promising data analytic too is "Deep Learning". Deep learning is the broad term for the recent development and extensions of neural networks in the machine learning community, which has allowed for state of the art results in speech, image, and natural language processing tasks. Hierarchical learning is an area of research which focuses on learning high order representations from low level data. Learning to recognize objects from images, recognizing words or syllables from audio, or recognizing poses and movement from video are all good examples of modern hierarchical learning research, which is a central focus of the "deep learning" movement in the machine learning and computational statistics community. This paper will give a rather comprehensive look at all the BIG data analytic tools – old and new. Data bases of photovoltaic and wind energy data will all be used here.
机译:大量数据(数值,纹理和图像)已经积累了长期我们生活的所有方面。传感器技术的进步,互联网,社交网络,无线通信和廉价内存都有促成了“大数据”的爆炸。在当今高度关联的世界中,在许多方面创建了大数据。社交网络,系统系统(SOS:复杂的互操作)系统和无线系统只是创建大数据的一些平台。最近的努力已经开发了一种有希望的方法,称为“数据分析”,它使用统计和计算智能(CI)工具,如主成分分析(PCA),聚类,模糊逻辑,神经计算,进化计算,贝叶斯网络等。为了将“大数据”的大小降低到可管理的大小并应用这些工具到a)提取信息,b)使用派生数据构建知识库,并且c)最终为“大数据”开发非参数模型。 。本文试图在SOS和数据分析之间构建一个桥梁,以开发这种系统的可靠模型。最近最有前途的数据分析之一也是“深入学习”。深度学习是最近在机器学习界中的神经网络的开发和扩展的广泛术语,这允许最先进的状态导致语音,图像和自然语言处理任务。分层学习是一个研究领域,专注于从低级数据学习高阶表示。学习从图像中识别对象,识别来自音频的单词或音节,或识别来自视频的姿势和运动是现代分层学习研究的所有良好示例,这是机器学习和计算统计中“深度学习”运动的中央焦点社区。本文将提供相当综合的全面看所有大数据分析工具 - 新旧。这里将使用光伏和风能数据的数据基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号