首页> 外文会议>IEEE International Instrumentation and Measurement Technology Conference >A novel IVS procedure for handling Big Data with Artificial Neural Networks
【24h】

A novel IVS procedure for handling Big Data with Artificial Neural Networks

机译:一种用人工神经网络处理大数据的新型IVS程序

获取原文

摘要

In recent times, thanks to the availability of a large quantity of data coming from the industrial process, several techniques based on a data-driven approach could be developed. Between all the data-driven techniques, as Principle Component Regression, Support Vector Machines, Artificial Neural Networks, Neuro-Fuzzy Systems, and many others, the data on which they rely should be analyzed to find correlations and dependencies that could improve their design. For this reason, the Input variable Selection (IVS) process has become of great interest in the recent period. The classical IVS relies on classical statistics, as Pearson coefficients, able to discover linear dependencies among data; today, due to the significant amount of data available, the challenge of also discovering non-linear dependencies appears to be a necessary skill, mainly for the design and development of a neural network. This paper proposes the use of a novel statistical tool named Maximal Information Coefficient (MIC) for developing an IVS procedure able to discover dependencies in a considerable dataset and guide the IVS designer to the selection of input variables in a data-driven application. As a case study, the procedure will be applied to a real application developed in the context of the Swedish forest industry, in order to choose the input variables of a neural network able to estimate the timber bundles volume, which represents an expensive parameter to measure in this context.
机译:最近,由于从工业过程中提供了大量数据的可用性,可以开发基于数据驱动方法的几种技术。在所有数据驱动技术之间,作为原理成分回归,支持向量机,人工神经网络,神经模糊系统以及许多其他人,应该分析他们依赖的数据,以找到可以改善其设计的相关性和依赖关系。因此,输入变量选择(IVS)过程在最近的时间内变得非常兴趣。古典IVS依赖于古典统计数据,作为Pearson系数,能够发现数据之间的线性依赖性;如今,由于可用的大量数据,发现非线性依赖性的挑战似乎是必要的技能,主要用于神经网络的设计和开发。本文提出了使用名为最大信息系数(MIC)的新型统计工具,用于开发能够发现IVS程序,能够在可相当大的数据集中发现依赖性,并将IVS设计者引导到数据驱动应用中的输入变量。作为一个案例研究,该程序将应用于在瑞典林业行业的背景下开发的实际应用程序,以便选择能够估计木材捆绑体积的神经网络的输入变量,这代表了昂贵的测量参数在这种情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号