首页> 外文会议>IEEE International Conference on Emerging Technologies and Factory Automation >Proposal of Feature Value Selection Method for Time-Critical Learning
【24h】

Proposal of Feature Value Selection Method for Time-Critical Learning

机译:特征值选择方法的提议

获取原文

摘要

The development of IoT has led to the creation of a data-enriched environment that enables data gathering by using distributed sensors and terminals. However, in this environment, the cost of data analysis has increased. Machine learning has gained attention for reducing the cost because enabling automatic data analysis, as well as multidimensional data, is expected. However, for enormous data, such as Big Data, we still have to pay costs. Therefore, selecting feature values when using machine learning technology is essential, especially as inputs of a classifier. Selecting the feature values increases its estimation accuracy. Moreover, the time cost, as well as calculation cost, needs consideration for the actual time-critical use of machine learning, especially in its learning process. Therefore, in this study, we proposed an algorithm that selected suitable feature values in required time. The proposed method consists of two stages: stepwise input selection stage using ANOVA and feature deletion stage according to the contribution rate of the features to estimate accuracy. These selection and deletion processes continue until the required processing time. We confirmed the efficiency of the proposed method by using an environment of a crystallization process in a factory and a household's occupancy estimation. A comparison with the original stepwise input method proved that the proposed method improved the estimation accuracy by 2% and 5% in the estimation of the substance amount of the crystallization process and household's occupancy, respectively.
机译:IOT的开发导致创建了通过使用分布式传感器和终端来实现数据收集的丰富环境。但是,在这种环境中,数据分析的成本增加了。由于启用自动数据分析,以及多维数据,机器学习已经注意到降低成本,以及多维数据。但是,对于巨大的数据,如大数据,我们仍然需要支付成本。因此,选择使用机器学习技术时的要素值是必不可少的,特别是作为分类器的输入。选择要素值会增加其估计精度。此外,时间成本以及计算成本需要考虑机器学习的实际时间关键时间,特别是在其学习过程中。因此,在本研究中,我们提出了一种在所需时间选择合适的特征值的算法。该方法由两个阶段组成:逐步输入选择阶段使用ANOVA和特征删除阶段根据特征的贡献率来估计准确性。这些选择和删除过程在所需的处理时间之前继续。我们通过在工厂和家庭的占用估计中使用结晶过程的环境确认了所提出的方法的效率。与原始逐步输入法的比较证明,该方法分别在估计结晶过程和家庭占用的物质量的估计中提高了2%和5%的估计精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号