首页> 外文会议>International conference on opto-electronics engineering and information science >Small Data Set Learning with Synthetic Samples and Area Membership Values
【24h】

Small Data Set Learning with Synthetic Samples and Area Membership Values

机译:用合成样本和区域成员值学习的小数据集

获取原文

摘要

A common strategy in manufacturing systems is to execute pilot runs before the mass production. We model the limited data obtained from pilot runs,and try to shorten the lead time required for predict future production in this study. A manufacturing system is usually comprehensive;Artificial Neural Networks are commonly applied to extract management knowledge from acquired data for its non-linear properties. It is the fundamental assumption for Artificial Neural Networks to get as large a number of training data as needed;nevertheless, this is often not achievable for pilot runs because there are few data obtained during trial stages,and theoretically this means that the obtained knowledge is fragile. The purpose of this research is through the proposed procedure to decrease the Artificial Neural Networks prediction error rate in small data set problems. Based on a consideration of dependent data attributes, the proposed procedure is designed to utilize extreme value theory and statistical prediction interval calculations to derive fuzzybased synthetic samples to fill sparse data information gaps. After synthetic samples are generated,area membership values are applied to combine data values,and occurrence possibility for every sample is used as the input of Artificial Neural Networks. The results of this research indicate that the prediction error rate can be significantly decreased by applying the proposed procedure to a very small data set with attribute dependency.
机译:在制造系统的常见策略是大规模生产之前,执行试运行。我们从试运行获得的有限的数据模型,并尽量缩短预测未来的生产在这项研究中所需要的准备时间。制造系统通常是全面;人工神经网络通常由为它的非线性特性所获取的数据应用到提取管理知识。这是基本的假设为人工神经网络,以获得尽可能大的数量的训练数据的需要;然而,这往往不是试运行实现的,因为有在审判阶段获得的数据很少,理论上这意味着,所获得的知识脆弱的。这项研究的目的是通过建议的程序,以减少在小数据集问题的人工神经网络预测误差率。根据所考虑的从属数据属性的,所提出的过程被设计成利用极值理论和统计预测间隔计算以导出fuzzybased合成样品填充稀疏数据信息的空白。产生合成的样品后,区域成员值应用到数据值进行组合,并为每样品发生的可能性被用作神经网络的输入端。这项研究的结果表明,预测误差率可以通过应用程序建议与属性的依赖非常小的数据集显著下降。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号