首页> 外文期刊>Decision support systems >Improving learning accuracy by using synthetic samples for small datasets with non-linear attribute dependency
【24h】

Improving learning accuracy by using synthetic samples for small datasets with non-linear attribute dependency

机译:通过对具有非线性属性相关性的小型数据集使用合成样本来提高学习准确性

获取原文
获取原文并翻译 | 示例
       

摘要

Small-data problems are commonly encountered in the early stages of a new manufacturing procedure, presenting challenges to both academics and practitioners, as good performance is difficult to achieve with learning models when there is a lack of sufficient data. Virtual sample generation (VSG) has been shown to be an effective method to overcome this issue in a wide range of studies in various fields. Such works usually assume that the relations among attributes are independent of each other, and produce synthetic data by using sample distributions of these. However, the VSG technique may be ineffective if the real data has interrelated attributes. Therefore, this research provides a novel procedure to generate related virtual samples with non-linear attribute dependency. To construct a relational model between the independent and dependent attributes, we employ gene expression programming (CEP) to find the most suitable mathematical model. One practical dataset and three real UCl datasets are presented in this paper to verify the effectiveness of the proposed method, and the results show that the proposed approach has better learning accuracy with regard to a back-propagation neural (BPN) network than that of the well-known mega-trend-diffusion (MTD) and the multi regression analysis (MRA) approaches.
机译:在新的制造过程的早期阶段,通常会遇到小数据问题,这给学者和从业人员都带来了挑战,因为在缺乏足够数据的情况下,学习模型很难获得良好的性能。在各个领域的广泛研究中,虚拟样本生成(VSG)已被证明是克服此问题的有效方法。这些工作通常假定属性之间的关系彼此独立,并通过使用这些属性的样本分布来生成综合数据。但是,如果实际数据具有相互关联的属性,则VSG技术可能无效。因此,本研究提供了一种新颖的程序来生成具有非线性属性相关性的相关虚拟样本。要构建独立属性和依赖属性之间的关系模型,我们采用基因表达编程(CEP)来找到最合适的数学模型。本文提出了一个实际的数据集和三个真实的UCl数据集以验证该方法的有效性,结果表明,相对于反向传播神经(BPN)网络,该方法具有更好的学习准确性。众所周知的大趋势扩散(MTD)和多元回归分析(MRA)方法。

著录项

  • 来源
    《Decision support systems》 |2014年第3期|286-295|共10页
  • 作者单位

    Department of Industrial and Information Management,National Cheng Kung University,University Road,Tainan 70101,Taiwan,ROC;

    Department of Industrial and Information Management,National Cheng Kung University,University Road,Tainan 70101,Taiwan,ROC;

    Department of Industrial and Information Management,National Cheng Kung University,University Road,Tainan 70101,Taiwan,ROC;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Small dataset; Attribute dependency; Related virtual samples; Gene expression programming; Mega trend diffusion;

    机译:小数据集;属性依赖;相关的虚拟样本;基因表达编程;大趋势扩散;
  • 入库时间 2022-08-18 02:13:38

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号