首页> 外文期刊>Decision support systems >Rebuilding sample distributions for small dataset learning
【24h】

Rebuilding sample distributions for small dataset learning

机译:重建样本分布以进行小型数据集学习

获取原文
获取原文并翻译 | 示例
       

摘要

Over the past few decades, a few learning algorithms have been proposed to extract knowledge from data. The majority of these algorithms have been developed with the assumption that training sets can denote populations. When the training sets contain only a few properties of their populations, the algorithms may extract minimal and/or biased knowledge for decision makers. This study develops a systematic procedure based on fuzzy theories to create new training sets by rebuilding the possible sample distributions, where the procedure contains new functions that estimate domains and a sample generating method. In this study, two real cases of a leading company in the thin film transistor liquid crystal display (TFT-LCD) industry are examined. Two learning algorithms-a back-propagation neural network and support vector regression-are employed for modeling, and two sample generation approaches-bootstrap aggregating (bagging) and the synthetic minority over-sampling technique (SMOTE)-are employed to compare the accuracy of the models. The results indicate that the proposed method outperforms bagging and the SMOTE with the greatest amount of statistical support. (C) 2017 Elsevier B.V. All rights reserved.
机译:在过去的几十年中,已经提出了一些学习算法来从数据中提取知识。这些算法中的大多数是在训练集可以表示总体的前提下开发的。当训练集仅包含其总体的一些属性时,算法可能会为决策者提取最少和/或有偏见的知识。这项研究开发了基于模糊理论的系统程序,通过重建可能的样本分布来创建新的训练集,其中该程序包含估计域的新功能和样本生成方法。在这项研究中,考察了薄膜晶体管液晶显示器(TFT-LCD)行业一家领先公司的两个实际案例。使用两种学习算法-反向传播神经网络和支持向量回归-进行建模,并使用两种样本生成方法-引导聚合(装袋)和合成少数过采样技术(SMOTE)-比较样本的准确性。模型。结果表明,所提出的方法在统计支持量最大的情况下胜过装袋和SMOTE。 (C)2017 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Decision support systems》 |2018年第1期|66-76|共11页
  • 作者单位

    Natl Cheng Kung Univ, Dept Ind & Informat Management, Univ Rd, Tainan 70101, Taiwan;

    Natl Cheng Kung Univ, Dept Ind & Informat Management, Univ Rd, Tainan 70101, Taiwan;

    Natl Cheng Kung Univ, Dept Ind & Informat Management, Univ Rd, Tainan 70101, Taiwan;

    Natl Cheng Kung Univ, Dept Ind & Informat Management, Univ Rd, Tainan 70101, Taiwan;

    Natl Cheng Kung Univ, Dept Ind & Informat Management, Univ Rd, Tainan 70101, Taiwan;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Small data; Virtual sample; Data preprocessing;

    机译:小数据;虚拟样本;数据预处理;
  • 入库时间 2022-08-18 02:13:08

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号