...
首页> 外文期刊>PLoS One >A novel cross-validation strategy for artificial neural networks using distributed-lag environmental factors
【24h】

A novel cross-validation strategy for artificial neural networks using distributed-lag environmental factors

机译:分布式滞后环境因素的人工神经网络的一种新型交叉验证策略

获取原文
           

摘要

In recent years, machine learning methods have been applied to various prediction scenarios in time-series data. However, some processing procedures such as cross-validation (CV) that rearrange the order of the longitudinal data might ruin the seriality and lead to a potentially biased outcome. Regarding this issue, a recent study investigated how different types of CV methods influence the predictive errors in conventional time-series data. Here, we examine a more complex distributed lag nonlinear model (DLNM), which has been widely used to assess the cumulative impacts of past exposures on the current health outcome. This research extends the DLNM into an artificial neural network (ANN) and investigates how the ANN model reacts to various CV schemes that result in different predictive biases. We also propose a newly designed permutation ratio to evaluate the performance of the CV in the ANN. This ratio mimics the concept of the R-square in conventional statistical regression models. The results show that as the complexity of the ANN increases, the predicted outcome becomes more stable, and the bias shows a decreasing trend. Among the different settings of hyperparameters, the novel strategy, Leave One Block Out Cross-Validation (LOBO-CV), demonstrated much better results, and the lowest mean square error was observed. The hyperparameters of the ANN trained by the LOBO-CV yielded the minimum number of prediction errors. The newly proposed permutation ratio indicates that LOBO-CV can contribute up to 34% of the prediction accuracy.
机译:近年来,机器学习方法已应用于时间序列数据中的各种预测方案。然而,重新排列纵向数据顺序的交叉验证(CV)等一些处理过程可能会破坏序列性并导致可能偏置的结果。关于这个问题,最近的一项研究研究了不同类型的CV方法如何影响传统的时间序列数据中的预测误差。在这里,我们研究了一种更复杂的分布式滞后非线性模型(DLNM),已被广泛用于评估过去暴露对当前健康结果的累积影响。该研究将DLNM扩展到人工神经网络(ANN)中,并研究了ANN模型如何对导致不同预测偏差的各种CV方案的反应。我们还提出了一种新设计的置换比率来评估CV在ANN中的性能。该比率模仿传统统计回归模型中R范围的概念。结果表明,随着ANN的复杂性增加,预测结果变得更加稳定,偏差显示趋势降低。在近似参数的不同环境中,新颖的策略,留下一个块横跨交叉验证(Lobo-CV),展示了更好的结果,并且观察到最低平均误差。由Lobo-CV训练的ANN的封闭率产生了最小的预测误差数。新提出的排列比表明Lobo-CV可以贡献高达34%的预测精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号