首页> 外文期刊>Science Advances >Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets
【24h】

Machine learning in a data-limited regime: Augmenting experiments with synthetic data uncovers order in crumpled sheets

机译:在数据受限的情况下进行机器学习:使用合成数据进行增强实验会发现皱巴巴的纸张顺序

获取原文
           

摘要

Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce.
机译:机器学习作为一种识别复杂的高维数据结构的强大工具而受到了广泛的关注。但是,这些技术表面上不适用于缺乏数据或获得昂贵数据的实验系统。在这里,我们介绍了一种策略,可以通过使用简单得多的姊妹系统的综合生成的数据扩充实验数据集来解决这一僵局。具体来说,我们研究了皱折的薄板的折痕网络中自发出现的局部秩序,这是空间复杂性的一个典范示例,并且表明即使在数据受限的情况下,机器学习技术也可以有效。这是通过用不竭的数量的刚性平折薄板的模拟数据来补充稀缺的实验数据集而实现的,这些数据很容易模拟并共享共同的统计属性。这极大地提高了模式完成测试问题中的预测能力,并证明了机器学习在台式实验中的有用性,在台式实验中数据很好但是却很少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号