Journal of Scientific Computing

Improved Architectures and Training Algorithms for Deep Operator Networks


Abstract

Operator learning techniques have recently emerged as a powerful tool for learning maps between infinite-dimensional Banach spaces. Trained under appropriate constraints, they can also be effective in learning the solution operator of partial differential equations (PDEs) in an entirely self-supervised manner. In this work we analyze the training dynamics of deep operator networks (DeepONets) through the lens of Neural Tangent Kernel theory, and reveal a bias that favors the approximation of functions with larger magnitudes. To correct this bias we propose to adaptively re-weight the importance of each training example, and demonstrate how this procedure can effectively balance the magnitude of back-propagated gradients during training via gradient descent. We also propose a novel network architecture that is more resilient to vanishing gradient pathologies. Taken together, our developments provide new insights into the training of DeepONets and consistently improve their predictive accuracy by a factor of 10-50x, demonstrated in the challenging setting of learning PDE solution operators in the absence of paired input-output observations. All code and data accompanying this manuscript will be made publicly available at https://github.com/PredictiveIntelligenceLab/ImprovedDeepONets.
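The abstract only sketches the re-weighting idea, so the following minimal JAX illustration shows one way such a scheme could look. The DeepONet forward pass (a dot product between branch-net and trunk-net features) follows the standard DeepONet formulation; the update_weights rule, which scales each training example by the inverse norm of its back-propagated gradient so that all examples contribute comparably, is an illustrative assumption based only on the abstract, not the authors' actual algorithm, and all layer sizes and function names are hypothetical (see the linked repository for the real implementation).

    import jax
    import jax.numpy as jnp

    def init_mlp(key, sizes):
        # Glorot-initialized fully connected network.
        params = []
        for din, dout in zip(sizes[:-1], sizes[1:]):
            key, sub = jax.random.split(key)
            w = jax.random.normal(sub, (din, dout)) * jnp.sqrt(2.0 / (din + dout))
            params.append((w, jnp.zeros(dout)))
        return params

    def mlp(params, x):
        for w, b in params[:-1]:
            x = jnp.tanh(x @ w + b)
        w, b = params[-1]
        return x @ w + b

    def deeponet(params, u, y):
        # Branch net encodes the input function u sampled at fixed sensors;
        # trunk net encodes the query coordinate y; the predicted operator
        # value is the dot product of their latent features.
        branch_params, trunk_params = params
        b = mlp(branch_params, u)   # shape (p,)
        t = mlp(trunk_params, y)    # shape (p,)
        return jnp.dot(b, t)

    def weighted_loss(params, weights, u_batch, y_batch, s_batch):
        # Mean squared error with a per-example importance weight.
        preds = jax.vmap(deeponet, in_axes=(None, 0, 0))(params, u_batch, y_batch)
        return jnp.mean(weights * (preds - s_batch) ** 2)

    def update_weights(params, u_batch, y_batch, s_batch, eps=1e-8):
        # Hypothetical re-weighting rule: weight each example by the inverse
        # norm of its back-propagated gradient, then normalize so the mean
        # weight is 1. This balances gradient magnitudes across examples.
        def grad_norm(u, y, s):
            g = jax.grad(lambda p: (deeponet(p, u, y) - s) ** 2)(params)
            leaves = jax.tree_util.tree_leaves(g)
            return jnp.sqrt(sum(jnp.sum(l ** 2) for l in leaves))
        norms = jax.vmap(grad_norm)(u_batch, y_batch, s_batch)
        w = 1.0 / (norms + eps)
        return w / jnp.mean(w)

A usage sketch with synthetic shapes (8 input functions sampled at 100 sensor points, one scalar query coordinate each):

    key = jax.random.PRNGKey(0)
    kb, kt, kd = jax.random.split(key, 3)
    params = (init_mlp(kb, [100, 64, 64]), init_mlp(kt, [1, 64, 64]))
    u = jax.random.normal(kd, (8, 100))  # input functions at sensor locations
    y = jax.random.uniform(kd, (8, 1))   # query coordinates
    s = jax.random.normal(kd, (8,))      # target operator values
    w = update_weights(params, u, y, s)
    print(weighted_loss(params, w, u, y, s))

The weights would be refreshed periodically during gradient-descent training, so examples whose gradients would otherwise dominate (e.g., functions with larger magnitudes, per the bias the paper identifies) are down-weighted.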
