...
首页> 外文期刊>Computational Materials Science >ThermoEPred-EL: Robust bandgap predictions of chalcogenides with diamond-like structure via feature cross-based stacked ensemble learning
【24h】

ThermoEPred-EL: Robust bandgap predictions of chalcogenides with diamond-like structure via feature cross-based stacked ensemble learning

机译:Thermoepred-el:通过特征跨基堆叠集合学习的菱形结构具有硫化物的强大带隙预测

获取原文
获取原文并翻译 | 示例
           

摘要

Implementation on rapid and accurate bandgap prediction has great practical implications for a range of applications. While quantum mechanical computations are enormously computation-time intensive, using informatics-based statistical learning approaches can be a promising alternative due to its availability, power and relatively limited cost of high-performance computational equipment. Here we demonstrate a systematic ensemble learning model which integrates a novel feature-engineering approach and a robust learning framework for predicting bandgaps of one series of typical thermoelectric materials: chalcogenides with diamond-like structure. After combining a feature crossing technique with a feature selection method, the proposed optimal descriptor set is identified by searching the feature space of 23,454 descriptors stemmed from the elemental features. Stable statistic-based feature selection methods are applied to identify the most crucial and relevant descriptors. The stacked ensemble learning model, which integrates the advantages of three different level-0 models (LASSO, SVR, and AdaBoost) and one level-1 model (GBDT), obtains 90.48% prediction accuracy, thus improving model accuracy and robustness. The results demonstrate the interpretability and generalizability of the stacked ensemble model, which can be applied to bandgap predictions in other material systems, thereby accelerating the design and optimization process for discovering new functional materials.
机译:在快速和准确的带隙预测上实现对一系列应用具有很大的实际影响。虽然量子机械计算是极凡的计算时间密集型,但由于其可用性,功率和相对有限的高性能计算设备的成本,使用基于信息学的统计学习方法可能是一个有前途的替代方案。在这里,我们展示了一种系统的集合学习模型,其集成了一种新颖的特征 - 工程方法和一种稳健的学习框架,用于预测一系列典型的热电材料的带隙:硫族化合物用金刚石结构。在用特征选择方法组合特征交叉技术之后,通过搜索从元素特征源的23,454描述符的特征空间来识别所提出的最佳描述符集。应用稳定的基于统计的特征选择方法来标识最重要的和相关的描述符。堆叠的集合学习模型,集成了三种不同级别的型号(套索,SVR和Adaboost)和一个级别-1型号(GBDT)的优势,获得了90.48%的预测精度,从而提高了模型精度和鲁棒性。结果证明了堆叠集合模型的可解释性和概括性,其可以应用于其他材料系统中的带隙预测,从而加速了发现新功能材料的设计和优化过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号