首页> 美国卫生研究院文献>Scientific Data >Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction
【2h】

Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction

机译:通过半监督关系提取自动生成居里和Néel温度的材料数据库

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Large auto-generated databases of magnetic materials properties have the potential for great utility in materials science research. This article presents an auto-generated database of 39,822 records containing chemical compounds and their associated Curie and Néel magnetic phase transition temperatures. The database was produced using natural language processing and semi-supervised quaternary relationship extraction, applied to a corpus of 68,078 chemistry and physics articles. Evaluation of the database shows an estimated overall precision of 73%. Therein, records processed with the text-mining toolkit, ChemDataExtractor, were assisted by a modified Snowball algorithm, whose original binary relationship extraction capabilities were extended to quaternary relationship extraction. Consequently, its machine learning component can now train with ≤ 500 seeds, rather than the 4,000 originally used. Data processed with the modified Snowball algorithm affords 82% precision. Database records are available in MongoDB, CSV and JSON formats which can easily be read using Python, R, Java and MatLab. This makes the database easy to query for tackling big-data materials science initiatives and provides a basis for magnetic materials discovery.
机译:大型自动生成的磁性材料特性数据库在材料科学研究中具有巨大的应用潜力。本文介绍了一个自动生成的数据库,其中包含化合物及其相关的居里和Néel磁性相变温度的39,822条记录。该数据库是使用自然语言处理和半监督的四元关系提取生成的,并应用于68,078条化学和物理文章的语料库。对数据库的评估显示,估计的整体精度为73%。其中,使用文本挖掘工具包ChemDataExtractor处理的记录得到了改进的Snowball算法的辅助,该算法的原始二元关系提取功能已扩展到四元关系提取。因此,其机器学习组件现在可以训练≤500个种子,而不是最初使用的4000个种子。使用改进的Snowball算法处理的数据可提供82%的精度。数据库记录以MongoDB,CSV和JSON格式提供,可以使用Python,R,Java和MatLab轻松读取。这使数据库易于查询以解决大数据材料科学计划,并为磁性材料发现提供了基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号