首页> 外文期刊>Chemosphere >Evaluation and application of machine learning-based retention time prediction for suspect screening of pesticides and pesticide transformation products in LC-HRMS
【24h】

Evaluation and application of machine learning-based retention time prediction for suspect screening of pesticides and pesticide transformation products in LC-HRMS

机译:基于机器学习的保留时间预测的评估与应用杀虫剂和农药转化产物在LC-HRMS中的筛选

获取原文
获取原文并翻译 | 示例
           

摘要

Computational QSAR models have gradually been preferred for retention time prediction in data mining of emerging environmental contaminants using liquid chromatography coupled with mass spectrometry. Generally, the model performance relies on the components such as machine learning algorithms, chemical features, and example data. In this study, we evaluated the performances of four algorithms on three feature sets, using 321 and 77 pesticides as the training and validation sets, respectively. The results were varied with different combinations of algorithms on distinct feature sets. Two strategies including enhancing the complexity of chemical features and enlarging the size of the training set were proved to improve the results. XGBoost, Random Forest, and lightGBM algorithms exhibited the best results when built on a large-scale chemical descriptors, while the Keras algorithm preferred fingerprints. These four models have comparable prediction accuracies that at least 90% of pesticides in validation set can be successfully predicted with DRT 1.0 min. Meanwhile, a blended prediction strategy using average results from four models presented a better result than any single model. This strategy was used for assisting identification of pesticides and pesticide transformation products in 120 strawberry samples from a national survey of food contamination. Twenty pesticides and twelve pesticide transformation products were tentatively identified, where all pesticides and two pesticide transformation products (bifenazate diazene and spirotetramat-enol) were confirmed by standard materials. The outcome of this study suggested that retention time prediction is a valuable approach in compound identification when integrated with in silico MS2 spectra and other MS identification strategies. (C) 2020 Elsevier Ltd. All rights reserved.
机译:使用液相色谱与质谱法耦合液相色谱法,计算QSAR模型逐渐得到了新出现的环境污染物的数据挖掘中的保留时间预测。通常,模型性能依赖于机器学习算法,化学特征和示例数据等组件。在本研究中,我们在三个特征集中评估了四种算法的性能,分别使用321和77农药作为训练和验证集。结果与不同特征集的不同算法组合不同。证明了两种策略,包括增强化学特征的复杂性和扩大培训集的规模,以改善结果。 XGBoost,随机森林和LightGBM算法在大型化学描述符上建立时呈现最佳效果,而Keras算法首选指纹。这四种模型具有可比的预测精度,可以使用DRT <1.0分钟成功预测至少90%的验证集中的农药。同时,使用四种模型的平均结果的混合预测策略提出了比任何单一模型更好的结果。该策略用于辅助在2009年粮食污染调查中识别120个草莓样品中的农药和农药转化产物。暂时发现了20种农药和12种农药转化产品,其中所有农药和两种农药转化产物(二苯基酸盐转化产物(二苯基酸盐二氮杂化物)通过标准材料确认。该研究的结果表明,在与硅MS2光谱和其他MS识别策略中集成时,保留时间预测是复合识别中的有价值的方法。 (c)2020 elestvier有限公司保留所有权利。

著录项

  • 来源
    《Chemosphere》 |2021年第5期|129447.1-129447.10|共10页
  • 作者单位

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Shanghai Changning Ctr Dis Control & Prevent Shanghai 200051 Peoples R China;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Calif Dept Publ Hlth Richmond CA 94804 USA;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

    Shanghai Municipal Ctr Dis Control & Prevent Shanghai 200336 Peoples R China|State Environm Protect Key Lab Environm Hlth Impa Shanghai 200336 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Retention time prediction; Pesticide transformation products; Machine learning; Algorithms; Chemical feature;

    机译:保留时间预测;农药转型产品;机器学习;算法;化学特征;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号