首页> 美国卫生研究院文献>other >A large scale test dataset to determine optimal retention index threshold based on three mass spectral similarity measures
【2h】

A large scale test dataset to determine optimal retention index threshold based on three mass spectral similarity measures

机译:基于三个质谱相似度测量大规模测试数据集以确定最佳保留索引阈值

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Retention index (RI) is useful for metabolite identification. However, when RI is integrated with mass spectral similarity for metabolite identification, many controversial RI threshold setup are reported in literatures. In this study, a large scale test dataset of 5844 compounds with both mass spectra and RI information were created from National Institute of Standards and Technology (NIST) repetitive mass spectra (MS) and RI library. Three MS similarity measures: NIST composite measure, the real part of Discrete Fourier Transform (DFT.R) and the detail of Discrete Wavelet Transform (DWT.D) were used to investigate the accuracy of compound identification using the test dataset. To imitate real identification experiments, NIST MS main library was employed as reference library and the test dataset was used as search data. Our study shows that the optimal RI thresholds are 22, 15, and 15 i.u. for the NIST composite, DFT.R and DWT.D measures, respectively, when the RI and mass spectral similarity are integrated for compound identification. Compared to the mass spectrum matching, using both RI and mass spectral matching can improve the identification accuracy by 1.7%, 3.5%, and 3.5% for the three mass spectral similarity measures, respectively. It is concluded that the improvement of RI matching for compound identification heavily depends on the method of MS spectral similarity measure and the accuracy of RI data.
机译:保留指数(RI)可用于代谢物鉴定。然而,当RI与代谢物识别的质量光谱相似性集成时,文献中报告了许多有争议的RI阈值设置。在这项研究中,从国家标准和技术研究所(NIST)重复质谱(MS)和RI文库中创建了5844个具有质量谱和RI信息的大规模测试数据集。三个MS相似度措施:NIST复合度量,离散傅里叶变换(DFT.R)的实部和离散小波变换(DWT.D)的细节来研究使用测试数据集的化合物识别的准确性。为了模仿真实的识别实验,NIST MS主库被用作参考文库,并且测试数据集用作搜索数据。我们的研究表明,最佳RI阈值是22,15和15 i.u。对于NIST复合,DFT.R和DWT.D分别措施,当RI和质谱相似性被整合以进行化合物鉴定。与质谱相比,使用RI和质谱匹配可以分别将鉴定精度提高1.7%,3.5%和3.5%,为三种质谱相似度测量。结论是,改善化合物识别的RI匹配大大取决于MS光谱相似度量的方法和RI数据的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号