...
首页> 外文期刊>Metabolites >A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods
【24h】

A Data Set of 255,000 Randomly Selected and Manually Classified Extracted Ion Chromatograms for Evaluation of Peak Detection Methods

机译:一个255,000的数据集随机选择和手动分类提取离子色谱图,用于评估峰值检测方法

获取原文
           

摘要

Non-targeted mass spectrometry (MS) has become an important method over recent years in the fields of metabolomics and environmental research. While more and more algorithms and workflows become available to process a large number of non-targeted data sets, there still exist few manually evaluated universal test data sets for refining and evaluating these methods. The first step of non-targeted screening, peak detection and refinement of it is arguably the most important step for non-targeted screening. However, the absence of a model data set makes it harder for researchers to evaluate peak detection methods. In this Data Descriptor, we provide a manually checked data set consisting of 255,000 EICs (5000 peaks randomly sampled from across 51 samples) for the evaluation on peak detection and gap-filling algorithms. The data set was created from a previous real-world study, of which a subset was used to extract and manually classify ion chromatograms by three mass spectrometry experts. The data set consists of the converted mass spectrometry files, intermediate processing files and the central file containing a table with all important information for the classified peaks.
机译:非靶向质谱(MS)已成为近年来代谢组和环境研究领域的重要方法。虽然越来越多的算法和工作流程可用于处理大量非目标数据集,但仍然存在一些手动评估的通用测试数据集,用于精炼和评估这些方法。非靶向筛选,峰值检测和改进的第一步可以是非靶向筛选的最重要步骤。然而,没有模型数据集使得研究人员更难评估峰值检测方法。在该数据描述符中,我们提供由255,000个EICS(从500个样本中随机采样5000峰)的手动检查的数据集,用于评估峰值检测和间隙填充算法。数据集是从先前的真实研究创建的,其中子集用于通过三个质谱专家提取和手动分类离子色谱图。数据集包括转换的质谱文件,中间处理文件和包含表的中央文件,其中包含对分类峰的所有重要信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号