首页> 美国卫生研究院文献>Frontiers in Bioengineering and Biotechnology >Learning to Classify Organic and Conventional Wheat – A Machine Learning Driven Approach Using the MeltDB 2.0 Metabolomics Analysis Platform
【2h】

Learning to Classify Organic and Conventional Wheat – A Machine Learning Driven Approach Using the MeltDB 2.0 Metabolomics Analysis Platform

机译:学习对有机小麦和常规小麦进行分类–使用MeltDB 2.0代谢组学分析平台的机器学习驱动方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present results of our machine learning approach to the problem of classifying GC-MS data originating from wheat grains of different farming systems. The aim is to investigate the potential of learning algorithms to classify GC-MS data to be either from conventionally grown or from organically grown samples and considering different cultivars. The motivation of our work is rather obvious nowadays: increased demand for organic food in post-industrialized societies and the necessity to prove organic food authenticity. The background of our data set is given by up to 11 wheat cultivars that have been cultivated in both farming systems, organic and conventional, throughout 3 years. More than 300 GC-MS measurements were recorded and subsequently processed and analyzed in the MeltDB 2.0 metabolomics analysis platform, being briefly outlined in this paper. We further describe how unsupervised (t-SNE, PCA) and supervised (SVM) methods can be applied for sample visualization and classification. Our results clearly show that years have most and wheat cultivars have second-most influence on the metabolic composition of a sample. We can also show that for a given year and cultivar, organic and conventional cultivation can be distinguished by machine-learning algorithms.
机译:我们提出了我们的机器学习方法的结果,以解决源自不同农业系统的小麦籽粒的GC-MS数据分类问题。目的是研究学习算法对常规生长样品或有机生长样品的GC-MS数据进行分类的可能性,并考虑不同的品种。如今,我们的工作动机非常明显:后工业化社会对有机食品的需求增加,并且有必要证明有机食品的真实性。我们的数据集的背景是在长达3年的时间内,通过有机和传统两种耕作系统种植的多达11个小麦品种给出的。记录了300多次GC-MS测量,随后在MeltDB 2.0代谢组学分析平台中进行了处理和分析,本文对此进行了简要概述。我们进一步描述如何将无监督(t-SNE,PCA)和有监督(SVM)方法应用于样本可视化和分类。我们的结果清楚地表明,年份对样品的代谢成分影响最大,而小麦品种对代谢成分的影响次之。我们还可以证明,对于给定的年份和品种,可以通过机器学习算法区分有机栽培和常规栽培。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号