Current breathomics-a review on data pre-processing techniques and machine learning in metabolomics breath analysis

A Smolinska; A-Ch Hauschild; R R R Fijten; J W Dallinga; J Baumbach; F J van Schooten

首页> 外文期刊>Journal of breath research >Current breathomics-a review on data pre-processing techniques and machine learning in metabolomics breath analysis

【24h】

Current breathomics-a review on data pre-processing techniques and machine learning in metabolomics breath analysis

机译：当前的呼吸组学-代谢组学呼吸分析中的数据预处理技术和机器学习综述

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We define breathomics as the metabolomics study of exhaled air. It is a strongly emerging metabolomics research field that mainly focuses on health-related volatile organic compounds (VOCs). Since the amount of these compounds varies with health status, breathomics holds great promise to deliver non-invasive diagnostic tools. Thus, the main aim of breathomics is to find patterns of VOCs related to abnormal (for instance inflammatory) metabolic processes occurring in the human body. Recently, analytical methods for measuring VOCs in exhaled air with high resolution and high throughput have been extensively developed. Yet, the application of machine learning methods for fingerprinting VOC profiles in the breathomics is still in its infancy. Therefore, in this paper, we describe the current state of the art in data pre-processing and multivariate analysis of breathomics data. We start with the detailed pre-processing pipelines for breathomics data obtained from gas-chromatography mass spectrometry and an ion-mobility spectrometer coupled to multi-capillary columns. The outcome of data pre-processing is a matrix containing the relative abundances of a set of VOCs for a group of patients under different conditions (e.g. disease stage, treatment). Independently of the utilized analytical method, the most important question, 'which VOCs are discriminatory?', remains the same. Answers can be given by several modern machine learning techniques (multivariate statistics) and, therefore, are the focus of this paper. We demonstrate the advantages as well the drawbacks of such techniques. We aim to help the community to understand how to profit from a particular method. In parallel, we hope to make the community aware of the existing data fusion methods, as yet unresearched in breathomics.

机译：我们将呼吸组学定义为呼出气的代谢组学研究。这是一个新兴的代谢组学研究领域，主要致力于健康相关的挥发性有机化合物（VOC）。由于这些化合物的量随健康状况而变化，呼吸组学有望提供无创诊断工具。因此，呼吸组学的主要目的是寻找与人体内发生的异常（例如炎症性）代谢过程有关的VOCs模式。最近，以高分辨率和高通量测量呼出空气中VOC的分析方法得到了广泛的发展。然而，机器学习方法在呼吸组学中对VOC轮廓进行指纹识别的应用仍处于起步阶段。因此，在本文中，我们描述了呼吸毒理学数据的数据预处理和多元分析的最新技术。我们从详细的预处理管线开始，以获取从气相色谱质谱法和耦合到多毛细管色谱柱的离子迁移谱仪获得的呼吸动力学数据。数据预处理的结果是一个矩阵，其中包含一组在不同条件下（例如疾病阶段，治疗）的患者的一组VOC的相对丰度。与所采用的分析方法无关，最重要的问题“哪些挥发性有机化合物具有歧视性？”保持不变。可以通过几种现代机器学习技术（多元统计）给出答案，因此，这是本文的重点。我们展示了这种技术的优点以及缺点。我们旨在帮助社区了解如何从特定方法中获利。同时，我们希望使社区了解呼吸呼吸组学尚未研究的现有数据融合方法。

著录项

来源
《Journal of breath research》 |2014年第2期|共20页
作者
A Smolinska; A-Ch Hauschild; R R R Fijten; J W Dallinga; J Baumbach; F J van Schooten;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类内科学;
关键词
GC-MS; MCC-IMS; exhaled air; multivariate analysis; volatile organic compounds (VOCs);

机译：GC-MS;MCC-IMS;呼出空气;多变量分析;挥发性有机化合物（VOC）;
入库时间 2022-08-18 23:54:10

相似文献

外文文献
中文文献
专利

1. Current breathomics-a review on data pre-processing techniques and machine learning in metabolomics breath analysis [J] . A Smolinska, A-Ch Hauschild, R R R Fijten, Journal of breath research . 2014,第2期

机译：当前的呼吸组学-代谢组学呼吸分析中的数据预处理技术和机器学习综述
2. Data-based quality analysis in machining production: Influence of data pre-processing on the results of machine learning models [J] . Amina Ziegenbein, Joachim Metternich Procedia CIRP . 2021,第a期

机译：加工生产中基于数据的质量分析：数据预处理对机器学习模型结果的影响
3. Predicting the botanical and geographical origin of honey with multivariate data analysis and machine learning techniques: A review [J] . Maione Camila, Barbosa Fernando Jr., Barbosa Rommel Melgaco Computers and Electronics in Agriculture . 2019,第期

机译：预测多元数据分析和机器学习技术的植物和地理来源：综述
4. Classification of Astronomical Objects in the Galaxy M81 using Machine Learning Techniques II. An Application of Clustering in Data Pre-processing [C] . Tapanapong Chuntama, Chutipong Suwannajak, Prapaporn Techa-Angkoon, International Joint Conference on Computer Science and Software Engineering . 2021

机译：使用机器学习技术II的Galaxy M81中天文对象的分类。在数据预处理中群集的应用
5. Investigation of Mid-Latitude Mesoscale Convective Systems Cloud and Precipitation Properties and Their Associated Environments through an Integrative Analysis of Radar, Satellite, Reanalysis Data and Machine Learning Techniques [D] . Cui, Wenjun. 2021

机译：通过雷达，卫星，重新分析数据和机器学习技术的一体化分析研究中纬度Mescle对流系统云和沉淀特性及其相关环境
6. Current Developments in Machine Learning Techniques in Biological Data Mining [O] . Gerard G Dumancas, Indra Adrianto, Ghalib Bello, 2017

机译：生物数据挖掘中机器学习技术的最新发展
7. A Review of Current Machine Learning Techniques Used in Manufacturing Diagnosis [O] . Ademujimi, Toyosi,, Brundage, Michael,, Prabhu, Vittaldas, 2017

机译：当前用于制造诊断的机器学习技术综述

Current breathomics-a review on data pre-processing techniques and machine learning in metabolomics breath analysis

摘要

著录项

相似文献

相关主题

期刊订阅