Selection of microbial biomarkers with genetic algorithm and principal component analysis

Ping Zhang; Nicholas P. West; Pin-Yen Chen; Mike W. C. Thang; Gareth Price; Allan W. Cripps; Amanda J. Cox

首页> 外文期刊>BMC Bioinformatics >Selection of microbial biomarkers with genetic algorithm and principal component analysis

【24h】

Selection of microbial biomarkers with genetic algorithm and principal component analysis

机译：具有遗传算法和主成分分析的微生物生物标志物

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

BACKGROUND:Principal components analysis (PCA) is often used to find characteristic patterns associated with certain diseases by reducing variable numbers before a predictive model is built, particularly when some variables are correlated. Usually, the first two or three components from PCA are used to determine whether individuals can be clustered into two classification groups based on pre-determined criteria: control and disease group. However, a combination of other components may exist which better distinguish diseased individuals from healthy controls. Genetic algorithms (GAs) can be useful and efficient for searching the best combination of variables to build a prediction model. This study aimed to develop a prediction model that combines PCA and a genetic algorithm (GA) for identifying sets of bacterial species associated with obesity and metabolic syndrome (Mets).RESULTS:The prediction models built using the combination of principal components (PCs) selected by GA were compared to the models built using the top PCs that explained the most variance in the sample and to models built with selected original variables. The advantages of combining PCA with GA were demonstrated.CONCLUSIONS:The proposed algorithm overcomes the limitation of PCA for data analysis. It offers a new way to build prediction models that may improve the prediction accuracy. The variables included in the PCs that were selected by GA can be combined with flexibility for potential clinical applications. The algorithm can be useful for many biological studies where high dimensional data are collected with highly correlated variables.

机译：背景：主成分分析（PCA）通常用于通过在构建预测模型之前通过减少可变数字来找到与某些疾病相关的特征模式，特别是当一些变量相关时。通常，来自PCA的前两个或三个组分用于确定是否可以基于预先确定的标准组聚集成两个分类基团：对照和疾病组。然而，可能存在其他组分的组合，其更好地区分患病的个体免受健康对照。遗传算法（气体）对于搜索最佳变量组合来构建预测模型是有用和有效的。该研究旨在开发一种预测模型，该预测模型结合了PCA和遗传算法（GA）来识别与肥胖症和代谢综合征（METS）相关的细菌物种组。结果：使用所选主组件（PC）的组合构建的预测模型通过GA与使用顶级PC构建的模型进行比较，该模型解释了样本中最方差以及使用所选原始变量构建的模型。对PCA与GA组合的优点进行了演示。结论：所提出的算法克服了PCA的数据分析的限制。它提供了一种构建预测模型的新方法，可以提高预测精度。由GA选择的PC中包含的变量可以与潜在的临床应用的灵活性相结合。该算法对于许多生物学研究非常有用，其中收集具有高度相关变量的高尺寸数据。

著录项

来源
《BMC Bioinformatics》 |2019年第s6期|共8页
作者
Ping Zhang; Nicholas P. West; Pin-Yen Chen; Mike W. C. Thang; Gareth Price; Allan W. Cripps; Amanda J. Cox;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词
PCAGenetic algorithmObesityBiomarker;

机译：pcagenetic almorithmobesitybiomarker.;

相似文献

外文文献
中文文献
专利

1. A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm [J] . Harun Uguz Knowledge-Based Systems . 2011,第7期

机译：利用信息增益，主成分分析和遗传算法的文本分类两阶段特征选择方法
2. Development of a classification and ranking method for the identification of possible biomarkers in two-dimensional gel-electrophoresis based on principal component analysis and variable selection proceduresf J [J] . Elisa Robotti, Marco Demartini, Fabio Gosetti, Molecular BioSystems . 2011,第3期

机译：基于主成分分析和变量选择程序的二维凝胶电泳中可能的生物标记物识别分类和分级方法的开发
3. Genetic algorithms applied to the selection of factors in principal component regression [J] . Depczynski U. Analytica chimica acta . 2000,第2期

机译：遗传算法在主成分回归中选择因子
4. Combination of Principal Component Analysis and Genetic Algorithm for Microbial Biomarker Identification in Obesity [C] . Ping Zhang, Nicholas West, Pin-Yen Chen, IEEE International Conference on Bioinformatics and Biomedicine . 2018

机译：主成分分析和遗传算法相结合的肥胖微生物标志物鉴定
5. Component selection optimization using genetic algorithms. [D] . Carlson, Susan Elizabeth. 1993

机译：使用遗传算法优化组件选择。
6. Selection of microbial biomarkers with genetic algorithm and principal component analysis [O] . Ping Zhang, Nicholas P. West, Pin-Yen Chen, 2019

机译：遗传算法和主成分分析法选择微生物标志物
7. Combination of Principal Component Analysis and Genetic Algorithm for Microbial Biomarker Identification in Obesity [O] . Ping Zhang, Nicholas West, Pin-Yen Chen, 2018

机译：主成分分析和遗传算法在肥胖症中微生物生物标志物鉴定的组合

Selection of microbial biomarkers with genetic algorithm and principal component analysis

摘要

著录项

相似文献

相关主题

期刊订阅