首页> 美国卫生研究院文献>PLoS Clinical Trials >Predictive Power Estimation Algorithm (PPEA) - A New Algorithm to Reduce Overfitting for Genomic Biomarker Discovery

【2h】

Predictive Power Estimation Algorithm (PPEA) - A New Algorithm to Reduce Overfitting for Genomic Biomarker Discovery

机译：预测能力估计算法（ppEa） - 一种新的算法以减少过度拟合基因组生物标记发现

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Toxicogenomics promises to aid in predicting adverse effects, understanding the mechanisms of drug action or toxicity, and uncovering unexpected or secondary pharmacology. However, modeling adverse effects using high dimensional and high noise genomic data is prone to over-fitting. Models constructed from such data sets often consist of a large number of genes with no obvious functional relevance to the biological effect the model intends to predict that can make it challenging to interpret the modeling results. To address these issues, we developed a novel algorithm, Predictive Power Estimation Algorithm (PPEA), which estimates the predictive power of each individual transcript through an iterative two-way bootstrapping procedure. By repeatedly enforcing that the sample number is larger than the transcript number, in each iteration of modeling and testing, PPEA reduces the potential risk of overfitting. We show with three different cases studies that: (1) PPEA can quickly derive a reliable rank order of predictive power of individual transcripts in a relatively small number of iterations, (2) the top ranked transcripts tend to be functionally related to the phenotype they are intended to predict, (3) using only the most predictive top ranked transcripts greatly facilitates development of multiplex assay such as qRT-PCR as a biomarker, and (4) more importantly, we were able to demonstrate that a small number of genes identified from the top-ranked transcripts are highly predictive of phenotype as their expression changes distinguished adverse from nonadverse effects of compounds in completely independent tests. Thus, we believe that the PPEA model effectively addresses the over-fitting problem and can be used to facilitate genomic biomarker discovery for predictive toxicology and drug responses.

著录项

期刊名称 PLoS Clinical Trials
作者
Jiangang Liu; Robert A. Jolly; Aaron T. Smith; George H. Searfoss; Keith M. Goldstein; Vladimir N. Uversky; Keith Dunker; Shuyu Li; Craig E. Thomas; Tao Wei;
展开▼
作者单位

展开▼
年(卷),期 2011(6),9
年度 2011
页码 e24233
总页数 11
原文格式 PDF
正文语种
中图分类
关键词
入库时间 2022-08-21 11:25:57

相似文献

外文文献
中文文献
专利

1. Harmless Overfitting: Using Denoising Autoencoders in Estimation of Distribution Algorithms [J] . Malte Probst, Franz Rothlauf Journal of machine learning research . 2020,第a期

机译：无害的过度限制：在分布算法估计中使用去噪自动化器
2. Prediction of Anti-inflammatory Plants and Discovery of Their Biomarkers by Machine Learning Algorithms and Metabolomic Studies [J] . Chagas-Paula Daniela Aparecida, Oliveira Tiago Branquinho, Zhang Tong, Planta medica: Natural products and medicinal plant research . 2015,第6期

机译：机器学习算法和代谢组学研究预测抗炎植物及其生物标志物
3. Standard machine learning algorithms applied to UPLC-TOF/MS metabolic fingerprinting for the discovery of wound biomarkers in Arabidopsis thaliana [J] . Julien Boccard, Alexandros Kalousis, Melanie Hilario, Chemometrics and Intelligent Laboratory Systems . 2010,第1期

机译：将标准机器学习算法应用于UPLC-TOF / MS代谢指纹识别，以发现拟南芥中的伤口生物标记
4. Method for Reducing Uncertainties of Predictive Range Estimation Algorithms in Electric Vehicles [C] . Achim Enthaler, Frank Gauterin IEEE Vehicular Technology Conference . 2015

机译：减少电动汽车预测范围估计算法不确定性的方法
5. Algorithmic Methods for Multi-Omics Biomarker Discovery [D] . Li, Yichao. 2018

机译：多组学生物标志物发现的算法方法
6. Reproducible Cancer Biomarker Discovery in SELDI-TOF MS Using Different Pre-Processing Algorithms [O] . Jinfeng Zou, Guini Hong, Xinwu Guo, 2008

机译：使用不同的预处理算法在SELDI-TOF MS中发现可重现的癌症生物标记
7. Predictive Power Estimation Algorithm (PPEA) - A New Algorithm to Reduce Overfitting for Genomic Biomarker Discovery [O] . Liu, Jiangang, Jolly, Robert A., Smith, Aaron T., 2011

机译：预测功率估计算法（PPEA）-减少基因组生物标志物发现的过度拟合的新算法

Predictive Power Estimation Algorithm (PPEA) - A New Algorithm to Reduce Overfitting for Genomic Biomarker Discovery

摘要

著录项

相似文献

相关主题

期刊订阅