Enriched random forests

Amaratunga Dhammika; Cabrera Javier; Lee Yung-Seop

首页> 外文期刊>Bioinformatics >Enriched random forests

【24h】

Enriched random forests

机译：丰富的随机森林

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Although the random forest classification procedure works well in datasets with many features, when the number of features is huge and the percentage of truly informative features is small, such as with DNA microarray data, its performance tends to decline significantly. In such instances, the procedure can be improved by reducing the contribution of trees whose nodes are populated by non-informative features. To some extent, this can be achieved by prefiltering, but we propose a novel, yet simple, adjustment that has demonstrably superior performance: choose the eligible subsets at each node by weighted random sampling instead of simple random sampling, with the weights tilted in favor of the informative features. This results in an enriched random forest. We illustrate the superior performance of this procedure in several actual microarray datasets.

机译：尽管随机森林分类程序在具有许多特征的数据集中效果很好，但是当特征数量巨大且真正具有信息意义的特征所占的百分比较小时（如DNA微阵列数据），其性能往往会大大下降。在这种情况下，可以通过减少其节点由非信息性特征填充的树的贡献来改进该过程。在某种程度上，这可以通过预滤波来实现，但是我们提出了一种新颖而又简单的调整，该调整具有明显的优越性能：通过加权随机抽样而不是简单随机抽样在每个节点上选择合格的子集，权重倾斜信息功能。这导致了丰富的随机森林。我们在几个实际的微阵列数据集中说明了该程序的优越性能。

著录项

来源
《Bioinformatics》 |2008年第18期|共5页
作者
Amaratunga Dhammika; Cabrera Javier; Lee Yung-Seop;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物科学;生物工程学（生物技术）;
关键词
enriched random forest;

机译：丰富的随机森林;

相似文献

外文文献
中文文献
专利

1. Application of Random Forest and data integration identifies three dysregulated genes and enrichment of Central Carbon Metabolism pathway in Oral Cancer [J] . Srija Mukhopadhyay, Sahana Ghosh, Debodipta Das, BMC Cancer . 2020,第1期

机译：随机森林和数据集成的应用鉴定了三种失调基因和口腔癌中央碳代谢途径的富集
2. Prediction of RNA-binding residues in proteins from primary sequence using an enriched random forest model with a novel hybrid feature. [J] . Ma X, Guo J, Wu J, Proteins: Structure, Function, and Genetics . 2011,第4期

机译：使用具有新型杂种特征的富集随机森林模型，可以预测一级序列中蛋白质中的RNA结合残基。
3. Enriched random forests [J] . Amaratunga Dhammika, Cabrera Javier, Lee Yung-Seop Bioinformatics . 2008,第18期

机译：丰富的随机森林
4. Towards Stock Market Data Mining Using Enriched Random Forests from Textual Resources and Technical Indicators [C] . Manolis Maragoudakis, Dimitrios Serpanos Artificial intelligence applications and innovations . 2010

机译：从文本资源和技术指标着手使用丰富的随机森林进行股票数据挖掘
5. Understanding and Enriching Randomness within Resource-Constrained Devices [D] . Wallace, Kyle M. 2018

机译：了解和丰富资源受限设备中的随机性
6. Application of Random Forest and data integration identifies three dysregulated genes and enrichment of Central Carbon Metabolism pathway in Oral Cancer [O] . Srija Mukhopadhyay, Sahana Ghosh, Debodipta Das, 2020

机译：随机森林和数据集成的应用鉴定了三种失调基因和口腔癌中央碳代谢途径的富集
7. Application of Random Forest and data integration identifies three dysregulated genes and enrichment of Central Carbon Metabolism pathway in Oral Cancer [O] . Srija Mukhopadhyay, Sahana Ghosh, Debodipta Das, 2020

机译：随机森林和数据集成的应用鉴定了三种失调基因和口腔癌中央碳代谢途径的富集

Enriched random forests

摘要

著录项

相似文献

相关主题

期刊订阅