Forest classification trees and forest support vector machines algorithms: Demonstration using microarray data.

Zintzaras E; Kowald A

Abstract

Classification into multiple classes when the measured variables are outnumbered is a major methodological challenge in -omics studies. Two algorithms that overcome the dimensionality problem are presented: the forest classification tree (FCT) and the forest support vector machines (FSVM). In FCT, a set of variables is randomly chosen and a classification tree (CT) is grown using a forward classification algorithm. The process is repeated and a forest of CTs is derived. Finally, the most frequent variables from the trees with the smallest apparent misclassification rate (AMR) are used to construct a productive tree. In FSVM, the CTs are replaced by SVMs. The methods are demonstrated using prostate gene expression data for classifying tissue samples into four tumor types. For threshold split value 0.001 and utilizing 100 markers the productive CT consisted of 29 terminal nodes and achieved perfect classification (AMR=0). When the threshold value was set to 0.01, a tree with 17 terminal nodes was constructed based on 15 markers (AMR=7%). In FSVM, reducing the fraction of the forest that was used to construct the best classifier from the top 80% to the top 20% reduced the misclassification to 25% (when using 200 markers). The proposed methodologies may be used for identifying important variables in high dimensional data. Furthermore, the FCT allows exploring the data structure and provides a decision rule.
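The FCT steps described in the abstract (random variable subsets, a forest of trees, selection of the most frequent variables from the lowest-AMR trees, then a final tree) can be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation: the abstract does not specify the "forward classification algorithm", so a greedy Gini-impurity tree is assumed here, and `min_gain` is only an assumed stand-in for the "threshold split value". All function and parameter names (`grow_tree`, `forest_select`, `top_fraction`, `n_keep`, and so on) are hypothetical.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a non-empty list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def grow_tree(X, y, features, min_gain):
    """Greedy tree restricted to `features`. A leaf is a class label (labels
    must not be tuples); an inner node is (feature, cut, left, right)."""
    majority = Counter(y).most_common(1)[0][0]
    parent, best = gini(y), None
    for f in features:
        values = sorted({row[f] for row in X})
        for cut in values[:-1]:  # cut at an observed value: both children non-empty
            left = [i for i, row in enumerate(X) if row[f] <= cut]
            right = [i for i, row in enumerate(X) if row[f] > cut]
            child = (len(left) * gini([y[i] for i in left])
                     + len(right) * gini([y[i] for i in right])) / len(y)
            if best is None or parent - child > best[0]:
                best = (parent - child, f, cut, left, right)
    if best is None or best[0] < min_gain:  # assumed stopping rule
        return majority
    _, f, cut, left, right = best
    return (f, cut,
            grow_tree([X[i] for i in left], [y[i] for i in left], features, min_gain),
            grow_tree([X[i] for i in right], [y[i] for i in right], features, min_gain))

def predict(tree, row):
    """Walk from the root to a leaf and return its class label."""
    while isinstance(tree, tuple):
        f, cut, left, right = tree
        tree = left if row[f] <= cut else right
    return tree

def amr(tree, X, y):
    """Apparent misclassification rate: error on the data used to grow the tree."""
    return sum(predict(tree, r) != t for r, t in zip(X, y)) / len(y)

def used_vars(tree):
    """Set of variables that actually appear in the tree's splits."""
    if not isinstance(tree, tuple):
        return set()
    f, _, left, right = tree
    return {f} | used_vars(left) | used_vars(right)

def forest_select(X, y, n_trees, subset_size, top_fraction, n_keep,
                  min_gain=0.01, seed=0):
    """Grow trees on random variable subsets, keep the lowest-AMR fraction,
    and grow a final tree on the variables used most often by those trees."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        feats = rng.sample(range(len(X[0])), subset_size)
        tree = grow_tree(X, y, feats, min_gain)
        forest.append((amr(tree, X, y), used_vars(tree)))
    forest.sort(key=lambda item: item[0])
    best = forest[:max(1, int(top_fraction * n_trees))]
    counts = Counter(v for _, used in best for v in used)
    selected = [v for v, _ in counts.most_common(n_keep)]
    final_tree = grow_tree(X, y, selected, min_gain)
    return selected, final_tree, amr(final_tree, X, y)
```

Here `X` is a list of samples (each a list of numeric expression values) and `y` the matching class labels; `forest_select` returns the selected variable indices, the final tree, and its AMR. Because the AMR is measured on the same samples used to grow the tree, it is an optimistic estimate of the error on new data. An FSVM-style variant would swap `grow_tree` for an SVM fit on each variable subset, which this sketch does not attempt.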

Bibliographic record

Source: Computers in Biology and Medicine, 2010, Issue 5, 6 pages
Authors: Zintzaras E; Kowald A
Original format: PDF
Language: English
Chinese Library Classification: General medical science
