Sampling Methods in Decision Trees

机译：决策树中的抽样方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

CART is widely used by researchers in data mining applications. However, for a very large data set building CART is nearly an impossible task. We argue that a tree built entirely from a large simple random sample from the data set is reasonable. In addition, we propose two new algorithms; the parabola method that relies on the property that the Gini-index is a smooth function in any continuous attribute thus allowing accurate approximation of the minimum Gini-index and the double sampling method useful when a data set is very large. Experimental results show that these two method perform extremely well.

机译：研究人员在数据挖掘应用中广泛使用了CART。但是，对于非常大的数据集，构建CART几乎是不可能的任务。我们认为，完全根据数据集中的大量简单随机样本构建的树是合理的。此外，我们提出了两种新算法；抛物线方法依赖于以下属性：基尼系数是任何连续属性中的平滑函数，因此允许最小基尼系数的精确近似值和在数据集非常大时有用的双采样方法。实验结果表明，这两种方法的效果都非常好。

著录项

来源
《International Conference on Artificial Intelligence IC-AI'2000 Vol.2, Jun 26-29, 2000, Las Vegas, Nevada, USA》|2000年|p.1069-1075|共7页
会议地点 Las Vegas NV(US);Las Vegas NV(US)
作者
Kishan G. Mehrotra; Mohammed Jeragh;
展开▼
作者单位

Department of EECS Syracuse University Syracuse, NY, U.S.A.;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
cart; gini-index; decision tree; random sampling;

机译：大车;基尼指数决策树;随机抽样;

相似文献

外文文献
中文文献
专利

1. Comparison of the decision tree, artificial neural network, and linear regression methods based on the number and types of independent variables and sample size [J] . Yong Soo Kim Expert systems with applications . 2008,第2期

机译：基于自变量数量和类型以及样本量的决策树，人工神经网络和线性回归方法的比较
2. Development and Test of Fixed Average K-means Base Decision Trees Grouping Method by Improving Decision Tree Clustering Method [J] . Jai-Houng Leu, Chih-Yao Lo, Chi-Hau Liu Journal of Applied Sciences . 2009,第3期

机译：改进的决策树聚类方法对固定平均K均值基础决策树分组方法的开发和测试
3. Student modeling method using decision tree learning based on portfolio concept - structure of learning history data-based and condensing method using decision tree learning [J] . Tatsunori Matsui, Toshio Okamoto 電子情報通信学会技術研究報告. 情報セキュリティ. Information Security . 2000,第113期

机译：基于投资组合概念的学生建模方法 - 基于学习历史数据的结构基于学习历史数据的结构，使用决策树学习
4. Sampling Methods in Decision Trees [C] . Kishan G. Mehrotra, Mohammed Jeragh International conference on artificial intelligence . 2000

机译：决策树中的抽样方法
5. A comparison of N-tree distance sampling with fixed-radius plot and variable-radius point sampling methods. [D] . Lessard, Veronica Clare. 1997

机译：N树距离采样与固定半径图和可变半径点采样方法的比较。
6. The Efficacy of Consensus Tree Methods for Summarizing Phylogenetic Relationships from a Posterior Sample of Trees Estimated from Morphological Data [O] . Joseph E O’Reilly, Philip C J Donoghue -1

机译：共识树方法从形态数据估计的后树样本中总结系统发生关系的功效
7. Effects of Sampling Methods on Prediction Quality. The Case of Classifying Land Cover Using Decision Trees [O] . Hochreiter, Ronald, Waldhauser, Christoph 2014

机译：抽样方法对预测质量的影响。的情况下用决策树划分土地覆盖
8. Industrial Hygiene Sampling, Decision-Making, Monitoring and Recordkeeping (554). Sampling Methods [R] . 1978

机译：工业卫生取样，决策，监测和记录（554）。采样方法

Sampling Methods in Decision Trees

摘要

著录项

相似文献

相关主题

期刊订阅