Machine Learning

CaDET: interpretable parametric conditional density estimation with decision trees and forests
Abstract

We introduce CaDET, an algorithm for parametric Conditional Density Estimation (CDE) based on decision trees and random forests. CaDET uses the empirical cross entropy impurity criterion for tree growth, which incentivizes splits that improve predictive accuracy more than the regression criteria or estimated mean-integrated-square-error used in previous works. CaDET also admits more efficient training and query procedures than existing tree-based CDE approaches, and stores only a bounded amount of information at each tree leaf, by using sufficient statistics for all computations. Previous tree-based CDE techniques produce complicated uninterpretable distribution objects, whereas CaDET may be instantiated with easily interpretable distribution families, making every part of the model easy to understand. Our experimental evaluation on real datasets shows that CaDET usually learns more accurate, smaller, and more interpretable models, and is less prone to overfitting than existing tree-based CDE approaches.
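To make the abstract's two central ideas concrete — growing the tree with an empirical cross-entropy impurity criterion, and doing all computations from bounded sufficient statistics — here is a minimal sketch of a single split search for a Gaussian response family. This is an illustrative reconstruction, not the authors' implementation: the function names and the restriction to one feature and a Gaussian family are assumptions made for brevity.

```python
import numpy as np

def gaussian_cross_entropy(n, s1, s2):
    """Empirical cross entropy (mean negative log-likelihood) of the MLE
    Gaussian fit to a node, computed only from the sufficient statistics
    n, s1 = sum(y), s2 = sum(y^2) -- bounded storage per node."""
    mu = s1 / n
    var = max(s2 / n - mu ** 2, 1e-12)  # guard against degenerate nodes
    return 0.5 * (np.log(2.0 * np.pi * var) + 1.0)

def best_split(x, y):
    """Scan candidate thresholds on one feature, scoring each split by the
    size-weighted cross entropy of its children. Running sufficient
    statistics make the scan O(n) after an O(n log n) sort."""
    order = np.argsort(x)
    xs, ys = x[order], y[order]
    n = len(ys)
    total_s1, total_s2 = ys.sum(), (ys ** 2).sum()
    best_score, best_threshold = np.inf, None
    s1 = s2 = 0.0
    for i in range(1, n):
        s1 += ys[i - 1]
        s2 += ys[i - 1] ** 2
        if xs[i] == xs[i - 1]:
            continue  # no valid threshold between equal feature values
        left = gaussian_cross_entropy(i, s1, s2)
        right = gaussian_cross_entropy(n - i, total_s1 - s1, total_s2 - s2)
        score = (i * left + (n - i) * right) / n
        if score < best_score:
            best_score = score
            best_threshold = 0.5 * (xs[i - 1] + xs[i])
    return best_score, best_threshold
```

Because each child's impurity is a closed-form function of its sufficient statistics, the same statistics stored at a leaf later instantiate an interpretable parametric density (here, a Gaussian with the leaf's MLE mean and variance) at query time.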
