Statistical Inference for Cluster Trees

机译：聚类树的统计推断

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A cluster tree provides a highly-interpretable summary of a density function by representing the hierarchy of its high-density clusters. It is estimated using the empirical tree, which is the cluster tree constructed from a density estimator. This paper addresses the basic question of quantifying our uncertainty by assessing the statistical significance of topological features of an empirical cluster tree. We first study a variety of metrics that can be used to compare different trees, analyze their properties and assess their suitability for inference. We then propose methods to construct and summarize confidence sets for the unknown true cluster tree. We introduce a partial ordering on cluster trees which we use to prune some of the statistically insignificant features of the empirical tree, yielding interpretable and parsimonious cluster trees. Finally, we illustrate the proposed methods on a variety of synthetic examples and furthermore demonstrate their utility in the analysis of a Graft-versus-Host Disease (GvHD) data set.

机译：集群树通过表示其高密度集群的层次结构，提供了密度函数的高度可解释的摘要。使用经验树进行估算，该经验树是根据密度估算器构造的聚类树。本文通过评估经验聚类树的拓扑特征的统计显着性，解决了量化不确定性的基本问题。我们首先研究各种可用于比较不同树木，分析其属性并评估其适用性的度量标准。然后，我们提出了构建和总结未知真实簇树的置信度集的方法。我们在聚类树上引入了部分排序，用于修剪经验树的一些统计上无关紧要的特征，从而产生可解释且简约的聚类树。最后，我们在各种合成示例上说明了所提出的方法，并进一步证明了它们在移植物抗宿主病（GvHD）数据集分析中的效用。

著录项

来源
《Annual conference on Neural Information Processing Systems》|2016年|1839-1847|共9页
会议地点
作者
Jisu Kim; Yen-Chi Chen; Sivaraman Balakrishnan; Alessandro Rinaldo; Larry Wasserman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Experimental Design and Statistical Inference for Cluster Point Processes - with Applications to the Fruit Dispersion of Anemochorous Forest Trees [J] . W. Nather, K. Walder Biometrical Journal . 2003,第8期

机译：聚类点过程的实验设计和统计推论-兼谈绒毛林木果实的疏导。
2. Statistical inference is overemphasized in cluster investigations: the case of the cluster of breast cancers at the Australian Broadcasting Corporation studios in Brisbane, Australia. [J] . Coory M Internal medicine journal . 2008,第4期

机译：在聚类调查中过分强调了统计推论：澳大利亚布里斯班的澳大利亚广播公司工作室的乳腺癌聚类案例。
3. Causal inference in statistics : A primer . Judea ? Pearl , Maria ? Glymour , and Nicholas ? Jewell , John Wiley & Sons, Ltd. , Chichester, UK . Causal inference in statistics Causal inference in statistics : A primer A primer . Judea ? Pearl Judea Judea ? Pearl Pearl , Maria ? Glymour Maria Maria ? Glymour Glymour , and Nicholas ? Jewell Nicholas Nicholas ? Jewell Jewell , John Wiley & Sons, Ltd. John Wiley & Sons, Ltd. , Chichester, UK Chichester, UK . [J] . Hogan Joseph W. Biometrics: Journal of the Biometric Society : An International Society Devoted to the Mathematical and Statistical Aspects of Biology . 2019,第2期

机译：统计因果推断：一个底漆。犹太？玛丽亚珍珠？甘肃和尼古拉斯？ Jojell，John Wiley＆amp; Sons，Ltd。，奇切斯特，英国。统计统计因果推断的因果推断：底漆底漆。犹太？ Pearl Judea Judea？珍珠珍珠，玛丽亚？ Glymour Maria Maria？ glymour glymour和尼古拉斯？ Jewell Nicholas Nicholas？ John Wiley＆amp的Jewell Jewell; 儿子，有限公司John Wiley＆Sons，Ltd。，奇切斯特，英国奇切斯特，英国。
4. Statistical Inference for Cluster Trees [C] . Jisu Kim, Yen-Chi Chen, Sivaraman Balakrishnan, Annual conference on Neural Information Processing Systems . 2016

机译：集群树的统计推断
5. Ensemble trees and CLTS: Statistical inference in machine learning [D] . Mentch, Lucas Kirk 2015

机译：集成树和CLTS：机器学习中的统计推断
6. Network inference with ensembles of bi-clustering trees [O] . Konstantinos Pliakos, Celine Vens 2019

机译：具有双聚类树集成的网络推断
7. Developments in statistical inference when assessing spatiotemporal disease clustering with the tau statistic [O] . Timothy M. Pollington, Michael J. Tildesley, T. Déirdre Hollingsworth, 2021

机译：评估与TAU统计数据的时尚疾病聚类时统计推断的发展
8. Bayesian Inference for Color Image Quantization via Model-Based Clustering Trees [R] . Murtagh, F. , Raftery, A. E. , Starck, J. 2001

机译：基于模型聚类树的彩色图像量化的贝叶斯推断

Statistical Inference for Cluster Trees

摘要

著录项

相似文献

相关主题

期刊订阅