首页> 外文学位 >Maximum entropy density estimation and modeling geographic distributions of species.

【24h】

Maximum entropy density estimation and modeling geographic distributions of species.

机译：最大熵密度估计和物种物种地理分布建模。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Maximum entropy (maxent) approach, formally equivalent to maximum likelihood, is a widely used density-estimation method. When input datasets are small, maxent is likely to overfit. Overfitting can be eliminated by various smoothing techniques, such as regularization and constraint relaxation, but theory explaining their properties is often missing or needs to be derived for each case separately. In this dissertation, we propose a unified treatment for a large and general class of smoothing techniques. We provide fully general guarantees on their statistical performance and propose optimization algorithms with complete convergence proofs. As special cases, we can easily derive performance guarantees for many known regularization types including L1 and L2-squared regularization. Furthermore, our general approach enables us to derive entirely new regularization functions with superior statistical guarantees. The new regularization functions use information about the structure of the feature space, incorporate information about sample selection bias, and combine information across several related density-estimation tasks. We propose algorithms solving a large and general subclass of generalized maxent problems, including all discussed in the dissertation, and prove their convergence. Our convergence proofs generalize techniques based on information geometry and Bregman divergences as well as those based more directly on compactness.; As an application of maxent, we discuss an important problem in ecology and conservation: the problem of modeling geographic distributions of species. Here, small sample sizes hinder accurate modeling of rare and endangered species. Generalized maxent offers several advantages over previous techniques. In particular, generalized maxent addresses the problem in a statistically sound manner and allows principled extensions to situations when data collection is biased or when we have access to data on many related species. The utility of our unified approach is demonstrated in comprehensive experiments on large real-world datasets. We find that generalized maxent is among the best-performing species-distribution modeling techniques. Our experiments also show that the contributions of this dissertation, i.e., regularization strategies, bias-removal approaches, and multiple-estimation techniques, all significantly improve the predictive performance of maxent.

机译：形式上等效于最大似然的最大熵（maxent）方法是一种广泛使用的密度估计方法。当输入数据集较小时，maxent可能会过拟合。过度拟合可以通过各种平滑技术来消除，例如正则化和约束松弛，但是解释其属性的理论常常缺失，或者需要针对每种情况分别导出。在本文中，我们提出了对大型和通用类平滑技术的统一处理。我们对其统计性能提供全面的保证，并提出具有完整收敛性证明的优化算法。作为特殊情况，我们可以轻松得出许多已知正则化类型（包括L1和L2平方正则化）的性能保证。此外，我们的通用方法使我们能够导出具有卓越统计保证的全新正则函数。新的正则化函数使用有关特征空间结构的信息，合并有关样本选择偏差的信息，并跨多个相关的密度估计任务组合信息。我们提出了解决广义广义问题的一个大而广义子类的算法，包括本文讨论的所有子类，并证明了它们的收敛性。我们的收敛证明概括了基于信息几何和Bregman散度的技术，以及更直接基于紧凑性的技术。作为maxent的应用，我们讨论了生态和保护方面的一个重要问题：对物种地理分布建模的问题。在这里，小样本量妨碍了对稀有和濒危物种的准确建模。与以前的技术相比，广义maxent具有许多优点。特别是，广义的maxent以统计上合理的方式解决了这个问题，并允许对数据收集有偏见或我们可以访问许多相关物种的数据的情况进行原则性扩展。在大型的真实数据集上进行的综合实验中证明了我们统一方法的实用性。我们发现广义maxent是表现最佳的物种分布建模技术之一。我们的实验还表明，本论文的贡献，即正则化策略，消除偏倚的方法和多重估计技术，均显着提高了maxent的预测性能。

著录项

作者
Dudik, Miroslav.;
展开▼
作者单位

Princeton University.;

展开▼
授予单位 Princeton University.;
学科 Statistics.; Artificial Intelligence.; Computer Science.
学位 Ph.D.
年度 2007
页码 245 p.
总页数 245
原文格式 PDF
正文语种 eng
中图分类统计学;人工智能理论;自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Maximum Entropy Density Estimation with Generalized Regularization and an Application to Species Distribution Modeling [J] . Dud?-k Miroslav, Phillips Steven J., Schapire Robert E. Journal of machine learning research . 2007,第Jun期

机译：广义正则化的最大熵密度估计及其在物种分布建模中的应用
2. Predicting geographic distributions of fishes in remote stream networks using maximum entropy modeling and landscape characterizations [J] . Holder Anna M., Markarian Arev, Doyle Jessie M., Ecological Modelling . 2020,第1期

机译：使用最大熵建模和景观特征预测远程流网络中鱼类的地理分布
3. Maximum entropy modeling of geographic distributions of the flea beetle species endemic in Italy (Coleoptera: Chrysomelidae: Galerucinae: Alticini) [J] . Urbani Fabrizia, DAlessandro Paola, Frasca Roberta, Zoologischer Anzeiger . 2015,第Null期

机译：意大利特有的跳蚤甲虫物种的地理分布的最大熵模型（鞘翅目：金眼科：Galerucinae：Alticini）
4. Maximum Entropy Niche-Based Modeling (Maxent) of Potential Geographical Distributions of Lobesia Botrana (Lepidoptera: Tortricidae) in China [C] . Wencheng Lv, Zhihong Li, Xingxia Wu, IFIP TC 5/SIG 5.1 conference on computer and computing technologies in agriculture;CCTA 2011 . 2011

机译：基于最大熵基于生态位的中国山形莲（鳞翅目：Tor科）的潜在地理分布建模（Maxent）
5. The application of maximum entropy density estimation to the classification of short vegetation using multifrequency, polarimetric SAR. [D] . Kouskoulas, Yanni A. 2001

机译：最大熵密度估计在短频植被分类中的应用-多频极化SAR。
6. Pharmaco-Informatics: Accelerated NPEM Population Pharmacokinetic Modeling Maximum Entropy (ME) Parameter Distributions and New Multiple Model (MM) Stochastic Dosage Regimens now by Oral IM and both Intermittent and Continous IV [O] . Roger W. Jelliffe, David Bayard, Alan Schumitzky, 1995

机译：药物信息学：现在通过口服IM以及间歇性和连续性IV加速NPEM群体药代动力学建模最大熵（ME）参数分布和新的多重模型（MM）随机剂量方案
7. Maximum entropy density estimation with generalized regularization and an application to species distribution modeling [O] . Dudík Miroslav, Phillips Steven J, Schapire Robert E 2007

机译：广义正则化的最大熵密度估计及其在物种分布模型中的应用

Maximum entropy density estimation and modeling geographic distributions of species.

摘要

著录项

相似文献

相关主题

期刊订阅