首页> 外文学位 >Almost nonparametric and nonparametric estimation in mixture models.
【24h】

Almost nonparametric and nonparametric estimation in mixture models.

机译:混合模型中的几乎非参数和非参数估计。

获取原文
获取原文并翻译 | 示例

摘要

An almost nonparametric approach for the estimation of the mixing proportion in a mixture of two distributions, when we have a vector of observations on each subject, is to define a mixture of binomials. A mixture of binomials can be obtained if the vector or observations is mapped into a vector of zeroes and ones by selecting a cut point c. In this dissertation it is shown that the estimation of the cut point c, which minimizes the variance of the estimator of the mixing parameter, does not need to be very precise for some common distributions when the means of these distributions are more than two standard deviations apart. If more cut points are introduced a multinomial distribution is obtained and it is shown that the trinomial distribution is preferable to the binomial and the tetranomial is preferable to the trinomial distribution. In general, we prove that the multinomial distribution with r + 1 classes is preferable to the multinomial distribution with r classes. Nevertheless, it seems that if we introduce more than two cut points (a multinomial distribution with more than three regions) the gain in efficiency is not significant.; Nonparametric approaches are proposed for the estimation of the mixing parameter in a mixture of two continuous distributions with equal shapes and unimodal symmetric densities. In these approaches some cut points ci are introduced in order to define a multinomial distribution, three cut points for a tetranomial distribution and five cut points for a sextinomial distribution. The assumed symmetry of the component distributions is exploited in order to obtain the probabilities for each class of the multinomial approach and five methods of estimation of the parameters of the multinomial mixture are studied. These methods basically measure the concordance among the observed frequencies and the expected frequencies. We present Mathematica and S-plus program codes in order to obtain the estimates of the parameters in the multinomial mixture. A Monte Carlo study shows that for normal components, the estimators of the mixing proportion in the sextinomial approaches are comparable with the EM algorithm estimator if the means are 1.75 standard deviations apart, but the estimators of the sextinomial approaches have an efficiency of 50% with respect to the EM estimator when the distance between the means is 2.32 standard deviations. When the component distribution are not normal, the sextinomial approaches outperform the EM algorithm that assumes that the components are normal.; These tetranomial and sextinomial approaches can be easily adapted for use with training samples and three methods of sampling are considered. With training samples and normal components, the estimators from the sextinomial methods are comparable with the EM algorithm estimator. However, when component distributions are not normal, the sextinomial estimators outperform the EM algorithm estimator which assumes that the component distributions are normal.
机译:当我们对每个主题都有观测向量时,用于估计两种分布的混合比例的一种几乎非参数的方法是定义二项式的混合。如果通过选择切点 c 将一个或多个向量映射到零和一的向量,则可以获得二项式混合。本文表明,对切点 c 的估计使混合参数估计值的方差最小,对于某些常见的分布来说,不需要非常精确的估计。分布相差两个以上的标准差。如果引入更多的切点,则将获得多项式分布,并且表明三项式分布优于二项式,而四项式优于三项式。通常,我们证明 r + 1类的多项式分布比 r 类的多项式分布更可取。然而,似乎如果我们引入两个以上的切点(具有三个以上区域的多项式分布),效率的提高并不显着。提出了非参数方法来估计形状和单峰对称密度相等的两个连续分布的混合中的混合参数。在这些方法中,引入了一些切点 c i 以定义多项式分布,三个切点用于四项式分布,五个切点用于六项式分布。利用组分分布的对称性来获得每一类多项式方法的概率,并研究了五种估计多项式混合参数的方法。这些方法基本上测量了观测频率和预期频率之间的一致性。我们提供Mathematica和S-plus程序代码,以便获得多项式混合中参数的估计。蒙特卡罗研究表明,对于正态分量,如果均值相差1.75标准偏差,则六项式方法中混合比例的估计量与EM算法估计量可比,但是六项式方法中的混合量估计量的效率为50%。当均值之间的距离为2.32标准偏差时,相对于EM估计量。当成分分布不正常时,六项式方法的性能优于假定成分正常的EM算法。这些四项式和六项式方法可以轻松地用于训练样本,并考虑了三种采样方法。通过训练样本和正常分量,六项式方法的估计量可与EM算法的估计量相媲美。但是,当成分分布不正常时,六项式估计量要优于假设成分分布是正态的EM算法估计量。

著录项

  • 作者单位

    The Pennsylvania State University.;

  • 授予单位 The Pennsylvania State University.;
  • 学科 Statistics.; Mathematics.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 148 p.
  • 总页数 148
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 统计学;数学;
  • 关键词

  • 入库时间 2022-08-17 11:46:48

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号