首页> 外文OA文献 >Robuustheid van gecensureerde dieptekwantielen, PCA en kernel regressie,met toepassingen in model selectie
【2h】

Robuustheid van gecensureerde dieptekwantielen, PCA en kernel regressie,met toepassingen in model selectie

机译:审查深度分位数,PCA和核回归的稳健性及其在模型选择中的应用

摘要

In statistics, classical methods often heavily rely on assumptions which cannot always be met in practice. For instance, it is often assumed that the data are generated from a specific underlying distribution. And even if the model assumptions are distribution-free, most methods assume that the sample contains independent and identically distributed observations. However, when outliers are present such methods can perform very poorly. Robust statistics seeks to provide methods that are not unlimitedly affected by outliers. The goal is to learn the structure of the majority of the data, even if a minority of observations disturbs the pattern.In this work robustness is studied in two settings: regression and Principal Component Analysis (PCA). Regression analysis models the relationship between a response variable and a set of explanatory variables (also called covariates). Interest lies in the conditional distribution of the response, conditional on values of the explanatory variables. One can concentrate on estimating certain aspects of this conditional distribution, e.g. the mean, leading to least squares regression. However, in some applications a more detailed description beyond the mean might be useful. Quantile regression aims at estimating all conditional quantiles, thus fully characterizing the conditional distribution. Assuming a linear relationship between response and covariates, linear quantile regression can be performed using an
机译:在统计中,经典方法通常严重依赖在实践中无法始终满足的假设。例如,通常假定数据是从特定的基础分布生成的。即使模型假设是无分布的,大多数方法也假设样本包含独立且分布相同的观测值。但是,如果存在异常值,则此类方法的性能可能会非常差。稳健的统计旨在提供不受异常值无限影响的方法。目的是即使少数观察结果扰乱了模式也要学习大多数数据的结构。在此工作中,在两种情况下研究了鲁棒性:回归和主成分分析(PCA)。回归分析对响应变量和一组解释变量(也称为协变量)之间的关系进行建模。兴趣在于响应的条件分布,该条件分布取决于解释变量的值。人们可以集中精力估计这种条件分布的某些方面,例如均值,导致最小二乘回归。但是,在某些应用程序中,超出平均值的更详细的描述可能会有用。分位数回归旨在估计所有条件分位数,从而充分表征条件分布。假设响应和协变量之间存在线性关系,则可以使用

著录项

  • 作者

    Debruyne Michiel;

  • 作者单位
  • 年度 2007
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号