首页> 中文期刊> 《理论物理通讯(英文版)》 >Data-Based Optimal Bandwidth for Kernel Density Estimation of Statistical Samples

Data-Based Optimal Bandwidth for Kernel Density Estimation of Statistical Samples

     

摘要

It is a common practice to evaluate probability density function or matter spatial density function from statistical samples.Kernel density estimation is a frequently used method,but to select an optimal bandwidth of kernel estimation,which is completely based on data samples,is a long-term issue that has not been well settled so far.There exist analytic formulae of optimal kernel bandwidth,but they cannot be applied directly to data samples,since they depend on the unknown underlying density functions from which the samples are drawn.In this work,we devise an approach to pick out the totally data-based optimal bandwidth.First,we derive correction formulae for the analytic formulae of optimal bandwidth to compute the roughness of the sample's density function.Then substitute the correction formulae into the analytic formulae for optimal bandwidth,and through iteration we obtain the sample's optimal bandwidth.Compared with analytic formulae,our approach gives very good results,with relative differences from the analytic formulae being only 2% ~ 3% for sample size larger than 104.This approach can also be generalized easily to cases of variable kernel estimations.

著录项

  • 来源
    《理论物理通讯(英文版)》 |2018年第12期|728-734|共7页
  • 作者

    Zhen-Wei Li; Ping He;

  • 作者单位

    Center for Theoretical Physics and College of Physics, Jilin University, Changchun 130012, China;

    Changchun Observatory, National Astronomical Observatories, CAS, Changchun 130117, China;

    Center for Theoretical Physics and College of Physics, Jilin University, Changchun 130012, China;

    Center for High Energy Physics, Peking University, Beijing 100871, China;

  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号