首页> 外文期刊>Computational statistics & data analysis >Bayesian multiple response kernel regression model for high dimensional data and its practical applications in near infrared spectroscopy
【24h】

Bayesian multiple response kernel regression model for high dimensional data and its practical applications in near infrared spectroscopy

机译:高维数据的贝叶斯多响应核回归模型及其在近红外光谱中的实际应用

获取原文
获取原文并翻译 | 示例
           

摘要

Non-linear regression based on reproducing kernel Hilbert space (RKHS) has recently become very popular in fitting high-dimensional data. The RKHS formulation provides an automatic dimension reduction of the covariates. This is particularly helpful when the number of covariates (p) far exceed the number of data points. In this paper, we introduce a Bayesian nonlinear multivariate regression model for high-dimensional problems. Our model is suitable when we have multiple correlated observed response corresponding to same set of covariates. We introduce a robust Bayesian support vector regression model based on a multivariate version of Vapnik's -insensitive loss function. The likelihood corresponding to the multivariate Vapnik's -insensitive loss function is constructed as a scale mixture of truncated normal and gamma distribution. The regression function is constructed using the finite representation of a function in the reproducing kernel Hilbert space (RKHS). The kernel parameter is estimated adaptively by assigning a prior on it and using the Markov chain Monte Carlo (MCMC) techniques for computation. Practical applications of our model are demonstrated via applications in near-infrared (NIR) spectroscopy and simulation studies. Our Bayesian kernel models are highly accurate in predicting composition of materials based on its near infrared (NIR) spectroscopy signature. We have compared our method with popularly used methodologies in NIR spectroscopy, like partial least square (PLS), principal component regression (PCA), support vector machine (SVM), Gaussian process regression (GPR), and random forest (RF). In all the simulation and real case studies, our multivariate Bayesian RKHS regression model outperforms the standard methods by a substantially large margin. The implementation of our models based on MCMC is fairly fast and straight forward.
机译:基于复制核希尔伯特空间(RKHS)的非线性回归最近在拟合高维数据中变得非常流行。 RKHS公式可自动减少协变量。当协变量(p)的数量远远超过数据点的数量时,这特别有用。在本文中,我们介绍了针对高维问题的贝叶斯非线性多元回归模型。当我们有多个相关的观察到的响应对应于同一组协变量时,我们的模型是合适的。我们介绍了基于Vapnik不敏感损失函数的多元版本的鲁棒贝叶斯支持向量回归模型。对应于多元Vapnik不敏感损失函数的似然度被构造为正态分布和伽马分布的比例混合。使用再现内核希尔伯特空间(RKHS)中函数的有限表示来构造回归函数。通过在其上分配先验值并使用马尔可夫链蒙特卡洛(MCMC)技术进行计算,可以自适应地估计内核参数。通过在近红外(NIR)光谱学和模拟研究中的应用,证明了我们模型的实际应用。我们的贝叶斯核仁模型基于其近红外(NIR)光谱特征,在预测材料成分方面非常准确。我们将我们的方法与近红外光谱中常用的方法进行了比较,例如偏最小二乘(PLS),主成分回归(PCA),支持向量机(SVM),高斯过程回归(GPR)和随机森林(RF)。在所有模拟和实际案例研究中,我们的多元贝叶斯RKHS回归模型都大大优于标准方法。我们基于MCMC的模型的实现是非常快速和直接的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号