首页> 外文期刊>The Annals of applied statistics >VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS
【24h】

VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS

机译:基于模型的判别分析中的变量选择和更新及其在食品认证中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Food authenticity studies are concerned with determining if food samples have been correctly labeled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-supervised manner using both labeled and unlabeled data. The method is shown to give excellent classification performance on several high-dimensional multiclass food authenticity data sets with more variables than observations. The variables selected by the proposed method provide information about which variables are meaningful for classification purposes. A headlong search strategy for variable selection is shown to be efficient in terms of computation and achieves excellent classification performance. In applications to several food authenticity data sets, our proposed method outperformed default implementations of Random Forests, AdaBoost, transductive SVMs and Bayesian Multinomial Regression by substantial margins.
机译:食品真实性研究与确定食品样品是否已正确标记有关。判别分析方法是食品认证方法不可或缺的一部分。受食品真实性应用的启发,提出了一种基于模型的判别分析方法,其中包括变量选择。判别分析模型使用标签数据和未标签数据以半监督的方式拟合。结果表明,该方法在多个多维多类食品真实性数据集上具有出色的分类性能,其变量多于观察值。通过提出的方法选择的变量提供有关哪些变量对于分类目的有意义的信息。事实表明,用于变量选择的直接搜索策略在计算方面非常有效,并且可以实现出色的分类性能。在应用于多个食品真实性数据集的过程中,我们提出的方法在很大程度上优于随机森林,AdaBoost,转导支持向量机和贝叶斯多项式回归的默认实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号