...
首页> 外文期刊>Journal of the Royal Statistical Society. Series C, Applied statistics >Using unlabelled data to update classification rules with applications in food authenticity studies
【24h】

Using unlabelled data to update classification rules with applications in food authenticity studies

机译:使用未标记的数据更新分类规则,并在食品真伪研究中应用

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

An authentic food is one that is what it purports to be. Food processors and consumers need to be assured that, when they pay for a specific product or ingredient, they are receiving exactly what they pay for. Classification methods are an important tool in food authenticity studies where they are used to assign food samples of unknown type to known types. A classification method is developed where the classification rule is estimated by using both the labelled and the unlabelled data, in contrast with many classical methods which use only the labelled data for estimation. This methodology models the data as arising from a Gaussian mixture model with parsimonious covariance structure, as is done in model-based clustering. A missing data formulation of the mixture model is used and the models are fitted by using the EM and classification EM algorithms. The methods are applied to the analysis of spectra of food-stuffs recorded over the visible and near infra-red wavelength range in food authenticity studies. A comparison of the performance of model-based discriminant analysis and the method of classification proposed is given. The classification method proposed is shown to yield very good misclassification rates. The correct classification rate was observed to be as much as 15% higher than the correct classification rate for model-based discriminant analysis.
机译:真正的食物就是它所要的。食品加工者和消费者需要确保,当他们为特定产品或成分付费时,他们所收到的正是他们所支付的。分类方法是食品真实性研究的重要工具,用于将未知类型的食物样品分配给已知类型。与许多仅使用标记数据进行估计的经典方法相比,开发了一种分类方法,其中通过使用标记数据和未标记数据来估计分类规则。这种方法对数据进行建模,该数据来自具有简约协方差结构的高斯混合模型,就像在基于模型的聚类中所做的那样。使用了缺少的混合模型数据公式,并使用EM和分类EM算法对模型进行了拟合。该方法用于食品真实性研究中可见光和近红外波长范围内记录的食品光谱分析。给出了基于模型的判别分析与分类方法的性能比较。所提出的分类方法显示出非常好的误分类率。观察到正确的分类率比基于模型的判别分析的正确分类率高15%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号