VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS

THOMAS BRENDAN MURPHY; NEMA DEAN; ADRIAN E. RAFTERY

首页> 外文期刊>The Annals of applied statistics >VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS

【24h】

VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS

机译：基于模型的判别分析中的变量选择和更新及其在食品认证中的应用

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Food authenticity studies are concerned with determining if food samples have been correctly labeled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-supervised manner using both labeled and unlabeled data. The method is shown to give excellent classification performance on several high-dimensional multiclass food authenticity data sets with more variables than observations. The variables selected by the proposed method provide information about which variables are meaningful for classification purposes. A headlong search strategy for variable selection is shown to be efficient in terms of computation and achieves excellent classification performance. In applications to several food authenticity data sets, our proposed method outperformed default implementations of Random Forests, AdaBoost, transductive SVMs and Bayesian Multinomial Regression by substantial margins.

机译：食品真实性研究与确定食品样品是否已正确标记有关。判别分析方法是食品认证方法不可或缺的一部分。受食品真实性应用的启发，提出了一种基于模型的判别分析方法，其中包括变量选择。判别分析模型使用标签数据和未标签数据以半监督的方式拟合。结果表明，该方法在多个多维多类食品真实性数据集上具有出色的分类性能，其变量多于观察值。通过提出的方法选择的变量提供有关哪些变量对于分类目的有意义的信息。事实表明，用于变量选择的直接搜索策略在计算方面非常有效，并且可以实现出色的分类性能。在应用于多个食品真实性数据集的过程中，我们提出的方法在很大程度上优于随机森林，AdaBoost，转导支持向量机和贝叶斯多项式回归的默认实现。

著录项

来源
《The Annals of applied statistics》 |2010年第1期|共26页
作者
THOMAS BRENDAN MURPHY; NEMA DEAN; ADRIAN E. RAFTERY;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类高等数学;
关键词
Food authenticity studies; headlong search; model-based discriminant analysis; normal mixture models; semi-supervised learning; updating classification rules; variable selection;

机译：食品真伪研究;长期搜索;基于模型的判别分析;正常混合模型;半监督学习;更新分类规则;变量选择;

相似文献

外文文献
中文文献
专利

1. VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS [J] . THOMAS BRENDAN MURPHY, NEMA DEAN, ADRIAN E. RAFTERY The Annals of applied statistics . 2010,第1期

机译：基于模型的判别分析中的变量选择和更新及其在食品认证中的应用
2. Variable selection for model-based high-dimensional clustering and its application to microarray data. [J] . Wang S, Zhu J Biometrics: Journal of the Biometric Society : An International Society Devoted to the Mathematical and Statistical Aspects of Biology . 2008,第2期

机译：基于模型的高维聚类的变量选择及其在微阵列数据中的应用。
3. High-dimensional data analysis: selection of variables, data compression and graphics--application to gene expression [J] . Lauter J, Horn F, Rosolowski M, Biometrical Journal . 2009,第2期

机译：高维数据分析：变量选择，数据压缩和图形显示-在基因表达中的应用
4. HIGH DIMENSIONAL FEATURE SELECTION FOR DISCRIMINANT MICROARRAY DATA ANALYSIS [C] . JUFU FENG, JIANGXIN SHI, QINGYUN SHI Workshop on data mining and modeling . 2003

机译：判别微阵列数据分析的高维特征选择
5. Variable selection methodology for high -dimensional multivariate binary data with application to microbial community DNA fingerprint analysis. [D] . Wilbur, Jayson Dwight. 2002

机译：高维多元二进制数据的变量选择方法，应用于微生物群落DNA指纹分析。
6. Variable Selection and Updating In Model-Based Discriminant Analysis for High Dimensional Data with Food Authenticity Applications [O] . Thomas Brendan Murphy, Nema Dean, Adrian E. Raftery -1

机译：具有食品真实性应用的高维数据模型的基于模型的判别分析中的变量选择和更新
7. Variable selection and updating in model-based discriminant analysis for high dimensional data with food authenticity applications [O] . Murphy, Thomas Brendan, Dean, Nema, Raftery, Adrian E. 2010

机译：基于模型的判别分析中的变量选择和更新，用于具有食品真实性的高维数据
8. Statistical Analysis of Very High-Dimensional Data Sets of Hierarchically Structured Binary Variables with Missing Data and Application to Marine Corps Readiness Evaluations [R] . Zacks, S., Marlow, W. H., Brier, S. S. 1983

机译：具有缺失数据的分层结构二元变量的超高维数据集的统计分析及其在海军陆战队准备评估中的应用

VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS

摘要

著录项

相似文献

相关主题

期刊订阅