首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Application of classification methods when group sizes are unequal by incorporation of prior probabilities to three common approaches: Application to simulations and mouse urinary chemosignals
【24h】

Application of classification methods when group sizes are unequal by incorporation of prior probabilities to three common approaches: Application to simulations and mouse urinary chemosignals

机译:通过将先验概率合并到三种常见方法中来在组大小不相等时分类方法的应用:在模拟和小鼠尿液化学信号中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

Four common classification methods are described, Euclidean Distance to Centroids (EDC), Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA) and Support Vector Machines (SVM). In many applications of chemometrics e.g. in medicine and biology it is common for there to be unequal sample sizes in different groups. When class sizes are unequal the performance of some of these methods may be biased according to class size. This paper describes approaches for incorporating prior probabilities of class membership using Bayesian approaches to three of the methods LDA, QDA and SVM, either assuming equal probability or assuming that the relative sample sizes relate to the relative probabilities. EDC is used as a benchmark to determine model stabilities. The methods are illustrated by four simulated datasets of different structures and one real dataset consisting of the gas chromatographic profile of mouse urine comparing controls to those on a diet.
机译:描述了四种常见的分类方法:欧式距质心距离(EDC),线性判别分析(LDA),二次判别分析(QDA)和支持向量机(SVM)。在化学计量学的许多应用中,例如在医学和生物学领域,不同群体的样本数量通常不相等。当类大小不相等时,其中某些方法的性能可能会根据类大小而有所偏差。本文介绍了使用贝叶斯方法将类成员资格的先验概率合并到LDA,QDA和SVM中的三种方法的方法,这些方法假定概率相等或相对样本量与相对概率有关。 EDC用作确定模型稳定性的基准。该方法由四个具有不同结构的模拟数据集和一个由小鼠尿液的气相色谱图构成的真实数据集进行了说明,这些数据将对照与饮食中的对照进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号