首页> 外文期刊>The Analyst: The Analytical Journal of the Royal Society of Chemistry: A Monthly International Publication Dealing with All Branches of Analytical Chemistry >The Monte Carlo validation framework for the discriminant partial least squares model extended with variable selection methods applied to authenticity studies of Viagra (R) based on chromatographic impurity profiles
【24h】

The Monte Carlo validation framework for the discriminant partial least squares model extended with variable selection methods applied to authenticity studies of Viagra (R) based on chromatographic impurity profiles

机译:判别式偏最小二乘模型的蒙特卡洛验证框架,采用可变选择方法进行了扩展,该方法应用于基于色谱杂质图谱的伟哥(R)的真实性研究

获取原文
获取原文并翻译 | 示例
       

摘要

The aim of this work was to develop a general framework for the validation of discriminant models based on the Monte Carlo approach that is used in the context of authenticity studies based on chromatographic impurity profiles. The performance of the validation approach was applied to evaluate the usefulness of the diagnostic logic rule obtained from the partial least squares discriminant model (PLS-DA) that was built to discriminate authentic ViagraO samples from counterfeits (a two-class problem). The major advantage of the proposed validation framework stems from the possibility of obtaining distributions for different figures of merit that describe the PLS-DA model such as, e.g., sensitivity, specificity, correct classification rate and area under the curve in a function of model complexity. Therefore, one can quickly evaluate their uncertainty estimates. Moreover, the Monte Carlo model validation allows balanced sets of training samples to be designed, which is required at the stage of the construction of PLS-DA and is recommended in order to obtain fair estimates that are based on an independent set of samples. In this study, as an illustrative example, 46 authentic ViagraO samples and 97 counterfeit samples were analyzed and described by their impurity profiles that were determined using high performance liquid chromatography with photodiode array detection and further discriminated using the PLS-DA approach. In addition, we demonstrated how to extend the Monte Carlo validation framework with four different variable selection schemes: the elimination of uninformative variables, the importance of a variable in projections, selectivity ratio and significance multivariate correlation. The best PLS-DA model was based on a subset of variables that were selected using the variable importance in the projection approach. For an independent test set, average estimates with the corresponding standard deviation (based on 1000 Monte Carlo runs) of the correct classification rate, sensitivity, specificity and area under the curve were equal to 96.42% +/- 2.04, 98.69% +/- 1.38, 94.16% +/- 3.52 and 0.982 +/- 0.017, respectively.
机译:这项工作的目的是开发一个基于蒙特卡洛方法的判别模型验证的通用框架,该框架用于基于色谱杂质图谱的真实性研究中。验证方法的性能用于评估从部分最小二乘判别模型(PLS-DA)获得的诊断逻辑规则的有用性,该模型用于从假冒品中区分出真正的伟哥样品(两类问题)。所提出的验证框架的主要优点来自于获得描述PLS-DA模型的不同品质因数的分布的可能性,例如,灵敏度,特异性,正确的分类率和曲线下面积随模型复杂度的变化。因此,可以快速评估其不确定性估计。此外,蒙特卡洛模型验证允许设计平衡的训练样本集,这在PLS-DA的构建阶段是必需的,建议使用此方法,以便获得基于独立样本集的公平估计。在本研究中,作为说明性示例,对46个真实的ViagraO样品和97个假冒样品进行了分析和描述,其杂质分布使用高效液相色谱和光电二极管阵列检测确定,并使用PLS-DA方法进行进一步区分。此外,我们演示了如何使用四种不同的变量选择方案扩展Monte Carlo验证框架:消除非信息变量,变量在预测中的重要性,选择性比和显着性多元相关性。最佳的PLS-DA模型基于变量的子集,这些子集是使用投影方法中的变量重要性选择的。对于独立测试集,正确估计率,敏感性,特异性和曲线下面积的相应标准偏差(基于1000蒙特卡洛试验)的平均估计值等于96.42%+/- 2.04、98.69%+/-分别为1.38、94.16%+/- 3.52和0.982 +/- 0.017。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号