首页> 美国卫生研究院文献>AAPS PharmSciTech >A Training Set Selection Strategy for a Universal Near-Infrared Quantitative Model
【2h】

A Training Set Selection Strategy for a Universal Near-Infrared Quantitative Model

机译:通用近红外定量模型的训练集选择策略

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The purpose of this article is to propose an empirical solution to the problem of how many clusters of complex samples should be selected to construct the training set for a universal near infrared quantitative model based on the Næs method. The sample spectra were hierarchically classified into clusters by Ward’s algorithm and Euclidean distance. If the sample spectra were classified into two clusters, the 1/50 of the largest Heterogeneity value in the cluster with larger variation was set as the threshold to determine the total number of clusters. One sample was then randomly selected from each cluster to construct the training set, and the number of samples in training set equaled the number of clusters. In this study, 98 batches of rifampicin capsules with API contents ranging from 50.1% to 99.4% were studied with this strategy. The root mean square errors of cross validation and prediction were 2.54% and 2.31% for the model for rifampicin capsules, respectively. Then, we evaluated this model in terms of outlier diagnostics, accuracy, precision, and robustness. We also used the strategy of training set sample selection to revalidate the models for cefradine capsules, roxithromycin tablets, and erythromycin ethylsuccinate tablets, and the results were satisfactory. In conclusion, all results showed that this training set sample selection strategy assisted in the quick and accurate construction of quantitative models using near-infrared spectroscopy.
机译:本文的目的是为应该选择多少个复杂样本簇以基于Næs方法构建通用近红外定量模型的训练集的问题提供一种经验解决方案。根据沃德算法和欧几里得距离,将样品光谱按层次划分为簇。如果将样品光谱分为两个簇,则将簇中具有较大变化的最大异质性值的1/50设置为确定簇总数的阈值。然后从每个聚类中随机选择一个样本以构建训练集,训练集中的样本数等于聚类数。在这项研究中,使用该策略研究了98批次的API含量从50.1%到99.4%的利福平胶囊。交叉验证和预测的利福平胶囊模型的均方根误差分别为2.54%和2.31%。然后,我们从异常诊断,准确性,准确性和鲁棒性方面评估了该模型。我们还使用了训练集样本选择的策略来重新验证头孢拉定胶囊,罗红霉素片剂和红霉素乙基琥珀酸酯片剂的模型,结果令人满意。总之,所有结果表明,该训练集样本选择策略有助于使用近红外光谱法快速准确地构建定量模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号