首页> 外文期刊>Spectrochimica acta, Part A. Molecular and biomolecular spectroscopy >Discrimination of raw and processed Dipsacus asperoides by near infrared spectroscopy combined with least squares-support vector machine and random forests
【24h】

Discrimination of raw and processed Dipsacus asperoides by near infrared spectroscopy combined with least squares-support vector machine and random forests

机译:结合最小二乘支持向量机和随机森林的近红外光谱法鉴别生和加工的双歧曲霉

获取原文
获取原文并翻译 | 示例
       

摘要

Most herbal medicines could be processed to fulfill the different requirements of therapy. The purpose of this study was to discriminate between raw and processed Dipsacus asperoides, a common traditional Chinese medicine, based on their near infrared (NIR) spectra. Least squares-support vector machine (LS-SVM) and random forests (RF) were employed for full-spectrum classification. Three types of kernels, including linear kernel, polynomial kernel and radial basis function kernel (RBF), were checked for optimization of LS-SVM model. For comparison, a linear discriminant analysis (LDA) model was performed for classification, and the successive projections algorithm (SPA) was executed prior to building an LDA model to choose an appropriate subset of wavelengths. The three methods were applied to a dataset containing 40 raw herbs and 40 corresponding processed herbs. We ran 50 runs of 10-fold cross validation to evaluate the model's efficiency. The performance of the LS-SVM with RBF kernel (RBF LS-SVM) was better than the other two kernels. The RF, RBF LS-SVM and SPA-LDA successfully classified all test samples. The mean error rates for the 50 runs of 10-fold cross validation were 1.35% for RBF LS-SVM, 2.87% for RF, and 2.50% for SPA-LDA. The best classification results were obtained by using LS-SVM with RBF kernel, while RF was fast in the training and making predictions.
机译:大多数草药都可以加工以满足不同的治疗要求。这项研究的目的是根据近红外(NIR)光谱来区分生的和加工的Dipsacus asperoides(一种常见的中药)。最小二乘支持向量机(LS-SVM)和随机森林(RF)用于全光谱分类。检查了三种类型的核,包括线性核,多项式核和径向基函数核(RBF),以优化LS-SVM模型。为了进行比较,执行了线性判别分析(LDA)模型进行分类,并在建立LDA模型以选择合适的波长子集之前执行了连续投影算法(SPA)。将这三种方法应用于包含40种原始草药和40种相应加工草药的数据集。我们进行了50次10倍交叉验证,以评估模型的效率。带有RBF内核的LS-SVM(RBF LS-SVM)的性能优于其他两个内核。 RF,RBF LS-SVM和SPA-LDA成功地对所有测试样品进行了分类。 50次10倍交叉验证的平均错误率对于RBF LS-SVM为1.35%,对于RF为2.87%,对于SPA-LDA为2.50%。带有RBF核的LS-SVM可以得到最好的分类结果,而RF的训练和预测速度很快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号