首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Selection of preprocessing methodology for multivariate regression of cellular FTIR and Raman spectra in radiobiological analyses
【24h】

Selection of preprocessing methodology for multivariate regression of cellular FTIR and Raman spectra in radiobiological analyses

机译:放射生物学分析中细胞FTIR和拉曼光谱多元回归的预处理方法的选择

获取原文

摘要

Vibrational spectra of biological species suffer from the influence of many extraneous interfering factors that require removal through preprocessing before analysis. The present study was conducted to optimise the preprocessing methodology and variable subset selection during regression of and confocal Raman microspectroscopy (CRM) and Fourier Transform Infrared microspectroscopy (FTIRM) spectra against ionizing radiation dose. Skin cells were γ-irradiated in-vitro and their Raman and FTIRM spectra were used to retrospectively predict the radiation dose using linear and nonlinear partial least squares (PLS) regression algorithms in addition to support vector regression (SVR). The optimal preprocessing methodology (which comprised combinations of spectral filtering, baseline subtraction, scaling and normalization options) was selected using a genetic algorithm (GA) with the root mean squared error of prediction (RMSEP) used as the fitness criterion for selection of the preprocessing chromosome (where this was calculated on an independent set of test spectra randomly selected from the dataset on each pass of the algorithm). The results indicated that GA selection of the optimal preprocessing methodology substantially improved the predictive capacity of the regression algorithms over baseline methodologies, although the optimal preprocessing chromosomes were similar for various regression algorithms, suggesting an optimal preprocessing methodology for radiobiological analyses with biospectroscopy. Feature selection of both FTIRM and CRM spectra using genetic algorithms and multivariate regression provided further decreases in RMSEP, but only with non-linear multivariate regression algorithms.
机译:生物物种的振动光谱受到许多外部干扰因素的影响,这些因素需要在分析之前通过预处理去除。进行本研究以优化预处理方法和共聚焦拉曼光谱(CRM)和傅里叶变换红外光谱(FTIRM)光谱对电离辐射剂量的回归过程中的变量子集选择。皮肤细胞进行了γ射线体外照射,除了支持向量回归(SVR)外,还使用线性和非线性偏最小二乘(PLS)回归算法对它们的拉曼光谱和FTIRM光谱进行了回顾性预测辐射剂量。使用遗传算法(GA)选择最佳的预处理方法(包括光谱滤波,基线减法,缩放和归一化选项的组合),并将预测的均方根误差(RMSEP)用作选择预处理的适用性标准染色体(在算法的每个遍次上从数据集中随机选择的一组独立测试光谱中计算得出)。结果表明,尽管最佳预处理染色体与各种回归算法相似,但最佳预处理方法的遗传算法选择大大提高了回归算法的预测能力,这表明采用生物光谱学进行放射生物学分析的最佳预处理方法。使用遗传算法和多元回归对FTIRM和CRM光谱进行特征选择,可进一步降低RMSEP,但仅限于非线性多元回归算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号