...
首页> 外文期刊>Journal of the royal statistical society >Automated feature extraction from profiles with application to a batch fermentation process
【24h】

Automated feature extraction from profiles with application to a batch fermentation process

机译:从配置文件中自动提取特征并将其应用于批量发酵过程

获取原文
获取原文并翻译 | 示例

摘要

An automated approach to extract interpretable features of univariate or multivariate profiles (functional data) is proposed. A landmark alignment algorithm is modified and the alignment is combined with piecewise linear approximations. Least absolute shrinkage and selection operator (lasso) regression is used for selecting the most important intercepts and slopes and yields an alternative to partial least squares to model a response associated with the profiles. Latent variables can be difficult to interpret but our extracted features simply correspond to slopes and intercepts of particular parts of the profiles. Also, features that relate to the degree of warping between a given profile and a reference can be extracted as predictors. Selection criteria for the number of knots and common knot locations between profiles are developed. We apply our proposed method to batch fermentation data where the profiles consist of on-line measurements of process variables and the corresponding yield of the process. The extracted features have good interpretability (with large dimensional reduction) and in combination with the lasso have prediction accuracy which is comparable with that of partial least squares applied to the original profiles. Also our proposed feature extraction method is applied to publicly available data where near infrared spectra define the profiles and the prediction accuracy of our feature lasso method is comparable with those of more complicated alternatives.
机译:提出了一种自动方法来提取单变量或多变量配置文件(功能数据)的可解释特征。修改了界标对齐算法,并将对齐方式与分段线性逼近相结合。最小绝对收缩和选择算子(lasso)回归用于选择最重要的截距和斜率,并生成偏最小二乘的替代方法来模拟与剖面关联的响应。潜在变量可能难以解释,但我们提取的特征仅对应于剖面特定部分的斜率和截距。同样,可以提取与给定轮廓和参考之间的翘曲程度有关的特征作为预测变量。提出了轮廓之间的结数和公共结位置的选择标准。我们将我们提出的方法应用于批量发酵数据,其中配置文件包括过程变量的在线测量和过程的相应产量。提取的特征具有良好的可解释性(具有较大的尺寸缩减),并且与套索结合使用时的预测精度与应用于原始轮廓的局部最小二乘法的预测精度相当。同样,我们提出的特征提取方法适用于公开数据,其中近红外光谱定义了轮廓,并且我们的特征套索方法的预测准确性可与更复杂的替代方法相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号