...
首页> 外文期刊>BMC Bioinformatics >K-OPLS package: Kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space
【24h】

K-OPLS package: Kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space

机译:K-OPLS包:基于内核的正交投影到特征空间中预测和解释的潜在结构

获取原文
           

摘要

Background Kernel-based classification and regression methods have been successfully applied to modelling a wide variety of biological data. The Kernel-based Orthogonal Projections to Latent Structures (K-OPLS) method offers unique properties facilitating separate modelling of predictive variation and structured noise in the feature space. While providing prediction results similar to other kernel-based methods, K-OPLS features enhanced interpretational capabilities; allowing detection of unanticipated systematic variation in the data such as instrumental drift, batch variability or unexpected biological variation. Results We demonstrate an implementation of the K-OPLS algorithm for MATLAB and R, licensed under the GNU GPL and available at http://www.sourceforge.net/projects/kopls/ . The package includes essential functionality and documentation for model evaluation (using cross-validation), training and prediction of future samples. Incorporated is also a set of diagnostic tools and plot functions to simplify the visualisation of data, e.g. for detecting trends or for identification of outlying samples. The utility of the software package is demonstrated by means of a metabolic profiling data set from a biological study of hybrid aspen. Conclusion The properties of the K-OPLS method are well suited for analysis of biological data, which in conjunction with the availability of the outlined open-source package provides a comprehensive solution for kernel-based analysis in bioinformatics applications.
机译:背景技术基于内核的分类和回归方法已成功应用于建模各种生物数据。基于内核的正交投影(K-OPLS)方法提供独特的属性,便于特征空间中的预测变化和结构噪声的单独建模。虽然提供类似于基于内核的方法的预测结果,但K-OPL具有增强的解释能力;允许检测在诸如仪器漂移,批量变异性或意外的生物变化的数据中的意外系统变化。结果我们展示了Matlab和R,在GNU GPL下获得的K-OPLS算法的实现,并在http://www.sourceforge.net/projects/kopls/下获得。该软件包包括模型评估(使用交叉验证),培训和预测未来样本的基本功能和文档。 Incorporated也是一组诊断工具和绘图函数,以简化数据的可视化,例如,用于检测趋势或识别外围样品。通过从混合Aspen的生物学研究中,通过代谢分析数据进行说明软件包的效用。结论K-OPLS方法的性质非常适合于分析生物数据,这与概述的开源包装的可用性结合提供了一种综合解决生物信息学应用中内核的基于核的分析解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号