首页> 外文期刊>Bioinformatics >Novel applications of multitask learning and multiple output regression to multiple genetic trait prediction
【24h】

Novel applications of multitask learning and multiple output regression to multiple genetic trait prediction

机译:多任务学习和多输出回归在多种遗传性状预测中的新应用

获取原文
获取原文并翻译 | 示例
       

摘要

Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models. In many cases, for the same set of samples and markers, multiple traits are observed. Some of these traits might be correlated with each other. Therefore, modeling all the multiple traits together may improve the prediction accuracy. In this work, we view the multitrait prediction problem from a machine learning angle: as either a multitask learning problem or a multiple output regression problem, depending on whether different traits share the same genotype matrix or not. We then adapted multitask learning algorithms and multiple output regression algorithms to solve the multitrait prediction problem. We proposed a few strategies to improve the least square error of the prediction from these algorithms. Our experiments show thatmodelingmultiple traits together could improve the prediction accuracy for correlated traits.
机译:给定一组双等位基因分子标记(例如SNP),其基因型值在植物,动物或人类样品的集合上进行数字编码,遗传特征预测的目标是通过同时建模所有标记效应来预测定量特征值。遗传特征预测通常表示为线性回归模型。在许多情况下,对于同一组样品和标记物,观察到多种性状。其中一些特征可能相互关联。因此,将所有多个特征一起建模可以提高预测准确性。在这项工作中,我们从机器学习的角度看待多特征预测问题:作为多任务学习问题还是多输出回归问题,这取决于不同的特征是否共享相同的基因型矩阵。然后,我们采用了多任务学习算法和多输出回归算法来解决多特征预测问题。我们提出了一些策略来改善这些算法的预测的最小平方误差。我们的实验表明,一起对多个性状进行建模可以提高相关性状的预测准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号