首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Does encoding matter? A novel view on the quantitative genetic trait prediction problem
【24h】

Does encoding matter? A novel view on the quantitative genetic trait prediction problem

机译:编码吗?关于定量遗传性质预测问题的新视图

获取原文

摘要

Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models which require quantitative encodings for the genotypes. There are lots of work on the prediction algorithms, but none of the existing work investigated the effects of the encodings on the genetic trait prediction problem. In this work, we view the genetic trait prediction problem from a novel angle: a multiple regression on categorical data problem, which requires encoding the categorical data into numerical data. We evaluate various encoding mechanisms and investigate by theory how different encodings affect the performance of the genetic trait prediction algorithms. To our knowledge, this is the first analysis on different encoding mechanisms for genetic trait prediction problem. We further proposed two novel encoding methods and we show that they are able to generate numerical features with higher predictive power. Our experiments show that our methods are superior to the other encoding methods for both single marker model and epistasis model.
机译:给定一组双等位基因的分子标记物,如单核苷酸多态性的,对植物,动物或人类的样品集合数字编码基因型值,遗传性状预测的目的是通过同时建模所有标记效应预测数量性状值。遗传性状预测通常表示为线性回归模型,其要求用于基因型定量编码。上有预测算法大量的工作,但没有一个现有工作的研究对遗传性状预测问题的编码的效果。在这项工作中,我们认为从一个新颖的角度遗传性状预测问题:在分类数据的问题多重回归,这需要分类数据编码成数字数据。我们评估各种编码机制,通过理论探讨不同的编码是如何影响的遗传性状预测算法的性能。据我们所知,这是对遗传性状预测问题不同的编码机制,第一个分析。我们进一步提出了两种新的编码方法,我们表明,他们能够产生数值特点具有较高的预测能力。我们的实验表明,我们的方法是优于其他编码方式为单标记模式和上位性模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号