首页> 外国专利> METHODS OF PREDICTING PATHOGENICITY OF GENETIC SEQUENCE VARIANTS

METHODS OF PREDICTING PATHOGENICITY OF GENETIC SEQUENCE VARIANTS

机译:预测基因序列变异的致病性的方法

摘要

Recent developments in cost-effective DNA sequencing allows for individualized genomic screening of a subject for genetic sequence variants. Training a pathogenicity prediction model using semi-supervised training methods produces a better model for predicting the pathogenicity of a test genetic sequence variant. Provided herein are methods for predicting the pathogenicity of a test genetic sequence variant by utilizing a training data set comprising labeled benign genetic sequence variants unlabeled genetic sequence variants, the unlabeled genetic sequence variants comprising a mixture of benign genetic sequence variants and pathogenic genetic sequence variants. The genetic sequences are annotated with one or more features and a machine learning model is trained in a semi-supervised process based on the training data. The test genetic sequence is then annotated using the one or more features and the probability that the test genetic sequence variant is pathogenic is predicted based on the trained machine learning model.
机译:具有成本效益的DNA测序的最新进展允许对个体进行基因序列变异的个体基因组筛选。使用半监督训练方法训练致病性预测模型会产生更好的模型,用于预测测试基因序列变异的致病性。本文提供了通过利用训练数据集来预测测试遗传序列变体的致病性的方法,该训练数据集包括标记的良性遗传序列变体,未标记的遗传序列变体,未标记的遗传序列变体,包括良性遗传序列变体和致病性遗传序列变体的混合物。遗传序列带有一个或多个特征,并基于训练数据在半监督过程中训练了机器学习模型。然后,使用一个或多个特征对测试遗传序列进行注释,并基于训练后的机器学习模型预测测试遗传序列变体具有致病性的可能性。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号