首页> 外文会议>IEEE International Conference on Granular Computing >Comparison of Machine Learning and Pattern Discovery Algorithms for the Prediction of Human Single Nucleotide Polymorphisms
【24h】

Comparison of Machine Learning and Pattern Discovery Algorithms for the Prediction of Human Single Nucleotide Polymorphisms

机译:机器学习与模式发现算法对人单核苷酸多态性预测的比较

获取原文

摘要

This paper compares machine learning techniques and pattern discovery algorithms for the prediction of human single nucleotide polymorphisms (SNPs). We selected six pattern discovery algorithms (YMF, Projection, Weeder, MotifSampler, AlignACE and ANN-Spec) and two machine learning techniques (Random Forests and K-Nearest Neighbours) and applied them to the DNA sequences flanking non-coding SNPs on human chromosome 21. We compared the pattern similarity amongst the methods and validated the predictions using known SNPs on chromosome 22. Parameterization of both machine learning and pattern discovery algorithms was critical to their performance. Memory usage was broadly constant amongst the pattern discovery algorithms, but the CPU running time varied significantly between deterministic and probabilistic pattern discovery methods, i.e., on average, probabilistic methods run19 times slower than deterministic methods. This is the first demonstration of SNP prediction, as well as the first comparison of machine learning and pattern discovery algorithms in SNP prediction studies.
机译:本文比较了用于预测人单核苷酸多态性(SNP)的机器学习技术和模式发现算法。我们选择了六种模式发现算法(YMF,投影,除草剂,MotifSampler,对齐和Ann-SPEC)和两种机器学习技术(随机林和K-CORMALBORS),并将其施加到人类染色体上的非编码SNP中的DNA序列21.我们比较了方法之间的模式相似性,并在染色体上验证了使用已知SNP的预测22.机器学习和模式发现算法的参数化对其性能至关重要。模式发现算法中的内存使用量广泛恒定,但CPU运行时间在确定性和概率模式发现方法之间显着变化,即平均,概率方法比确定方法慢一倍。这是SNP预测的第一次演示,以及SNP预测研究中的机器学习和模式发现算法的第一次比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号