首页> 外文会议>International Conference on Advanced Computer Science and Information Systems >Application of decision tree classifier for single nucleotide polymorphism discovery from next-generation sequencing data
【24h】

Application of decision tree classifier for single nucleotide polymorphism discovery from next-generation sequencing data

机译:决策树分类器在下一代测序数据中的单核苷酸多态性发现中的应用

获取原文

摘要

Single Nucleotide Polymorphism (SNP) is the most abundant form of genetic variation and proven to be advantageous in diverse genetic-related studies. However, accurate determination of true SNPs from next-generation sequencing (NGS) data is a challenging task due to high error rates of NGS. To overcome this problem, we applied a machine learning method using C4.5 decision tree algorithm to discover SNPs from whole-genome NGS data. In addition, we conducted random undersampling to deal with the imbalanced data. The result shows that the proposed method is able to identify most of the true SNPs with more than 90% recall, but still suffers from a high rate of false-positives.
机译:单核苷酸多态性(SNP)是最丰富的遗传变异形式,并且在不同的遗传相关研究中被证明是有利的。然而,由于NGS的高误差率,准确确定来自下一代测序(NGS)数据的真实SNP是一个具有挑战性的任务。为了克服这个问题,我们应用了一种使用C4.5决策树算法的机器学习方法来发现来自全基因组NGS数据的SNP。此外,我们进行了随机缺乏采样来处理不平衡数据。结果表明,该方法能够识别大多数真正的SNP,超过90%的召回,但仍然存在高频率的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号