...
首页> 外文期刊>Bioinformatics >A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein function.
【24h】

A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein function.

机译:机器学习方法的比较研究,以预测单核苷酸多态性对蛋白质功能的影响。

获取原文
获取原文并翻译 | 示例
           

摘要

MOTIVATION: The large volume of single nucleotide polymorphism data now available motivates the development of methods for distinguishing neutral changes from those which have real biological effects. Here, two different machine-learning methods, decision trees and support vector machines (SVMs), are applied for the first time to this problem. In common with most other methods, only non-synonymous changes in protein coding regions of the genome are considered. RESULTS: In detailed cross-validation analysis, both learning methods are shown to compete well with existing methods, and to out-perform them in some key tests. SVMs show better generalization performance, but decision trees have the advantage of generating interpretable rules with robust estimates of prediction confidence. It is shown that the inclusion of protein structure information produces more accurate methods, in agreement with other recent studies, and the effect of using predicted rather than actual structure is evaluated. AVAILABILITY: Software is available on request from the authors.
机译:动机:现在可获得的大量单核苷酸多态性数据促使人们开发出将中性变化与具有实际生物学效应的方法区分开的方法。在这里,两种不同的机器学习方法,决策树和支持向量机(SVM),第一次被应用到这个问题上。与大多数其他方法一样,仅考虑基因组蛋白质编码区的非同义变化。结果:在详细的交叉验证分析中,两种学习方法均显示出与现有方法的良好竞争,并且在某些关键测试中均胜过它们。 SVM具有更好的泛化性能,但是决策树的优势在于可以生成具有可预测性的规则,并具有可靠的预测置信度估计。结果表明,与其他最近的研究一致,包含蛋白质结构信息可产生更准确的方法,并且可以评估使用预测结构而非实际结构的效果。可用性:可应作者要求提供软件。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号