...
首页> 外文期刊>BMC Genomics >Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach
【24h】

Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach

机译:数据融合的基因组预测和人衰老基因的优先级:机器学习方法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

BACKGROUND:Machine learning can effectively nominate novel genes for various research purposes in the laboratory. On a genome-wide scale, we implemented multiple databases and algorithms to predict and prioritize the human aging genes (PPHAGE).RESULTS:We fused data from 11 databases, and used Na?ve Bayes classifier and positive unlabeled learning (PUL) methods, NB, Spy, and Rocchio-SVM, to rank human genes in respect with their implication in aging. The PUL methods enabled us to identify a list of negative (non-aging) genes to use alongside the seed (known age-related) genes in the ranking process. Comparison of the PUL algorithms revealed that none of the methods for identifying a negative sample were advantageous over other methods, and their simultaneous use in a form of fusion was critical for obtaining optimal results (PPHAGE is publicly available at https://cbb.ut.ac.ir/pphage).CONCLUSION:We predict and prioritize over 3,000 candidate age-related genes in human, based on significant ranking scores. The identified candidate genes are associated with pathways, ontologies, and diseases that are linked to aging, such as cancer and diabetes. Our data offer a platform for future experimental research on the genetic and biological aspects of aging. Additionally, we demonstrate that fusion of PUL methods and data sources can be successfully used for aging and disease candidate gene prioritization.
机译:背景:机器学习可以有效地提名实验室各种研究目的的新基因。在基因组范围内,我们实施了多个数据库和算法以预测和优先考虑人衰老基因(PPhage)。结果:我们从11个数据库中融合了数据,并使用Na ve贝雷斯分类器和积极的未标记学习(普通未标记的学习(普通未标记的学习(Pul)方法, Nb,spy和rocchio-svm,在衰老方面对人类进行排名。培养方法使我们能够鉴定阳性(非老化)基因列表,以与排名过程中的种子(已知年龄相关)基因一起使用。脉动算法的比较显示,鉴定阴性样品的任何方法都没有其他方法是有利的,并且它们以融合形式的同时使用对于获得最佳结果至关重要(PPhage在HTTPS://cbb.ut上公开可用.C.IR / pPhage)。基于大量排名分数,结论,我们预测和优先于人类中有超过3,000名与年龄相关基因的探讨。所确定的候选基因与与老化有关的途径,本体和疾病相关,例如癌症和糖尿病。我们的数据为未来的衰老和生物学方面的未来实验研究提供了一个平台。另外,我们证明含铅方法和数据源的融合可以成功地用于老化和疾病候选基因优先级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号