...
首页> 外文期刊>BMC Bioinformatics >Mixture of experts models to exploit global sequence similarity on biomolecular sequence labeling
【24h】

Mixture of experts models to exploit global sequence similarity on biomolecular sequence labeling

机译:专家模型的混合,以利用生物分子序列标记上的全局序列相似性

获取原文

摘要

Background Identification of functionally important sites in biomolecular sequences has broad applications ranging from rational drug design to the analysis of metabolic and signal transduction networks. Experimental determination of such sites lags far behind the number of known biomolecular sequences. Hence, there is a need to develop reliable computational methods for identifying functionally important sites from biomolecular sequences. Results We present a mixture of experts approach to biomolecular sequence labeling that takes into account the global similarity between biomolecular sequences. Our approach combines unsupervised and supervised learning techniques. Given a set of sequences and a similarity measure defined on pairs of sequences, we learn a mixture of experts model by using spectral clustering to learn the hierarchical structure of the model and by using bayesian techniques to combine the predictions of the experts. We evaluate our approach on two biomolecular sequence labeling problems: RNA-protein and DNA-protein interface prediction problems. The results of our experiments show that global sequence similarity can be exploited to improve the performance of classifiers trained to label biomolecular sequence data. Conclusion The mixture of experts model helps improve the performance of machine learning methods for identifying functionally important sites in biomolecular sequences.
机译:背景技术鉴定生物分子序列中功能上重要的位点具有广泛的应用范围,从合理的药物设计到代谢和信号转导网络的分析。对这些位点的实验确定远远落后于已知生物分子序列的数量。因此,需要开发可靠的计算方法以从生物分子序列中鉴定功能上重要的位点。结果我们提出了一种混合了生物分子序列标签的专家方法,其中考虑了生物分子序列之间的全局相似性。我们的方法结合了无监督和有监督的学习技术。给定一组序列和在序列对上定义的相似性度量,我们通过使用光谱聚类学习模型的层次结构并使用贝叶斯技术结合专家的预测来学习专家模型的混合。我们评估我们对两个生物分子序列标记问题的方法:RNA蛋白质和DNA蛋白质界面预测问题。我们的实验结果表明,可以利用全局序列相似性来提高训练以标记生物分子序列数据的分类器的性能。结论专家模型的混合有助于提高机器学习方法在生物分子序列中识别功能重要位点的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号