...
首页> 外文期刊>The Open Bioinformatics Journal >iMPT-FRAKEL A Simple Multi-label Web-server that Only Uses Fingerprints to Identify which Metabolic Pathway Types Compounds can Participate In
【24h】

iMPT-FRAKEL A Simple Multi-label Web-server that Only Uses Fingerprints to Identify which Metabolic Pathway Types Compounds can Participate In

机译:IMPT-FRAKEL一个简单的多标签网站服务器,只使用指纹来确定哪种代谢途径类型可以参与

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Background: Metabolic pathway is one of the most basic biological pathways in living organisms. It consists of a series of chemical reactions and provides the necessary molecules and energies for organisms. To date, lots of metabolic pathways have been detected. However, there still exist hidden participants (compounds and enzymes) for some metabolic pathways due to the complexity and diversity of metabolic pathways. It is necessary to develop quick, reliable, and non-animal-involved prediction model to recognize metabolic pathways for any compound. Methods: In this study, a multi-label classifier, namely iMPT-FRAKEL, was developed for identifying which metabolic pathway types that compounds can participate in. Compounds and 12 metabolic pathway types were retrieved from KEGG. Each compound was represented by its fingerprints, which was the most widely used form for representing compounds and can be extracted from its SMILES format. A popular multi-label classification scheme, Random k-Labelsets (RAKEL) algorithm, was adopted to build the classifier. Classic machine learning algorithm, Support Vector Machine (SVM) with RBF kernel, was selected as the basic classification algorithm. Ten-fold cross-validation was used to evaluate the performance of the iMPT-FRAKEL. In addition, a web-server version of such classifier was set up, which can be assessed at http://cie.shmtu.edu.cn/impt/index. Results: iMPT-FRAKEL yielded the accuracy of 0.804, exact match of 0.745 and hamming loss of 0.039. Comparison results indicated that such classifier was superior to other models, including models with Binary Relevance (BR) or other classification algorithms. Conclusion: The proposed classifier employed limited prior knowledge of compounds but gives satisfying performance for recognizing metabolic pathways of compounds.
机译:背景:代谢途径是生物体中最基本的生物途径之一。它由一系列化学反应组成,并为生物提供必要的分子和能量。迄今为止,已检测到许多代谢途径。然而,由于代谢途径的复杂性和多样性,仍有一些代谢途径存在隐藏的参与者(化合物和酶)。有必要开发快速,可靠和非动物涉及的预测模型,以识别任何化合物的代谢途径。方法:在本研究中,开发了一种多标签分类器,即IMPT-FRAKEL,用于鉴定化合物可以参与哪种代谢途径类型。从Kegg检索化合物和12种代谢途径类型。每个化合物由其指纹表示,其是用于代表化合物的最广泛使用的形式,可以从其微笑的形式中提取。采用流行的多标签分类方案,随机k-labelsets(Rakel)算法构建分类器。 Classic Machine Learning算法,支持向量机(SVM)与RBF内核,作为基本分类算法。使用十倍的交叉验证来评估IMPT-FRAKEL的性能。此外,已设置此类分类器的Web服务器版本,可以在http://cie.shmtu.edu.cn/impt/index上进行评估。结果:IMPT-FRAKEL产生0.804的精度,精确匹配0.745,汉明损失为0.039。比较结果表明,此类分类器优于其他模型,包括具有二进制相关性(BR)或其他分类算法的模型。结论:拟议的分类器采用有限的化合物认识,但提供了识别化合物代谢途径的满足性能。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号