首页> 外文期刊>SAR and QSAR in Environmental Research >In silico toxicity prediction by support vector machine and SMILES representation-based string kernel
【24h】

In silico toxicity prediction by support vector machine and SMILES representation-based string kernel

机译:基于支持向量机和基于SMILES表示的字符串核的计算机毒性预测

获取原文
获取原文并翻译 | 示例
           

摘要

There is a great need to assess the harmful effects or toxicities of chemicals to which man is exposed. In the present paper, the simplified molecular input line entry specification (SMILES) representation-based string kernel, together with the state-of-the-art support vector machine (SVM) algorithm, were used to classify the toxicity of chemicals from the US Environmental Protection Agency Distributed Structure-Searchable Toxicity (DSSTox) database network. In this method, the molecular structure can be directly encoded by a series of SMILES substrings that represent the presence of some chemical elements and different kinds of chemical bonds (double, triple and stereochemistry) in the molecules. Thus, SMILES string kernel can accurately and directly measure the similarities of molecules by a series of local information hidden in the molecules. Two model validation approaches, five-fold cross-validation and independent validation set, were used for assessing the predictive capability of our developed models. The results obtained indicate that SVM based on the SMILES string kernel can be regarded as a very promising and alternative modelling approach for potential toxicity prediction of chemicals.View full textDownload full textKeywordstoxicity prediction, structure-toxicity relationship (STR), support vector machine (SVM), simplified molecular input line entry specification (SMILES), string kernelRelated var addthis_config = { ui_cobrand: "Taylor & Francis Online", services_compact: "citeulike,netvibes,twitter,technorati,delicious,linkedin,facebook,stumbleupon,digg,google,more", pubid: "ra-4dff56cd6bb1830b" }; Add to shortlist Link Permalink http://dx.doi.org/10.1080/1062936X.2011.645874
机译:迫切需要评估人类所接触化学物质的有害影响或毒性。在本文中,基于简化分子输入线输入规范(SMILES)表示的字符串核,以及最新的支持向量机(SVM)算法,被用于对美国化学品的毒性进行分类。环境保护局分布式结构可搜索毒性(DSSTox)数据库网络。在这种方法中,分子结构可以由一系列SMILES子串直接编码,这些子串代表分子中某些化学元素和不同种类的化学键(双键,三键和立体化学)的存在。因此,SMILES字符串内核可以通过隐藏在分子中的一系列局部信息来准确而直接地测量分子的相似性。两种模型验证方法(五重交叉验证和独立验证集)用于评估我们开发的模型的预测能力。所得结果表明,基于SMILES字符串核的SVM可以被视为对化学物潜在毒性进行预测的非常有前途的替代方法。查看全文下载全文关键词毒性预测,结构-毒性关系(STR),支持向量机(SVM) ),简化的分子输入行输入规范(SMILES),字符串kernelRelated var addthis_config = {ui_cobrand:“泰勒和弗朗西斯在线”,servicescompact:“ citeulike,netvibes,twitter,technorati,delicious,linkedin,facebook,stumbleupon,digg,google,更多”,发布号:“ ra-4dff56cd6bb1830b”};添加到候选列表链接永久链接http://dx.doi.org/10.1080/1062936X.2011.645874

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号