Context-based preprocessing of molecular docking data

Ana T Winck; Karina S Machado; Osmar Norberto de Souza; Duncan D Ruiz

首页> 外文期刊>BMC Genomics >Context-based preprocessing of molecular docking data

【24h】

Context-based preprocessing of molecular docking data

机译：基于上下文的分子对接数据预处理

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

BackgroundData preprocessing is a major step in data mining. In data preprocessing, several known techniques can be applied, or new ones developed, to improve data quality such that the mining results become more accurate and intelligible. Bioinformatics is one area with a high demand for generation of comprehensive models from large datasets. In this article, we propose a context-based data preprocessing approach to mine data from molecular docking simulation results. The test cases used a fully-flexible receptor (FFR) model of Mycobacterium tuberculosis InhA enzyme (FFR_InhA) and four different ligands.ResultsWe generated an initial set of attributes as well as their respective instances. To improve this initial set, we applied two selection strategies. The first was based on our context-based approach while the second used the CFS (Correlation-based Feature Selection) machine learning algorithm. Additionally, we produced an extra dataset containing features selected by combining our context strategy and the CFS algorithm. To demonstrate the effectiveness of the proposed method, we evaluated its performance based on various predictive (RMSE, MAE, Correlation, and Nodes) and context (Precision, Recall and FScore) measures.ConclusionsStatistical analysis of the results shows that the proposed context-based data preprocessing approach significantly improves predictive and context measures and outperforms the CFS algorithm. Context-based data preprocessing improves mining results by producing superior interpretable models, which makes it well-suited for practical applications in molecular docking simulations using FFR models.

机译：BackgroundData预处理是数据挖掘中的重要步骤。在数据预处理中，可以应用几种已知技术或开发新技术来提高数据质量，从而使挖掘结果变得更加准确和可理解。生物信息学是一个需要从大型数据集生成全面模型的领域。在本文中，我们提出了一种基于上下文的数据预处理方法来从分子对接模拟结果中挖掘数据。测试用例使用了结核分枝杆菌InhA酶（FFR_InhA）和四个不同配体的全柔性受体（FFR）模型。结果我们生成了一组初始属性以及它们各自的实例。为了改善此初始设置，我们应用了两种选择策略。第一种基于我们的基于上下文的方法，而第二种基于CFS（基于相关特征的选择）机器学习算法。此外，我们制作了一个额外的数据集，其中包含通过组合上下文策略和CFS算法选择的特征。为了证明所提出方法的有效性，我们基于各种预测（RMSE，MAE，Correlation和Nodes）和上下文（Precision，Recall和FScore）措施评估了其性能。结论对结果的统计分析表明，所提出的方法是基于上下文的数据预处理方法显着改善了预测和上下文度量，并且性能优于CFS算法。基于上下文的数据预处理通过生成出色的可解释模型来改善挖掘结果，这使其非常适合使用FFR模型进行分子对接模拟的实际应用。

著录项

来源
《BMC Genomics》 |2013年第6期|共页
作者
Ana T Winck; Karina S Machado; Osmar Norberto de Souza; Duncan D Ruiz;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类医学遗传学;
关键词

相似文献

外文文献
中文文献
专利

1. Molecular docking of substituted pteridinones and pyrimidines to the ATP-binding site of the N-terminal domain of RSK2 and associated MM/GBSA and molecular field datasets [J] . Kimberly A. Casalvieri, Christopher J. Matheson, Donald S. Backos, Data in Brief . 2020,第2期

机译：取代的蕨菜和嘧啶的分子对接至RSK2和相关MM / GBSA的N-末端结构域的ATP结合位点和分子场数据集
2. System Bioinformatic Approach Through Molecular Docking, Network Pharmacology and Microarray Data Analysis to Determine the Molecular Mechanism Underlying the Effects of Rehmanniae Radix Praeparata on Cardiovascular Diseases [J] . Xiang Zhang, Dongdong Wang, Xiaodong Ren Current Protein and Peptide Science . 2019,第10期

机译：系统生物信息化方法通过分子对接，网络药理学和微阵列数据分析，确定康乃伊桡脂菌对心血管疾病影响的分子机制
3. 6Cl 2O 4) (C 10H 14N 2F) 2·2H 2O]]> [J] . Saadouni Hosna, Daron E. Janzen, Y. Sheena Mary, Spectrochimica acta, Part A. Molecular and biomolecular spectroscopy . 2018,第期

机译：<！[CDATA [CDATA [分子结构，光谱，电介质和热研究，非线性光学性质，天然键轨道，HOMO-LUMO和分子对接分析（C什么：IM =“POST”> 6 CL 2 O 4 ）（C 10 < / CE：INF> H 14 N 2 F） 2> 2 ·2H 2 O]]>
4. Study of Expanded Application of Molecular Docking on Virtual Screening - The case of molecular docking study of Keap1 and Michael Reaction Acceptor Molecules [C] . Xiuli Lu, Shuchao Chen, Yong Zhang, International Conference on Remote Sensing, Environment and Transportation Engineering . 2011

机译：分子对接对虚拟筛选的扩展应用研究 - Keap1和Michael反应受体分子的分子对接研究的情况
5. Model systems for molecular docking: Understanding molecular recognition in polar and charged binding sites. [D] . Boyce, Sarah Emily. 2009

机译：分子对接模型系统：了解极性和带电结合位点的分子识别。
6. Context-based preprocessing of molecular docking data [O] . Ana T Winck, Karina S Machado, Osmar Norberto de Souza, 2013

机译：基于上下文的分子对接数据预处理
7. Context-based preprocessing of molecular docking data [O] . Ana T Winck, Karina S Machado, Osmar de Souza, 2013

机译：基于上下文的分子对接数据预处理

Context-based preprocessing of molecular docking data

摘要

著录项

相似文献

相关主题

期刊订阅