The impact of feature selection methods on machine learning-based docking prediction of Indonesian medicinal plant compounds and HIV-1 protease


Abstract

This work evaluates the use of feature selection methods to reduce the number of features required to predict docking results between Indonesian medicinal plant compounds and HIV protease. Two feature selection methods, Recursive Feature Elimination (RFE) and Wrapper Method (WM), are trained on a dataset of 7,330 samples and 667 features from PubChem BioAssay and DUD-E decoys. To evaluate the selected features, a dataset of 368 Indonesian herbal chemical compounds, labeled by manually docking them to PDB HIV-1 protease, is used to benchmark the performance of a linear SVM classifier with different sets of features. Our experiments show that a set of 471 features selected by RFE and a set of 249 features selected by WM reduce classification time by 4.0 and 8.2 seconds, respectively. Although accuracy and sensitivity also increase by 8% and 16%, no meaningful improvement is observed for precision or specificity.
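The four metrics named in the abstract (accuracy, sensitivity, precision, specificity) all derive from a binary confusion matrix. The following is a minimal sketch of how they could be computed, not the authors' evaluation code. The function name `binary_metrics` and the label vectors are hypothetical and chosen only for illustration; they are not data from the paper.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, sensitivity, precision, and specificity for 0/1 labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    def ratio(numerator: int, denominator: int) -> float:
        # Return 0.0 instead of raising when a class is absent.
        return numerator / denominator if denominator else 0.0

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": ratio(tp, tp + fn),   # recall on the positive class
        "precision": ratio(tp, tp + fp),
        "specificity": ratio(tn, tn + fp),   # recall on the negative class
    }


# Hypothetical labels for illustration only (1 = predicted to dock, 0 = not).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
metrics = binary_metrics(y_true, y_pred)

for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity depends only on the positive class (TP and FN), while specificity depends only on the negative class (TN and FP), and precision is the only one of the three with FP in its denominator, so a change in the feature set can raise sensitivity without moving the other two much.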


Bibliographic record

Source
International Conference on Advanced Computer Science and Information Systems | 2019 | 1 v. | 6 pages
Authors
Rahman Pujianto; Yohanes Gultom; Ari Wibisono; Arry Yanuar; Heru Suhartanto
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords
feature extraction; feature selection; learning (artificial intelligence); medical computing; medicine; microorganisms; pattern classification; support vector machines;


Similar literature

1. A novel feature selection method based on global sensitivity analysis with application in machine learning-based prediction model [J]. Zhang Pin. Applied Soft Computing. 2019
2. HIV-1 protease cleavage site prediction based on two-stage feature selection method [J]. Niu B., Yuan X.-C., Roeper P. Protein and peptide letters. 2013, No. 3
3. Prediction of binding for a kind of non-peptic HCV NS3 serine protease inhibitors from plants by molecular docking and MM-PBSA method [J]. Xudong Li, Wei Zhang, Xuebin Qiao. Bioorganic and medicinal chemistry. 2007, No. 1
4. The impact of feature selection methods on machine learning-based docking prediction of Indonesian medicinal plant compounds and HIV-1 protease [C]. Rahman Pujianto, Yohanes Gultom, Ari Wibisono. International Conference on Advanced Computer Science and Information Systems. 2019
5. The taste and smell of Taban Kenyah (Kenyah medicine): An exploration of chemosensory selection criteria for medicinal plants among the Kenyah Leppo` Ke of East Kalimantan, Borneo, Indonesia [D]. Gollin, Lisa. 2001
6. A Consistency-Based Feature Selection Method Allied with Linear SVMs for HIV-1 Protease Cleavage Site Prediction [O]. Orkun Öztürk, Alper Aksaç, Abdallah Elsheikh
7. Prediction of SARS-CoV-2 Main Protease Inhibitors from Several Medicinal Plant Compounds by Drug Repurposing and Molecular Docking Approach [O]. Sayma Farabi, Nihar Ranjan Saha, Noushin Anika Khan. 2020
