Journal of Mathematical Chemistry

Prediction of the datasets modelability for the building of QSAR classification models by means of the centroid based rivality index

Abstract

The modelability index of a dataset of molecules measures the capacity of the dataset to be modeled using a QSAR algorithm. It predicts the correct classification rate of the dataset by counting, for each molecule, the nearest neighbors that belong to the same class. In this paper, we propose a new measure for predicting the modelability of datasets, based on the nearest-neighbor-based rivality index and the centroid-based rivality index. These indexes take into account the noise that a nearest neighbor belonging to a different class can introduce into the results of the QSAR classification algorithm. Using thirty benchmark datasets, two types of dataset representation, and six different algorithms, we show the excellent behavior of the proposed indexes: the correct classification rate obtained with five-fold cross-validation and the modelability index calculated using the centroid-based rivality index correlate with R² values greater than 0.9.
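
The abstract gives no formulas, so the following is only a minimal Python sketch of the two ideas it names, under stated assumptions. The neighbor-counting index is taken here to be the per-class fraction of molecules whose single nearest neighbor shares their class, averaged over classes. The centroid-based rivality index is assumed to have the form RI = (d_own - d_rival) / (d_own + d_rival), where d_own is the distance from a molecule to the centroid of its own class and d_rival is the distance to the nearest centroid of another class. The derived modelability is assumed to be the fraction of molecules with negative RI. The function names, the Euclidean distance, and the toy descriptor vectors are all hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def euclidean(a, b):
    """Euclidean distance between two descriptor vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def neighbor_modelability(X, y):
    """Per-class fraction of molecules whose nearest neighbor has the
    same class, averaged over classes (assumed neighbor-counting form)."""
    hits, totals = defaultdict(int), defaultdict(int)
    for i, (xi, yi) in enumerate(zip(X, y)):
        others = (j for j in range(len(X)) if j != i)
        nearest = min(others, key=lambda j: euclidean(xi, X[j]))
        totals[yi] += 1
        hits[yi] += int(y[nearest] == yi)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


def centroid_rivality(X, y):
    """Assumed form: RI = (d_own - d_rival) / (d_own + d_rival), in [-1, 1].
    Negative values mean the molecule lies closer to its own class centroid."""
    groups = defaultdict(list)
    for xi, yi in zip(X, y):
        groups[yi].append(xi)
    centroids = {
        c: [sum(col) / len(rows) for col in zip(*rows)]
        for c, rows in groups.items()
    }
    ri = []
    for xi, yi in zip(X, y):
        d_own = euclidean(xi, centroids[yi])
        d_rival = min(
            euclidean(xi, cen) for c, cen in centroids.items() if c != yi
        )
        denom = d_own + d_rival
        ri.append((d_own - d_rival) / denom if denom else 0.0)
    return ri


def rivality_modelability(ri):
    """Assumed form: fraction of molecules with a negative rivality index."""
    return sum(1 for v in ri if v < 0) / len(ri)


# Toy data (hypothetical): two classes in a 2-D descriptor space.
# The last molecule is labeled 1 but sits nearer to the class-0 cluster.
X = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [5.0, 5.0], [5.0, 6.0], [2.0, 2.0]]
y = [0, 0, 0, 1, 1, 1]

ri = centroid_rivality(X, y)
print("neighbor-based modelability:", neighbor_modelability(X, y))
print("centroid rivality indexes:", [round(v, 3) for v in ri])
print("rivality-based modelability:", rivality_modelability(ri))
```

In this toy set only the last molecule receives a positive rivality value, because it lies closer to the rival centroid than to its own. Under the assumptions above, such molecules are the ones expected to lower the correct classification rate.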
