首页> 外文会议>2012 12th International Conference on Intelligent Systems Design and Applications. >Model evaluation of datasets using critical dimension model invariants
【24h】

Model evaluation of datasets using critical dimension model invariants

机译:使用关键维模型不变式的数据集模型评估

获取原文
获取原文并翻译 | 示例

摘要

Critical dimension is the minimum number of features that is required to ensure the performance of a learning machine to be “high”. This critical dimension is usually unique to the learning machine and the ranking algorithm combination. Medical- and bio-informatics datasets are different from most other datasets in that there is an imbalance in most of these datasets and a high prediction accuracy often depends upon not just the overall accuracy but also the true positive and the false negative rates. To find a medically and bio-informatically accurate critical dimension and for better analysis of such datasets we develop two evaluation models, one using all features and the other using critical number of features. The performance measurements such as accuracy, specificity, sensitivity, area under the curve, F-score and kappa values are compared. This paper shows that at the critical dimension the evaluation model shows good results for all performance measurements measured on most datasets studied. The difference in performance measurements obtained using only critical number and using all features is significantly less, i.e., there is not much difference in sensitivity, specificity and other measurements calculated.
机译:关键尺寸是确保学习机的性能达到“最高”所需的最少功能数。这个关键维度通常是学习机和排名算法组合所特有的。医学和生物信息学数据集与大多数其他数据集的不同之处在于,这些数据集中的大多数都不平衡,并且高预测准确性通常不仅取决于总体准确性,还取决于真实的阳性率和假阴性率。为了找到医学和生物信息学上准确的关键维度,并为了更好地分析此类数据集,我们开发了两个评估模型,一个使用所有特征,另一个使用关键特征数。比较了诸如准确性,特异性,敏感性,曲线下面积,F得分和kappa值等性能指标。本文表明,在关键维度上,评估模型显示了在大多数研究的数据集上进行的所有性能测量的良好结果。仅使用临界数和使用所有功能获得的性能测量值的差异要小得多,即灵敏度,特异性和其他计算得出的测量值差异不大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号