首页> 外文会议>Bioinformatics Research and Applications >Hierarchical Clustering Support Vector Machines for Classifying Type-2 Diabetes Patients
【24h】

Hierarchical Clustering Support Vector Machines for Classifying Type-2 Diabetes Patients

机译:用于分类2型糖尿病患者的分层聚类支持向量机

获取原文
获取原文并翻译 | 示例

摘要

Using a large national health database, we propose an enhanced SVM-based model called Hierarchical Clustering Support Vector Machine (HCSVM) that utilizes multiple levels of clusters to classify patients diagnosed with type-2 diabetes. Multiple HCSVMs are trained for clusters at different levels of the hierarchy. Some clusters at certain levels of the hierarchy capture more separable sample spaces than the others. As a result, HCSVMs at different levels may develop different classification capabilities. Since the locations of the superior SVMs are data dependent, the HCSVM model in this study takes advantage of an adaptive strategy to select the most suitable HCSVM for classifying the testing samples. This model solves the large data set problem inherent with the traditional single SVM model because the entire data set is partitioned into smaller and more homogenous clusters. Other approaches also use clustering and multiple SVM to solve the problem of large datasets. These approaches typical employed only one level of clusters. However, a single level of clusters may not provide an optimal partition of the sample space for SVM trainings. On the contrary, HCSVMs utilize multiple partitions available in a multilevel tree to capture a more separable sample space for SVM trainings. Compared with the traditional single SVM model and one-level multiple SVMs model, the HCSVM Model markedly improves the accuracy for classifying testing samples.
机译:使用大型国家卫生数据库,我们提出了一种增强的基于SVM的模型,称为层次聚类支持向量机(HCSVM),该模型利用多级聚类对诊断为2型糖尿病的患者进行分类。对多个HCSVM进行了针对层次结构不同级别上的群集的培训。层次结构某些级别上的某些群集比其他群集捕获更多的可分离样本空间。结果,不同级别的HCSVM可能会开发不同的分类功能。由于上级支持向量机的位置取决于数据,因此本研究中的HCSVM模型利用自适应策略来选择最合适的HCSVM来对测试样本进行分类。该模型解决了传统单一SVM模型固有的大数据集问题,因为整个数据集被划分为更小且更同质的群集。其他方法也使用聚类和多个SVM解决大型数据集的问题。这些方法通常仅使用一个级别的集群。但是,单个级别的群集可能无法为SVM训练提供最佳的样本空间分区。相反,HCSVM利用多级树中的多个分区来捕获更分离的样本空间,以进行SVM训练。与传统的单SVM模型和一级多SVM模型相比,HCSVM模型显着提高了测试样本分类的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号