首页> 外文期刊>Journal of Universal Computer Science >The Modified IPrincipal Component Analysis/I Feature Extraction Method for the Task of Diagnosing Chronic Lymphocytic Leukemia Type B-CLL
【24h】

The Modified IPrincipal Component Analysis/I Feature Extraction Method for the Task of Diagnosing Chronic Lymphocytic Leukemia Type B-CLL

机译:改进的主成分分析特征提取方法,用于诊断慢性淋巴细胞白血病B-CLL的任务

获取原文
           

摘要

The vast majority of medical problems are characterised by the relatively high spatial dimensionality of the task, which becomes problematic for many classic pattern recognition algorithms due to the well-known phenomenon of the curse of dimensionality. This creates the need to develop methods of space reduction, divided into strategies for the selection and extraction of features. The most commonly used tool of the second group is the PCA , which, unlike selection methods, does not select a subset of the original set of features and performs its mathematical transformation into a less dimensional form. However, natural downside of this algorithm is the fact that class context is not present in supervised learning tasks. This work proposes a feature extraction algorithm using the approach of the pca method, trying not only to reduce the feature space, but also trying to separate the class distributions in the available learning set. The problematic issue of the work was the creation of a method of feature extraction describing the prognosis for a chronic lymphocytic leukemia type B-CLL , which will be at least as good, or even better than when compared to other quality extractions. The purpose of the research was accomplished for binary and three-class cases in the event in which for verification of extraction quality, five algorithms of machine learning were applied. The obtained results were compared with the application of paired samples Wilcoxon test.
机译:绝大多数医学问题的特征在于任务的相对高的空间维度,这对于许多经典模式识别算法而导致由于维数的众所周知的现象而产生问题。这将创建开发空间减少方法,分为选择和提取功能的策略。第二组最常用的工具是PCA,与选择方法不同,不选择原始特征集的子集,并将其数学转换执行成较少的维度。然而,该算法的自然缺点是课堂上下文不存在于监督的学习任务中的事实。这项工作提出了一种使用PCA方法的方法的特征提取算法,不仅尝试减少特征空间,还尝试将类分布分离在可用的学习集中。作品的问题问题是创建一种特征提取方法,其描述慢性淋巴细胞白血病B-CLL的预后,这将至少与其他质量提取相比一样好,甚至更好。该研究的目的是为二元和三类案例完成的,在其中用于验证提取质量的事件中,应用了五种机器学习算法。将得到的结果与配对样品的应用进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号