...
首页> 外文期刊>R News >Weighted Distance Based Discriminant Analysis: The R Package WeDiBaDis
【24h】

Weighted Distance Based Discriminant Analysis: The R Package WeDiBaDis

机译:基于加权距离的判别分析:R包WeDiBaDis

获取原文
           

摘要

The WeDiBaDis package provides a user friendly environment to perform discriminant analysis (supervised classification). WeDiBaDis is an easy to use package addressed to the biological and medical communities, and in general, to researchers interested in applied studies. It can be suitable when the user is interested in the problem of constructing a discriminant rule on the basis of distances between a relatively small number of instances or units of known unbalanced-class membership measured on many (possibly thousands) features of any type. This is a current situation when analyzing genetic biomedical data. This discriminant rule can then be used both, as a means of explaining differences among classes, but also in the important task of assigning the class membership for new unlabeled units. Our package implements two discriminant analysis procedures in an R environment: the well-known distance-based discriminant analysis (DB-discriminant) and a weighted distance-based discriminant (WDB-discriminant), a novel classifier rule that we introduce. This new procedure is based on an improvement of the DB rule taking into account the statistical depth of the units. This article presents both classifying procedures and describes the implementation of each in detail. We illustrate the use of the package using an ecological and a genetic experimental example. Finally, we illustrate the effectiveness of the new proposed procedure (WDB), as compared with DB. This comparison is carried out using thirty-eight, high-dimensional, class-unbalanced, cancer data sets, three of which include clinical features.
机译:WeDiBaDis软件包提供了一个用户友好的环境来执行判别分析(监督分​​类)。 WeDiBaDis是一个易于使用的软件包,面向生物学和医学界,一般来说,也面向对应用研究感兴趣的研究人员。当用户对基于相对较少数量的实例或已知不平衡级别成员资格的单元之间的距离(基于任何类型的许多(可能是数千个)特征进行测量)构造判别规则的问题感兴趣时,它可能是合适的。这是分析遗传生物医学数据时的当前情况。然后,该判别规则既可以用作解释类之间差异的一种手段,也可以用于为新的未标记单元分配类成员资格的重要任务。我们的程序包在R环境中实现了两个判别分析过程:众所周知的基于距离的判别分析(DB判别式)和加权基于距离的判别式(WDB判别式),这是我们引入的一种新颖的分类器规则。此新过程基于DB规则的改进,同时考虑了单位的统计深度。本文介绍了两种分类过程,并详细描述了每种分类的实现。我们通过一个生态和遗传实验示例来说明该软件包的使用。最后,我们说明了新提议的程序(WDB)与DB相比的有效性。使用38个高维,类别不平衡的癌症数据集进行了比较,其中三个数据包括临床特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号