...
首页> 外文期刊>Bioinformatics >Pooled variable scaling for cluster analysis
【24h】

Pooled variable scaling for cluster analysis

机译:集群分析的汇总变量缩放

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: Many popular clustering methods are not scale-invariant because they are based on Euclidean distances. Even methods using scale-invariant distances, such as the Mahalanobis distance, lose their scale invariance when combined with regularization and/or variable selection. Therefore, the results from these methods are very sensitive to the measurement units of the clustering variables. A simple way to achieve scale invariance is to scale the variables before clustering. However, scaling variables is a very delicate issue in cluster analysis: A bad choice of scaling can adversely affect the clustering results. On the other hand, reporting clustering results that depend on measurement units is not satisfactory. Hence, a safe and efficient scaling procedure is needed for applications in Bioinformatics and medical sciences research.
机译:动机:许多流行的聚类方法不是鳞片不变的,因为它们基于欧几里德距离。 甚至使用比例不变距离(例如Mahalanobis距离)的方法,在与正则化和/或变量选择结合时丢失了尺度不变性。 因此,来自这些方法的结果对聚类变量的测量单元非常敏感。 实现缩放不变性的简单方法是在群集之前缩放变量。 但是,缩放变量是集群分析中非常细致的问题:缩放的糟糕选择可能会对聚类结果产生不利影响。 另一方面,报告依赖于测量单元的聚类结果并不令人满意。 因此,生物信息学和医学科学研究中需要安全和有效的缩放程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号