Pooled variable scaling for cluster analysis

首页> 外文期刊>Bioinformatics >Pooled variable scaling for cluster analysis

【24h】

Pooled variable scaling for cluster analysis

机译：集群分析的汇总变量缩放

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivation: Many popular clustering methods are not scale-invariant because they are based on Euclidean distances. Even methods using scale-invariant distances, such as the Mahalanobis distance, lose their scale invariance when combined with regularization and/or variable selection. Therefore, the results from these methods are very sensitive to the measurement units of the clustering variables. A simple way to achieve scale invariance is to scale the variables before clustering. However, scaling variables is a very delicate issue in cluster analysis: A bad choice of scaling can adversely affect the clustering results. On the other hand, reporting clustering results that depend on measurement units is not satisfactory. Hence, a safe and efficient scaling procedure is needed for applications in Bioinformatics and medical sciences research.

机译：动机：许多流行的聚类方法不是鳞片不变的，因为它们基于欧几里德距离。甚至使用比例不变距离（例如Mahalanobis距离）的方法，在与正则化和/或变量选择结合时丢失了尺度不变性。因此，来自这些方法的结果对聚类变量的测量单元非常敏感。实现缩放不变性的简单方法是在群集之前缩放变量。但是，缩放变量是集群分析中非常细致的问题：缩放的糟糕选择可能会对聚类结果产生不利影响。另一方面，报告依赖于测量单元的聚类结果并不令人满意。因此，生物信息学和医学科学研究中需要安全和有效的缩放程序。

著录项

来源
《Bioinformatics》 |2020年第12期|共7页
作者

展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类生物工程学（生物技术）;
关键词

相似文献

外文文献
中文文献
专利

1. Pooled variable scaling for cluster analysis [J] . Bioinformatics . 2020,第12期

机译：集群分析的汇总变量缩放
2. Pooled analysis of trials may, in the presence of heterogeneity inadvertently lead to fragile conclusions due to the importance of clinically relevant variables being either hidden or lost when the findings are pooled. [J] . Hull RD, Liang J, Brant R Thrombosis Research: An International Journal on Vascular Obstruction, Hemorrhage and Hemostasis . 2010,第3期

机译：在存在异质性的情况下，对试验的汇总分析可能会因得出结论而将临床相关变量隐藏或丢失的重要性而无意间导致脆弱的结论。
3. Simultaneous cloning of multiple nuclear genes by pooling PCR products of variable size: a cost-effective method of improving efficiency in large-scale genetic analyses [J] . Addison JA Molecular ecology notes . 2007,第3期

机译：通过合并可变大小的PCR产物同时克隆多个核基因：提高大规模遗传分析效率的经济有效方法
4. Linear Fuzzy Clustering of Mixed Databases Based on Cluster-wise Optimal Scaling of Categorical Variables [C] . Honda Katsuhiro, Uesugi Ryo, Ichihashi Hidetomo, IEEE International Conference on Fuzzy Systems . 2007

机译：基于聚类的混合数据库的线性模糊聚类基于集群 - 明智的分类变量的最佳缩放
5. Statistical strategies for scaling and weighting variables for cluster analysis. [D] . Maloor, Srinivas P. 2007

机译：用于聚类分析的缩放和加权变量的统计策略。
6. Sparse cluster analysis of large-scale discrete variables with application to single nucleotide polymorphism data [O] . Baolin Wu -1

机译：大规模离散变量的稀疏聚类分析应用于单核苷酸多态性数据
7. Variable Stars in Clusters and the Distance Scale - Some Recent Results Concerning the LMC Cluster NGC 1866 [O] . Alistair R. Walker, Gabriella Raimondo, Elisa di Carlo 2002

机译：集群变量恒星和距离尺度 - 关于LMC群集NGC 1866的一些最近结果

Pooled variable scaling for cluster analysis

摘要

著录项

相似文献

相关主题

期刊订阅