...
首页> 外文期刊>Science Advances >A trainable clustering algorithm based on shortest paths from density peaks
【24h】

A trainable clustering algorithm based on shortest paths from density peaks

机译:一种基于密度峰值最短路径的可训练聚类算法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Clustering is a technique to analyze empirical data, with a major application for biomedical research. Essentially, clustering finds groups of related points in a dataset. However, results depend on both metrics for point-to-point similarity and rules for point-to-group association. Non-appropriate metrics and rules can lead to artifacts, especially in case of multiple groups with heterogeneous structure. In this work, we propose a clustering algorithm that evaluates the properties of paths between points (rather than point-to-point similarity) and solves a global optimization problem, finding solutions not obtainable by methods relying on local choices. Moreover, our algorithm is trainable. Hence, it can be adapted and adopted for specific datasets and applications by providing examples of valid and invalid paths to train a path classifier. We demonstrate its applicability to identify heterogeneous groups in challenging synthetic datasets, segment highly nonconvex immune cells in confocal microscopy images, and classify arrhythmic heartbeats in electrocardiographic signals.
机译:聚类是一种分析经验数据的技术,具有生物医学研究的主要应用。基本上,群集在数据集中查找相关点组。但是,结果取决于点对点相似性和指向组关联规则的指标。非合适的指标和规则可以导致工件,特别是在具有异质结构的多个组的情况下。在这项工作中,我们提出了一种聚类算法,该算法评估点(而不是点对点相似度)之间的路径属性,并解决全局优化问题,找到未通过依赖于本地选择的方法获得的解决方案。此外,我们的算法是可训练的。因此,通过提供有效和无效路径的示例,可以对特定数据集和应用程序进行调整和采用,以培训路径分类器。我们展示其适用性识别挑战合成数据集中的异质组,在共聚焦显微镜图像中分段高度非凸起免疫细胞,并在心电图信号中分类心律失常心跳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号