首页> 外文会议> >Subtractive Clustering Analysis: A Novel Data Mining Method for Finding Cell Subpopulations
【24h】

Subtractive Clustering Analysis: A Novel Data Mining Method for Finding Cell Subpopulations

机译:减法聚类分析:一种寻找细胞亚群的新型数据挖掘方法

获取原文
获取原文并翻译 | 示例

摘要

A novel data mining program called "subtractive clustering" picks out the most important differences between two or more flow cytometry listmode data files. While making no assumptions about the data, the program uses a variable weight and skew metric in the determination of bin size allowing for subtractive clustering of data without the need for bit-reduction or projection. In contrast, other subtraction methods, such as channel-by-channel subtraction, are dependent upon dimensionality and resolution, which can lead to an overestimation of positive cells because they do not account for the overall distribution of the test and control data sets. By taking into account human visual inspection of the data it is possible for the experimenter to choose an optimal subtraction by choosing an appropriate weight and skew metric, but without allowing direct modification of the results. By maximizing a bin size which can still differentiate clusters, it is possible to minimize computation while still removing data. The choice of control weight allows for different levels of bin destruction during the subtraction stage, the smaller the number the more conservative the subtraction, the larger, the more liberal. Three data sets illustrate full dimensional subtraction, single step biological data and multi-stage subtraction to show definitive test results. Subtractive clustering was able to conservatively remove control information leaving populations of interest. Subtractive clustering provides a powerful comparison of clusters and is a first step for finding non-obvious (hidden) differences and minimizing human prejudice during the analysis.
机译:一种新颖的数据挖掘程序,称为“减法聚类”,可以找出两个或多个流式细胞术列表模式数据文件之间最重要的区别。在不假设数据的情况下,该程序使用可变权重和偏度度量来确定bin的大小,从而允许对数据进行减法聚类,而无需进行位缩减或投影。相反,其他减法,例如逐通道减法,则取决于维数和分辨率,这可能导致对阳性细胞的高估,因为它们没有考虑测试和控制数据集的总体分布。通过考虑数据的人眼检查,实验人员可以通过选择适当的权重和偏度度量来选择最佳减法,但不允许直接修改结果。通过最大化仍可区分群集的bin大小,可以最小化计算,同时仍删除数据。控制权重的选择允许在减法阶段对垃圾箱进行不同程度的破坏,数值越小,减法越保守,越大,越自由。三个数据集说明了全维减法,单步生物学数据和多阶段减法以显示确定的测试结果。减法聚类能够保守地删除控制信息,从而使目标群体感兴趣。减法聚类提供了强大的聚类比较,是查找非明显(隐藏)差异并在分析过程中最大程度减少人类偏见的第一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号