首页> 外文会议> >Subtractive Clustering Analysis: A Novel Data Mining Method for Finding Cell Subpopulations

【24h】

Subtractive Clustering Analysis: A Novel Data Mining Method for Finding Cell Subpopulations

机译：减法聚类分析：一种寻找细胞亚群的新型数据挖掘方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

A novel data mining program called "subtractive clustering" picks out the most important differences between two or more flow cytometry listmode data files. While making no assumptions about the data, the program uses a variable weight and skew metric in the determination of bin size allowing for subtractive clustering of data without the need for bit-reduction or projection. In contrast, other subtraction methods, such as channel-by-channel subtraction, are dependent upon dimensionality and resolution, which can lead to an overestimation of positive cells because they do not account for the overall distribution of the test and control data sets. By taking into account human visual inspection of the data it is possible for the experimenter to choose an optimal subtraction by choosing an appropriate weight and skew metric, but without allowing direct modification of the results. By maximizing a bin size which can still differentiate clusters, it is possible to minimize computation while still removing data. The choice of control weight allows for different levels of bin destruction during the subtraction stage, the smaller the number the more conservative the subtraction, the larger, the more liberal. Three data sets illustrate full dimensional subtraction, single step biological data and multi-stage subtraction to show definitive test results. Subtractive clustering was able to conservatively remove control information leaving populations of interest. Subtractive clustering provides a powerful comparison of clusters and is a first step for finding non-obvious (hidden) differences and minimizing human prejudice during the analysis.

机译：一种新颖的数据挖掘程序，称为“减法聚类”，可以找出两个或多个流式细胞术列表模式数据文件之间最重要的区别。在不假设数据的情况下，该程序使用可变权重和偏度度量来确定bin的大小，从而允许对数据进行减法聚类，而无需进行位缩减或投影。相反，其他减法，例如逐通道减法，则取决于维数和分辨率，这可能导致对阳性细胞的高估，因为它们没有考虑测试和控制数据集的总体分布。通过考虑数据的人眼检查，实验人员可以通过选择适当的权重和偏度度量来选择最佳减法，但不允许直接修改结果。通过最大化仍可区分群集的bin大小，可以最小化计算，同时仍删除数据。控制权重的选择允许在减法阶段对垃圾箱进行不同程度的破坏，数值越小，减法越保守，越大，越自由。三个数据集说明了全维减法，单步生物学数据和多阶段减法以显示确定的测试结果。减法聚类能够保守地删除控制信息，从而使目标群体感兴趣。减法聚类提供了强大的聚类比较，是查找非明显（隐藏）差异并在分析过程中最大程度减少人类偏见的第一步。

著录项

来源
《》|2005年|P.354-361|共8页
会议地点 San JoseCA(US)
作者
Jacob N. Smith; Lisa Reece; Peter Szaniszlo; Leary; Rosemary C. Leary; James F. Leary;
展开▼
作者单位

University of Texas Medical Branch at Galveston, Molecular Cytometry Unit, Division of Infectious Diseases, Galveston, Texas;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类光、激光生物医学;
关键词
exploratory data analysis; data mining; subtractive clustering; flow cytometry;

机译：探索性数据分析;数据挖掘;减法聚类;流式细胞仪;

相似文献

外文文献
中文文献
专利

1. A comparison of three clustering methods for finding subgroups in MRI, SMS or clinical data: SPSS TwoStep Cluster analysis, Latent Gold and SNOB [J] . Peter Kent, Rikke K Jensen, Alice Kongsted BMC Medical Research Methodology . 2014,第1期

机译：三种用于在MRI，SMS或临床数据中查找亚组的聚类方法的比较：SPSS TwoStep聚类分析，Latent Gold和SNOB
2. Finding Relevant Parameters for the Thin-film Photovoltaic Cells Production Process with the Application of Data Mining Methods [J] . Ulaczyk Jan, Morawiec Krzysztof, Zabierowski Pawel, Molecular informatics . 2017,第9期

机译：通过应用数据采矿方法找到薄膜光伏电池生产过程的相关参数
3. A visual data-mining methodology for seismic facies analysis: Part 1 _ Testing and comparison with other unsupervised clustering methods [J] . lván Dimitri Marroqu_n, Jean-Jules Brault, Bruce S. Hart Geophysics: Journal of the Society of Exploration Geophysicists . 2009,第1期

机译：一种用于地震相分析的可视数据挖掘方法：第1部分_测试和与其他无监督聚类方法的比较
4. Subtractive Clustering Analysis: A Novel Data Mining Method for Finding Cell Subpopulations [C] . Jacob N. Smith, Lisa Reece, Peter Szaniszlo, Society of Photo-Optical Instrumentation Engineers (SPIE);SPIE Proceedings . 2005

机译：减法聚类分析：一种寻找细胞亚群的新型数据挖掘方法
5. Statistical Methods for Analyzing DNA Methylation Data and Subpopulation Analysis of Continuous, Binary and Count Data for Clinical Trials [D] . Yip, Wai-Ki 2015

机译：用于临床试验的DNA甲基化数据分析和连续，二进制和计数数据亚群分析的统计方法
6. A comparison of three clustering methods for finding subgroups in MRI SMS or clinical data: SPSS TwoStep Cluster analysis Latent Gold and SNOB [O] . Peter Kent, Rikke K Jensen, Alice Kongsted 2014

机译：三种用于在MRISMS或临床数据中查找亚组的聚类方法的比较：SPSS TwoStep聚类分析Latent Gold和SNOB
7. Clustering of Cardiovascular Disease Patients Using Data Mining Techniques with Principal Component Analysis and K-Medoids Clustering of Cardiovascular Disease Patients Using Data Mining Techniques with Principal Component Analysis and K-Medoids [O] . Edy Irwansyah, Ebiet Salim Pratama, Margaretha Ohyver 2020

机译：使用具有主成分分析和K-yemoids的数据挖掘患者使用数据挖掘技术的心血管疾病患者使用数据挖掘技术和K-MEDOIDS患者K-MEDOIDS患者
8. Cluster Analysis-Based Approaches for Geospatiotemporal Data Mining of Massive Data Sets for Identification of Forest Threats. [R] . Mills, R. T., Hoffman, F. M., Kumar, J., 2011

机译：基于聚类分析的海量数据集地理时空数据挖掘方法用于森林威胁识别。

Subtractive Clustering Analysis: A Novel Data Mining Method for Finding Cell Subpopulations

摘要

著录项

相似文献

相关主题

期刊订阅