Model-Based Clustering for Image Segmentation and Large Datasets via Sampling

Ron Wehrens; Lutgarde M.C. Buydens; Chris Fraley; Adrian E. Raftery

首页> 外文期刊>Journal of Classification >Model-Based Clustering for Image Segmentation and Large Datasets via Sampling

【24h】

Model-Based Clustering for Image Segmentation and Large Datasets via Sampling

机译：通过采样进行图像分割和大数据集的基于模型的聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The rapid increase in the size of data sets makes clustering all the more important to capture and summarize the information, at the same time making clustering more difficult to accomplish. If model-based clustering is applied directly to a large data set, it can be too slow for practical application. A simple and common approach is to first cluster a random sample of moderate size, and then use the clustering model found in this way to classify the remainder of the objects. We show that, in its simplest form, this method may lead to unstable results. Our experiments suggest that a stable method with better performance can be obtained with two straightforward modifications to the simple sampling method: several tentative models are identified from the sample instead of just one, and several EM steps are used rather than just one E step to classify the full data set. We find that there are significant gains from increasing the size of the sample up to about 2,000, but not from further increases. These conclusions are based on the application of several alternative strategies to the segmentation of three different multispectral images, and to several simulated data sets.

机译：数据集大小的迅速增加使聚类对于捕获和汇总信息显得尤为重要，同时使聚类更加难以完成。如果将基于模型的聚类直接应用于大型数据集，则对于实际应用而言可能太慢。一种简单而通用的方法是，首先对中等大小的随机样本进行聚类，然后使用以此方式找到的聚类模型对其余对象进行分类。我们表明，以其最简单的形式，该方法可能导致不稳定的结果。我们的实验表明，通过对简单采样方法进行两个简单的修改，就可以得到一种性能更好的稳定方法：从样本中识别出几个暂定模型，而不仅仅是一个，并且使用了多个EM步骤而不是一个E步骤进行分类完整的数据集。我们发现，将样本的大小增加到大约2,000可带来显着的收益，但不会进一步增加。这些结论基于对三种不同的多光谱图像的分割以及几种模拟数据集的几种替代策略的应用。

著录项

来源
《Journal of Classification》 |2004年第2期|231-253|共23页
作者
Ron Wehrens; Lutgarde M.C. Buydens; Chris Fraley; Adrian E. Raftery;
展开▼
作者单位

Department of Analytical Chemistry Radboud University;

Department of Analytical Chemistry Radboud University;

Department of Statistics University of Washington;

Department of Statistics University of Washington;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Model-based clustering for image segmentation and large datasets via sampling [J] . Wehrens R, Buydens LMC, Fraley C, Journal of classification . 2004,第2期

机译：通过采样对图像分割和大型数据集进行基于模型的聚类
2. Model-based clustering for image segmentation and large datasets via sampling [J] . Wehrens R, Buydens LMC, Fraley C, Journal of classification . 2004,第2期

机译：通过采样对图像分割和大型数据集进行基于模型的聚类
3. Efficient spatial segmentation of large imaging mass spectrometry datasets with spatially aware clustering [J] . Alexandrov Theodore, Kobarg Jan Hendrik . Bioinformatics . 2011,第13期

机译：具有空间感知聚类的大型成像质谱数据集的有效空间分割
4. Segmentation of blood vessels in 3D ultrasound-datasets by a model-based region growing algorithm [C] . Stephanie S Hold, Karin K Hensel, Susanne S Winter, International Society for Computer Assisted Orthopaedic Surgery. Meeting . 2007

机译：基于模型的区域生长算法在3D超声数据集中分割血管
5. Model-Based Image Processing Algorithms for CT Image Reconstruction, Artifact Reduction and Segmentation. [D] . Jin, Pengchong. 2015

机译：用于CT图像重建，伪影减少和分割的基于模型的图像处理算法。
6. Efficient spatial segmentation of large imaging mass spectrometry datasets with spatially aware clustering [O] . Theodore Alexandrov, Jan Hendrik Kobarg -1

机译：具有空间感知聚类的大型成像质谱数据集的有效空间分割
7. Model-based clustering for image segmentation and large datasets via sampling [O] . Ron Wehrens, Lutgarde M. C. Buydens, Chris Fraley, 2009

机译：基于模型的聚类通过采样进行图像分割和大型数据集
8. Model-Based Clustering for Image Segmentation and Large Datasets Via Sampling [R] . Wehrens, R. , Buydens, L. M. , Fraley, C. , 2003

机译：基于模型的聚类图像分割和大数据集通过采样

Model-Based Clustering for Image Segmentation and Large Datasets via Sampling

摘要

著录项

相似文献

相关主题

期刊订阅