Utilizing unsupervised learning to cluster data in the Bayesian Data Reduction Algorithm

机译：在贝叶斯数据约简算法中利用无监督学习对数据进行聚类

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, unsupervised learning is utilized to illustrate the ability of the Bayesian Data Reduction Algorithm (BDRA) to cluster unlabeled training data. The BDRA is based on the assumption that the discrete symbol probabilities of each class are a priori uniformly Dirichlet distributed, and it employs a "greedy" approach (similar to a backward sequential feature search) for reducing irrelevant features from the training data of each class. Notice that reducing irrelevant features is synonymous here with selecting those features that provide best classification performance; the metric for making data reducing decisions is an analytic formula for the probability of error conditioned on the training data. The contribution of this work is to demonstrate how clustering performance varies depending on the method utilized for unsupervised training. To illustrate performance, results are demonstrated using simulated data. In general, the results of this work have implications for rinding clusters in data mining applications.

机译：在本文中，无监督学习被用来说明贝叶斯数据约简算法（BDRA）对未标记训练数据进行聚类的能力。 BDRA基于以下假设：每个类别的离散符号概率是先验均匀Dirichlet分布的，并且它采用“贪婪”方法（类似于向后顺序特征搜索）来减少来自每个类别的训练数据中的不相关特征。注意，在这里，减少不相关的特征是选择那些提供最佳分类性能的特征的同义词。做出减少数据决策的度量标准是针对以训练数据为条件的错误概率的解析公式。这项工作的目的是演示聚类性能如何根据用于无监督训练的方法而变化。为了说明性能，使用模拟数据演示了结果。总的来说，这项工作的结果对数据挖掘应用程序中的集群集群具有影响。

著录项

来源
《Data Mining, Intrusion Detection, Information Assurance, and Data Networks Security 2005》|2005年|P.158-167|共10页
会议地点 OrlandoFL(US)
作者
Robert S. Lynch; Jr.; Peter K. Willett;
展开▼
作者单位

Signal Processing Branch, Naval Undersea Warfare Center, Newport, RI, U.S.A.;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;
关键词
adaptive classification; noninformative prior; discrete data; unknown data distribution;

机译：自适应分类;非先验信息;离散数据;未知数据分布;

相似文献

外文文献
中文文献
专利

1. UNSUPERVISED LEARNING OF BAYESIAN NETWORKS VIA ESTIMATION OF DISTRIBUTION ALGORITHMS: AN APPLICATION TO GENE EXPRESSION DATA CLUSTERING [J] . J. M. PENA, J. A. LOZANO, P. LARRANAGA International Journal of Uncertainty, Fuzziness, and Knowledge-based Systems . 2004,第JanaSupplement期

机译：通过分布算法估算贝叶斯网络的无监督学习：在基因表达数据聚类中的应用
2. Unsupervised Learning from Multi-Dimensional Data: A Fast Clustering Algorithm Utilizing Canopies and Statistical Information [J] . Giyasettin Ozcan International Journal of Information Technology & Decision Making . 2018,第3期

机译：从多维数据中无监督学习：利用檐篷和统计信息的快速聚类算法
3. Clustering fMRI data with a robust unsupervised learning algorithm for neuroscience data mining [J] . Aljobouri Hadeel K., Jaber Hussain A., Kocak Orhan M., Journal of Neuroscience Methods . 2018,第期

机译：用强大的无监督学习算法聚类FMRI数据，用于神经科学数据挖掘
4. Bayesian ying-yang system and theory as a unified statistical learning approach: (III) models and algorithms for dependence reduction, data dimension reduction, ICA and supervised learning [C] . Lei Xu International workshop on theoretical aspects of neural computation : A multidisciplinary perspective . 1998

机译：Bayesian Ying-Yang系统和理论作为统一统计学习方法：（iii）依赖性减少，数据维度减少，ICA和监督学习的模型和算法
5. Supervised precision ordinal clustering – A human-machine learning algorithm to create accurate clusters in big datasets: Application to indiana water quality data with novel visualization techniques [D] . Singh, Sarabjit 2014

机译：有监督的有序序数聚类–一种人机学习算法，可在大型数据集中创建准确的聚类：采用新颖的可视化技术应用于印第安纳州水质数据
6. web-rMKL: a web server for dimensionality reduction and sample clustering of multi-view data based on unsupervised multiple kernel learning [O] . Benedict Röder, Nicolas Kersten, Marius Herr, 2019

机译：web-rMKL：一种基于无监督多核学习的降维和多视图数据样本聚类的Web服务器
7. Uncertainty Reduction of Unlabeled Features in Landslide Inventory Using Machine Learning t-SNE Clustering and Data Mining Apriori Association Rule Algorithms [O] . Omar F. Althuwaynee, Ali Aydda, In-Tak Hwang, 2021

机译：使用机器学习T-SNE聚类和数据挖掘APRISI关联规则算法的滑坡库存未标记特征的不确定性降低

Utilizing unsupervised learning to cluster data in the Bayesian Data Reduction Algorithm

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅