A Model-Based Approach for Discrete Data Clustering and Feature Weighting Using MAP and Stochastic Complexity

Bouguila Nizar

Abstract

In this paper, we consider the problem of unsupervised discrete feature selection/weighting. Discrete data are an important component in many data mining, machine learning, image processing, and computer vision applications. However, much of the published work on unsupervised feature selection has concentrated on continuous data. We propose a probabilistic approach that assigns relevance weights to discrete features, which are treated as random variables modeled by finite discrete mixtures. The choice of finite mixture models is justified by their flexibility, which has led to their widespread application in different domains. To learn the model, we consider both Bayesian and information-theoretic approaches through stochastic complexity. Experimental results illustrate the feasibility and merits of our approach on a difficult problem: clustering and recognizing visual concepts in different image data. The proposed approach is also successfully applied to text clustering.
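
As a rough illustration of two ingredients the abstract names (a finite mixture of discrete distributions, and MAP estimation under a Dirichlet prior), the following is a minimal sketch and not the paper's algorithm. It fits a plain multinomial mixture by EM with MAP updates. It omits the feature relevance weights and the stochastic-complexity model selection entirely. The function names, the symmetric prior parameter `alpha`, the random initialization, and the toy count vectors are all assumptions made for this example.

```python
import math
import random


def map_multinomial(counts, alpha=2.0):
    """MAP estimate of multinomial probabilities under a symmetric
    Dirichlet(alpha) prior. Assumes alpha > 1 so every probability is > 0."""
    if alpha <= 1.0:
        raise ValueError("this sketch assumes alpha > 1")
    total = sum(counts) + len(counts) * (alpha - 1.0)
    return [(c + alpha - 1.0) / total for c in counts]


def em_multinomial_mixture(docs, n_clusters, n_iter=50, alpha=2.0, seed=0):
    """EM for a mixture of multinomials over count vectors `docs`.
    Returns (mixing weights, per-cluster probabilities, responsibilities)."""
    rng = random.Random(seed)
    n_features = len(docs[0])

    # Random soft initialization of the responsibilities (an assumption).
    resp = []
    for _ in docs:
        row = [rng.random() + 1e-3 for _ in range(n_clusters)]
        norm = sum(row)
        resp.append([r / norm for r in row])

    for _ in range(n_iter):
        # M-step: MAP updates from the expected counts.
        cluster_mass = [sum(r[j] for r in resp) for j in range(n_clusters)]
        weights = map_multinomial(cluster_mass, alpha)
        thetas = []
        for j in range(n_clusters):
            expected = [
                sum(r[j] * d[v] for r, d in zip(resp, docs))
                for v in range(n_features)
            ]
            thetas.append(map_multinomial(expected, alpha))

        # E-step: posterior cluster probabilities, computed in log space.
        new_resp = []
        for d in docs:
            logp = [
                math.log(weights[j])
                + sum(c * math.log(thetas[j][v]) for v, c in enumerate(d))
                for j in range(n_clusters)
            ]
            top = max(logp)
            unnorm = [math.exp(lp - top) for lp in logp]
            norm = sum(unnorm)
            new_resp.append([u / norm for u in unnorm])
        resp = new_resp

    return weights, thetas, resp


if __name__ == "__main__":
    # Hypothetical toy data: four count vectors over three discrete features.
    toy = [[5, 0, 0], [4, 1, 0], [0, 0, 5], [0, 1, 4]]
    w, th, r = em_multinomial_mixture(toy, n_clusters=2)
    print("mixing weights:", w)
    print("responsibilities:", r)
```

With `alpha > 1` the Dirichlet prior acts as additive smoothing, so no estimated probability is ever zero and the logarithms in the E-step stay finite. EM can converge to a local optimum, so results depend on the initialization.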

Bibliographic details

Source: Knowledge and Data Engineering, IEEE Transactions on, 2009, no. 12, pp. 1649-1664 (16 pages)
Author: Bouguila Nizar
Affiliation: Concordia University, Montreal
Original format: PDF
Language: English
Keywords: Dirichlet prior; Discrete data; Fisher kernel; MAP; feature weighting/selection; finite mixture models; image databases; multinomial; stochastic complexity; text clustering
