The Accuracy of Fuzzy C-Means in Lower-Dimensional Space for Topic Detection

机译：低维空间中模糊C均值的主题检测精度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Topic detection is an automatic method to discover topics in textual data. The standard methods of the topic detection are nonnegative matrix factorization (NMF) and latent Dirichlet allocation (LDA). Another alternative method is a clustering approach such as a k-means and fuzzy c-means (FCM). FCM extend the k-means method in the sense that the textual data may have more than one topic. However, FCM works well for low-dimensional textual data and fails for high-dimensional textual data. An approach to overcome the problem is transforming the textual data into lower dimensional space, i.e., Eigenspace, and called Eigenspace-based FCM (EFCM). Firstly, the textual data are transformed into an Eigenspace using truncated singular value decomposition. FCM is performed on the eigenspace data to identify the memberships of the textual data in clusters. Using these memberships, we generate topics from the high dimensional textual data in the original space. In this paper, we examine the accuracy of EFCM for topic detection. Our simulations show that EFCM results in the accuracies between the accuracies of LDA and NMF regarding both topic interpretation and topic recall.

机译：主题检测是一种在文本数据中发现主题的自动方法。主题检测的标准方法是非负矩阵分解（NMF）和潜在Dirichlet分配（LDA）。另一种替代方法是聚类方法，例如k均值和模糊c均值（FCM）。 FCM在文本数据可能包含多个主题的意义上扩展了k-means方法。但是，FCM适用于低维文本数据，而不适用于高维文本数据。解决该问题的一种方法是将文本数据转换到低维空间，即本征空间，并称为基于本征空间的FCM（EFCM）。首先，使用截断的奇异值分解将文本数据转换为特征空间。对特征空间数据执行FCM，以识别群集中文本数据的成员资格。使用这些成员资格，我们可以从原始空间中的高维文本数据中生成主题。在本文中，我们检查了EFCM用于主题检测的准确性。我们的仿真表明，EFCM导致LDA和NMF的精度介于主题解释和主题回忆方面。

著录项

来源
《International conference on smart computing and communication》|2018年|321-334|共14页
会议地点
作者
Hendri Murfi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Topic detection; Clustering; Fuzzy c-means; Eigenspace; Accuracy;

机译：主题检测;集群;模糊c均值;特征空间准确性;
入库时间 2022-08-26 13:50:03

相似文献

外文文献
中文文献
专利

1. 基于主题概念空间的文本模糊c-均值聚类方法 [J] . 吉翔华, 陈超, 邵正荣, 东南大学学报（英文版） . 2007,第003期
2. Topic Detection using Fuzzy C-Means with Nonnegative Double Singular Value Decomposition Initialization [J] . Hamimah Alatas, Hendri Murfi, Alhadi Bustamam International Journal of Advances in Soft Computing and Its Applications . 2018,第2期

机译：使用带有非负双奇异值分解初始化的模糊C均值的主题检测
3. Enhanced Forecasting Accuracy of Fuzzy Time Series Model Based on Combined Fuzzy C-Mean Clustering with Particle Swam Optimization [J] . International Journal of Computational Intelligence and Applications . 2020,第2期

机译：基于组合模糊C均值聚类的模糊时间序列模型增强预测精度
4. Human Detection by Fourier Descriptors and Fuzzy Color Histograms with Fuzzy c-Means Method [J] . Shohei Akimoto, Tomokazu Takahashi, Masato Suzuki, Journal of robotics and mechatronics . 2016,第4a164期

机译：傅里叶描述符和模糊色直方图的模糊c均值方法进行人体检测
5. The Accuracy of Fuzzy C-Means in Lower-Dimensional Space for Topic Detection [C] . Hendri Murfi International Conference on Smart Computing and Communication . 2018

机译：主题检测下尺寸空间中模糊C型方式的准确性
6. An Investigation into Fuzzy Clustering Quality and Speed: Fuzzy C-Means with Effective Seeding [D] . Stetco, Mihai Adrian. 2017

机译：模糊聚类质量和速度的研究：有效播种的模糊C均值
7. Skin Cancer Detection Using Kernel Fuzzy C-Means and Improved Neural Network Optimization Algorithm [O] . Jia Huaping, Zhao Junlong, A. M. Norouzzadeh Gil Molk 2021

机译：使用内核模糊C型型和改进神经网络优化算法的皮肤癌检测
8. Eigenspace-based fuzzy c-means for sensing trending topics in Twitter [O] . T. Muliawati, H. Murfi 2017

机译：基于EIGenspace的模糊C-merior，用于在Twitter中传感趋势主题
9. Predictability in space launch vehicle anomaly detection using intelligent neuro-fuzzy systems [R] . Gulati, Sandeep, Toomarian, Nikzad, Barhen, Jacob, 1994

机译：利用智能神经模糊系统进行空间运载火箭异常检测的可预测性

The Accuracy of Fuzzy C-Means in Lower-Dimensional Space for Topic Detection

摘要

著录项

相似文献

相关主题

期刊订阅