A New Text Clustering Method Using Hidden Markov Model

Abstract

Because text data are high-dimensional and semantically related, text clustering remains an important topic in data mining. However, little work has investigated the attributes of the clustering process itself; previous studies have focused only on the characteristics of the text. Treating text clustering as a dynamic, sequential process, we aim to describe it as state transitions for words or documents. Taking the K-means clustering method as an example, we parse the clustering process into several sequences. Building on research into sequential and temporal data clustering, we propose a new text clustering method using a Hidden Markov Model (HMM). Experiments on Reuters-21578 show that this approach provides an accurate clustering partition and achieves better performance than the K-means algorithm.
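
The abstract does not spell out how the K-means process is parsed into sequences, so the following is only a minimal sketch of one possible reading, not the authors' method. It runs plain K-means on toy 2-D points (standing in for document vectors), records each point's cluster label at every iteration as a state sequence, and counts label-to-label transitions. The function names `kmeans_assignment_sequences` and `transition_counts`, the toy data, and the choice of a simple transition count table (rather than a trained HMM) are all assumptions made for illustration.

```python
import random


def kmeans_assignment_sequences(points, k, iterations=5, seed=0):
    """Run plain K-means and record each point's cluster label per iteration.

    Hypothetical helper: the returned list holds one label sequence per point.
    """
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    sequences = [[] for _ in points]
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = []
        for p in points:
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            labels.append(dists.index(min(dists)))
        for seq, label in zip(sequences, labels):
            seq.append(label)
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return sequences


def transition_counts(sequences, k):
    """Count how often label a is followed by label b across all sequences."""
    counts = [[0] * k for _ in range(k)]
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a][b] += 1
    return counts


# Toy data: two well-separated groups standing in for document vectors.
points = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.3), (5.0, 5.1), (5.2, 4.9), (4.8, 5.0)]
sequences = kmeans_assignment_sequences(points, k=2, iterations=4)
counts = transition_counts(sequences, k=2)
```

Sequences like these are the kind of sequential data an HMM could in principle be fitted to; how the paper defines hidden states and observations is not stated in this record.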

Bibliographic details

Source
International Conference on Applications of Natural Language to Information Systems | 2007 | 11 pages
Authors
Yan Fu; Dongqing Yang; Shiwei Tang; Tengjiao Wang; Aiqiang Gao
Original format: PDF
Chinese Library Classification: Information processing

Similar literature

1. Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models [J]. Victor R. L. Shen, Gwo-Jen Chiou, Yi-Nan Lin. Wireless Personal Communications: An International Journal. 2020, No. 3
2. Evaluation of Hidden Semi-Markov Models Training Methods for Greek Emotional Text-to-Speech Synthesis [J]. Alexandros Lazaridis, Iosif Mporas. International Journal of Information Technology and Computer Science. 2013, No. 4
3. Hidden Markov model-based ensemble methods for offline handwritten text line recognition [J]. Bertolami R, Bunke H. Pattern Recognition: The Journal of the Pattern Recognition Society. 2008, No. 11
4. A New Text Clustering Method Using Hidden Markov Model [C]. Yan Fu, Dongqing Yang, Shiwei Tang. International Conference on Applications of Natural Language to Information Systems (NLDB 2007); 27-29 June 2007; Paris (FR). 2007
5. Protein structure analysis and prediction utilizing the Fuzzy Greedy K-means Decision Forest model and Hierarchically-Clustered Hidden Markov Models method [D]. Hudson, Cody Landon. 2013
6. Hidden-Markov methods for the analysis of single-molecule actomyosin displacement data: the variance-Hidden-Markov method [O]. D A Smith, W Steffen, R M Simmons. 2001
7. Evaluation of Hidden Semi-Markov Models Training Methods for Greek Emotional Text-to-Speech Synthesis [O]. Alexandros Lazaridis, Iosif Mporas. 2013
