Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model

Haiqun Ma; Tao Zhang

首页> 外文期刊>Journal of Advanced Computatioanl Intelligence and Intelligent Informatics >Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model

【24h】

Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model

机译：基于LDA-GIBBS模型的策略文本聚类算法研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Policy text contains large amount of diversified data and strictly conforms to standards and specifications, but the traditional text clustering method cannot solve the problems of high dimensionality, sparse features, and similar meanings, so this paper proposes a weighted algorithm based on the LDA-Gibbs model to improve the accuracy of policy text clustering. Firstly, it provides realistic basis for the assumptions of the LDA-Gibbs topic model and the weighted algorithm; secondly, it pre-processes the existing policy text simulated data, establishes the LDA-Gibbs model, forms a weighted algorithm, and generates training data to determine the number of optimal topics in the LDA-Gibbs model and completes the final clustering of the policy text; finally, by summarizing, classifying and deducing the conclusions of the experimental data, this paper proves the objective validity and effects of this method. Hopefully the overall design of this method can be applied in the prospective study on the formulation of new policies in the future, the retrospective evaluation and testing of the existing policies and the formation of a two-way interactive mechanism.

机译：策略文本包含大量多样化数据，严格符合标准和规范，但传统的文本聚类方法无法解决高维度，稀疏功能和类似含义的问题，因此本文提出了一种基于LDA-GIBBS的加权算法提高政策文本聚类准确性的模型。首先，它为LDA-GIBBS主题模型和加权算法的假设提供了现实基础;其次，它预先处理现有的策略文本模拟数据，建立LDA-GIBBS模型，形成加权算法，并生成培训数据，以确定LDA-GIBBS模型中的最佳主题的数量，并完成策略的最终聚类文本;最后，通过总结，分类和推导实验数据的结论，本文证明了这种方法的客观有效性和影响。希望这种方法的整体设计可以应用于未来新政策的准入研究，回顾性评估和测试现有政策以及双向互动机制的形成。

著录项

来源
《Journal of Advanced Computatioanl Intelligence and Intelligent Informatics》 |2019年第136期|共6页
作者
Haiqun Ma; Tao Zhang;
展开▼
作者单位

Center for Russian Language Literature and Culture Heilongjiang University;

Information and Network Center Heilongjiang University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类其他计算机;
关键词
LDA-Gibbs; Topic model; Text clustering; Weighted algorithm;

机译：LDA-GIBBS;主题模型;文本聚类;加权算法;

相似文献

外文文献
中文文献
专利

1. Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model [J] . Haiqun Ma, Tao Zhang Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2019,第2a136期

机译：基于LDA-GIBBS模型的策略文本聚类算法研究
2. The Feature Selection Method based on Genetic Algorithm for Efficient of Text Clustering and Text Classification [J] . Sung-Sam Hong, Wanhee Lee, Myung-Mook Han International Journal of Advances in Soft Computing and Its Applications . 2015,第1aSpecial期

机译：基于遗传算法的高效文本聚类和分类的特征选择方法
3. Clustering Short Text using a Centroid-Based Lexical Clustering Algorithm [J] . Khaled Abdalgader IAENG Internaitonal journal of computer science . 2017,第4a2a期

机译：使用基于质心的词法聚类算法对短文本进行聚类
4. A text clustering algorithms based on hidden Markov model [C] . Liwei, Limeian WASE Global Conference on Science Engineering . 2012

机译：基于隐马尔可夫模型的文本聚类算法
5. An Evaluation of Clustering Algorithms for Modeling Game-Based Assessment Work Processes [D] . Fossey, William Austin. 2017

机译：用于建模基于游戏的评估工作过程的聚类算法的评估
6. Analysis of Parallel Algorithms on SMP Node and Cluster of Workstations Using Parallel Programming Models with New Tile-based Method for Large Biological Datasets [O] . D. D. Shrimankar, S. R. Sathe 2016

机译：大型生物数据集基于新图块的并行编程模型对SMP节点和工作站集群的并行算法进行分析
7. Design and Application of a Text Clustering Algorithm Based on Parallelized K-Means Clustering [O] . Hui Wang, Chengdong Zhou, Leixiao Li 2019

机译：基于并行k均值聚类的文本聚类算法的设计与应用

Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model

摘要

著录项

相似文献

相关主题

期刊订阅