首页> 外文期刊>Journal of Advanced Computatioanl Intelligence and Intelligent Informatics >Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model
【24h】

Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model

机译:基于LDA-GIBBS模型的策略文本聚类算法研究

获取原文
获取原文并翻译 | 示例
           

摘要

Policy text contains large amount of diversified data and strictly conforms to standards and specifications, but the traditional text clustering method cannot solve the problems of high dimensionality, sparse features, and similar meanings, so this paper proposes a weighted algorithm based on the LDA-Gibbs model to improve the accuracy of policy text clustering. Firstly, it provides realistic basis for the assumptions of the LDA-Gibbs topic model and the weighted algorithm; secondly, it pre-processes the existing policy text simulated data, establishes the LDA-Gibbs model, forms a weighted algorithm, and generates training data to determine the number of optimal topics in the LDA-Gibbs model and completes the final clustering of the policy text; finally, by summarizing, classifying and deducing the conclusions of the experimental data, this paper proves the objective validity and effects of this method. Hopefully the overall design of this method can be applied in the prospective study on the formulation of new policies in the future, the retrospective evaluation and testing of the existing policies and the formation of a two-way interactive mechanism.
机译:策略文本包含大量多样化数据,严格符合标准和规范,但传统的文本聚类方法无法解决高维度,稀疏功能和类似含义的问题,因此本文提出了一种基于LDA-GIBBS的加权算法提高政策文本聚类准确性的模型。首先,它为LDA-GIBBS主题模型和加权算法的假设提供了现实基础;其次,它预先处理现有的策略文本模拟数据,建立LDA-GIBBS模型,形成加权算法,并生成培训数据,以确定LDA-GIBBS模型中的最佳主题的数量,并完成策略的最终聚类文本;最后,通过总结,分类和推导实验数据的结论,本文证明了这种方法的客观有效性和影响。希望这种方法的整体设计可以应用于未来新政策的准入研究,回顾性评估和测试现有政策以及双向互动机制的形成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号