Conference: International Conference on Intelligent Systems and Knowledge Engineering
Textual topic evolution analysis based on term co-occurrence: A case study on the government work report of the State Council (1954–2017)

Abstract

The government work report of the State Council is a comprehensive policy text. This paper uses text mining to carry out a multi-granularity, multi-level quantitative analysis of the government work reports, which has practical and instructive value for helping relevant personnel understand the evolution of domain knowledge in a short time. First, the text is preprocessed using a Chinese word segmentation tool combined with three dictionaries built by the authors: a domain word dictionary, a domain synonym dictionary, and a domain stopword dictionary. Then, based on the co-occurrence information of words in the government work reports, we attempt to conduct topic modeling both on the corpus consisting of all the government work reports and on each single government work report. Finally, we find 12 latent topics for the corpus, such as "Economic reform", "Agriculture", "Government construction", and "Defense military". Based on these 12 topics, we conduct topic modeling on every single government work report and use the results to analyze topic evolution over the whole period covered by the reports.
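The preprocessing and co-occurrence steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract names neither the segmentation tool nor the co-occurrence unit, so the sketch assumes the text is already segmented into tokens and counts two terms as co-occurring when they appear in the same text unit. The dictionaries, the function names `preprocess` and `cooccurrence_counts`, and the toy English tokens are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy dictionaries standing in for the authors' domain
# synonym dictionary and domain stopword dictionary (not given in the abstract).
SYNONYMS = {"farming": "agriculture"}
STOPWORDS = {"the", "and", "of"}


def preprocess(tokens, synonyms=SYNONYMS, stopwords=STOPWORDS):
    """Drop stopwords, then map each remaining token to its canonical synonym."""
    return [synonyms.get(t, t) for t in tokens if t not in stopwords]


def cooccurrence_counts(documents):
    """Count, for each unordered term pair, how many text units contain both terms."""
    counts = Counter()
    for tokens in documents:
        # Deduplicate and sort so each pair has one canonical ordering.
        terms = sorted(set(preprocess(tokens)))
        counts.update(combinations(terms, 2))
    return counts


# Assumption: each inner list is one already-segmented text unit.
docs = [
    ["the", "reform", "of", "farming"],
    ["agriculture", "and", "reform"],
    ["defense", "and", "reform"],
]
pairs = cooccurrence_counts(docs)
```

Here `pairs` maps each sorted term pair to the number of text units in which both terms appear. A matrix built from such counts is one possible input to a co-occurrence-based topic model, though the abstract does not say which model the authors used.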