首页> 外文会议>International Conference on Intelligent Systems and Knowledge Engineering >Textual topic evolution analysis based on term co-occurrence: A case study on the government work report of the State Council (1954–2017)
【24h】

Textual topic evolution analysis based on term co-occurrence: A case study on the government work report of the State Council (1954–2017)

机译:基于词语共现的文本主题演变分析:以国务院政府工作报告为例(1954–2017)

获取原文

摘要

The government work report of the State Council is a kind of comprehensive policy text. This paper uses text mining technology to carry out a comprehensive multi-granularity, multi-level quantitative analysis of the government work reports, which has a great practical and instructive significance for relevant personnels to understand the evolution of domain knowledge in a short time. Firstly, a series of text preprocessing is done by using the Chinese word segmentation tool combined with three kind of dictionary built by authors, i.e., the domain word dictionary, the domain synonym dictionary and the domain stopword dictionary. Then, according to the co-occurrence information of words in the government work reports, we attempt to conduct topic modeling on the corpus consisted of all the government work reports and single government work report respectively, Finally, we find 12 latent topics for the corpus, such as the "Economic reform", "Agriculture", "Government construction", "Defense military" and so on. Based on the 12 topics, we conduct the topic modeling on every single government work report, with which topic evolution analysis is carried out over the whole period of all government work reports.
机译:国务院的政府工作报告是一种综合性的政策文本。本文采用文本挖掘技术对政府工作报告进行了全面的多粒度,多层次的定量分析,对于相关人员在短时间内了解领域知识的发展具有重要的现实和指导意义。首先,使用中文分词工具,结合作者编写的三种词典,即域词词典,域同义词词典和域停用词词典,进行一系列的文本预处理。然后,根据政府工作报告中词语的共现信息,尝试对分别由所有政府工作报告和单个政府工作报告组成的语料库进行主题建模,最后,为该语料库找到12个潜在主题。如“经济改革”,“农业”,“政府建设”,“国防军”等。基于这12个主题,我们在每份政府工作报告上进行主题建模,并在所有政府工作报告的整个期间内进行主题演变分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号