...
首页> 外文期刊>Indian Journal of Science and Technology >Textual Mining- Evaluation of Mann Ki Baat Repository
【24h】

Textual Mining- Evaluation of Mann Ki Baat Repository

机译:Mann Ki BAAT存储库的文本挖掘评估

获取原文

摘要

Background and Objective: As computers and the Internet are broadly utilized in nearly every region, numerous computerized text data is produced each day. It becomes a fundamental task to explore and effectively search such massive data. The main aim of the present study is to emphasize the recurrence of topics and identifying main ideas from a popular monthly addressing radio program Mann Ki Baat by using topic modeling technique. Data and Method: The present study utilizes the unstructured data of Mann ki Baat from January 2020 to March 2020, collected from the PMINDIA website. This program was initiated by the Honorable Prime Minister of India, Mr. Narendra Modi. This examination uses a popular technique Topic modeling based on LDA (Latent Dirichlet Allocation). Findings: The results show that the method automatically extracts the main ideas and issues discussed. Besides it provides information about the most likely topics and themes discussed in each month that left an impact on people and helped in raising awareness. Novelty: This is a first study of the application of popular technique topic modelling on Mann ki Baat. Further, this is the first attempt to extract the ideas discussed in a social campaign using a statistical model.
机译:背景和目标:由于计算机和互联网在几乎每个区域广泛利用,每天产生许多计算机化的文本数据。它成为探索和有效搜索此类大规模数据的基本任务。本研究的主要目的是通过使用主题建模技术来强调主题的重复,并从流行的月度地址无线电计划Mann Ki BAAT识别主要思想。数据和方法:本研究利用从PMIndia网站收集的Mann Ki BAAT的非结构化数据。该计划由印度首相,Narendra Modi先生发起。此检查使用基于LDA(潜在Dirichlet分配)的流行技术主题建模。调查结果:结果表明,该方法自动提取了讨论的主要思想和问题。此外,它提供了有关每个月内讨论的最有可能的主题和主题的信息,这些主题和主题对人们产生了影响并帮助提高了认识。新颖性:这是对曼基·卡特的流行技术主题建模的第一次研究。此外,这是首次尝试使用统计模型提取在社交活动中讨论的想法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号