Topic Modelling Twitter Data with Latent Dirichlet Allocation Method

Abstract

Twitter is a popular social media platform where users share thoughts and emotions as tweets, short text messages limited to 140 characters. It is also a source of continually up-to-date information, and tweets can be treated as big data because they can serve as a data source for research. Latent Dirichlet Allocation (LDA) is an algorithm that can process large volumes of text data (big data). This study uses the LDA method to produce topic models, the similarity of each topic, and a visualization of topic clusters from tweet data in Indonesian covering 4 topics (Economic, Military, Sports, Technology), where each topic has a different number of tweets. The LDA method was applied to the tweet data successfully and is reported to work optimally in topic extraction, topic modeling, generation of the index words in each topic cluster, and computer visualization of the topics. The LDA output shows its best word-indexing performance on the Sports topic, with 1260 tweets and an accuracy of 98%, better than the LSI method in topic modeling.
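
As a rough illustration of what LDA topic modeling involves, the sketch below fits a tiny model with collapsed Gibbs sampling, one standard inference scheme for LDA. The abstract does not say which implementation or settings the authors used, so everything here is an assumption for illustration: the function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, the iteration count, and the toy tweets (invented English tokens, not the Indonesian data from the study).

```python
import random


def lda_gibbs(docs, num_topics, iterations=200, alpha=0.1, beta=0.01,
              top_n=3, seed=0):
    """Fit LDA with collapsed Gibbs sampling on tokenised documents.

    Returns (theta, top_words): per-document topic proportions and the
    highest-count words of each topic. Hypothetical helper, not the
    paper's code.
    """
    rng = random.Random(seed)
    vocab = sorted({word for doc in docs for word in doc})
    word_id = {word: i for i, word in enumerate(vocab)}
    vocab_size = len(vocab)

    # Count tables used by the sampler.
    doc_topic = [[0] * num_topics for _ in docs]                # words in doc d assigned to topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # times word v is assigned to topic k
    topic_total = [0] * num_topics                              # total words assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        doc_assignments = []
        for word in doc:
            k = rng.randrange(num_topics)
            doc_assignments.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[word]] += 1
            topic_total[k] += 1
        assignments.append(doc_assignments)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, word in enumerate(doc):
                v = word_id[word]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Unnormalised conditional probability of each topic.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed per-document topic proportions.
    theta = [
        [(count + alpha) / (len(doc) + num_topics * alpha) for count in counts]
        for counts, doc in zip(doc_topic, docs)
    ]
    # Words ranked by how often they were assigned to each topic.
    top_words = []
    for k in range(num_topics):
        ranked = sorted(range(vocab_size),
                        key=lambda v: topic_word[k][v], reverse=True)
        top_words.append([vocab[v] for v in ranked[:top_n]])
    return theta, top_words


# Invented toy "tweets", already tokenised (assumption for illustration).
tweets = [
    ["bank", "inflation", "market", "bank"],
    ["market", "inflation", "price"],
    ["football", "match", "goal", "match"],
    ["goal", "football", "league"],
]
theta, top_words = lda_gibbs(tweets, num_topics=2)
for k, words in enumerate(top_words):
    print(f"topic {k}: {words}")
for row in theta:
    print([round(p, 2) for p in row])
```

Each row of `theta` gives one tweet's estimated mix of topics, and `top_words` plays the role of the index words per topic cluster mentioned in the abstract. On real tweet data, a maintained library implementation and proper preprocessing (tokenisation, stop-word removal) would be the more likely choice than a hand-written sampler.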

Bibliographic record

Source
International Conference on Electrical Engineering and Computer Science | 2019 | 1 v. | 5 pages
Authors
Edi Surya Negara; Dendi Triadi; Ria Andryani
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords
Big Data; information retrieval; pattern clustering; social networking (online); sport; text analysis

Similar literature

1. A comparative analysis of Latent Semantic analysis and Latent Dirichlet allocation topic modeling methods using Bible data [J]. Vasantha Kumari Garbhapu, Prajna Bodapati. Indian Journal of Science and Technology, 2020, No. 44.
2. Analysis of Health Research Topics in Indonesia Using the LDA (Latent Dirichlet Allocation) Topic Modeling Method [J]. Yoga Sahria, Dhomas Hatta Fudholi. Jurnal RESTI: Rekayasa Sistem dan Teknologi Informasi, 2020, No. 2.
3. A Hybrid Model for Topic Modeling Using Latent Dirichlet Allocation and Feature Selection Method [J]. Christy A., Praveena Anto, Shabu Jany. Journal of Computational and Theoretical Nanoscience, 2019, No. 8.
4. Topic Modelling Twitter Data with Latent Dirichlet Allocation Method [C]. Edi Surya Negara, Dendi Triadi, Ria Andryani. International Conference on Electrical Engineering and Computer Science, 2019.
5. Performance of Latent Dirichlet Allocation with Different Topic and Document Structures [D]. Feng, Haotian. 2019.
6. Public discourse and sentiment during the COVID 19 pandemic: Using Latent Dirichlet Allocation for topic modeling on Twitter [O]. Jia Xue, Junxiang Chen, Chen Chen. 2020.
7. Topic Modelling of Germas Related Content on Instagram Using Latent Dirichlet Allocation (LDA) [O]. Muhammad Habibi, Adri Priadana, Andika Bayu Saputra. 2021.
