A time-series based aggregation scheme for topic detection in Weibo short texts

Ma Tinghuai; Li Jing; Liang Xinnian; Tian Yuan; Al-Dhelaan Abdullah; Al-Dhelaan Mohammed


Abstract

Discovering hot topics within social networks such as Twitter and Weibo has received much attention in recent years. While topic models such as Latent Dirichlet Allocation (LDA) have been successfully applied to topic discovery, the topics they produce are often less coherent when the models are applied to microblog content, known as "posts". In this paper, we propose a time-series based aggregation scheme for topic modeling in Weibo. As Weibo topics are coherent within a time slice, we divide the Weibo dataset into groups by time slice. With this scheme, the posts in every group are aggregated into several longer pseudo-documents using paragraph-vector based similarity algorithms. Applying this scheme to the LDA model dramatically decreases topic model perplexity and increases clustering quality, which also allows for better discovery of the underlying topics in Weibo. Furthermore, other topic models that extend LDA can then be used directly on such short texts. (C) 2019 Elsevier B.V. All rights reserved.
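
The abstract describes the aggregation scheme only at a high level, so the following Python sketch is an illustration under stated assumptions rather than the authors' implementation. It buckets posts by time slice and then greedily merges similar posts within each slice into pseudo-documents. The paper uses paragraph-vector based similarity; to stay within the standard library, this sketch swaps in cosine similarity over word counts. The names `aggregate_posts`, `slice_hours`, and `threshold` are hypothetical, and whitespace tokenization assumes the posts have already been word-segmented.

```python
from collections import Counter, defaultdict
from datetime import datetime, timezone
import math


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def aggregate_posts(posts, slice_hours=24, threshold=0.3):
    """Group (timestamp, text) posts by time slice, then greedily merge
    similar posts within each slice into longer pseudo-documents."""
    slices = defaultdict(list)
    for ts, text in posts:
        # Bucket index: number of whole slices elapsed since the Unix epoch.
        bucket = int(ts.timestamp() // (slice_hours * 3600))
        slices[bucket].append(text)

    pseudo_docs = []
    for bucket in sorted(slices):
        clusters = []  # each cluster: (accumulated term counts, member texts)
        for text in slices[bucket]:
            vec = Counter(text.lower().split())
            best, best_sim = None, threshold
            for cluster in clusters:
                sim = cosine(vec, cluster[0])
                if sim >= best_sim:
                    best, best_sim = cluster, sim
            if best is None:
                clusters.append((vec, [text]))  # start a new pseudo-document
            else:
                best[0].update(vec)             # grow the closest one
                best[1].append(text)
        pseudo_docs.extend(" ".join(members) for _, members in clusters)
    return pseudo_docs


# Made-up example posts (not data from the paper).
utc = timezone.utc
posts = [
    (datetime(2019, 1, 1, 8, tzinfo=utc), "flood warning river city"),
    (datetime(2019, 1, 1, 9, tzinfo=utc), "river flood warning issued"),
    (datetime(2019, 1, 1, 10, tzinfo=utc), "football final tonight"),
    (datetime(2019, 1, 2, 8, tzinfo=utc), "flood warning river city"),
]
docs = aggregate_posts(posts)
```

With these example posts, the two flood posts from the first day merge into one pseudo-document, while the football post and the second-day flood post stay separate. The resulting pseudo-documents could then be passed to any LDA implementation in place of the raw posts.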


Bibliographic details

Source
Physica, A. Statistical mechanics and its applications | 2019, issue 2019 | 12 pages
Authors
Ma Tinghuai; Li Jing; Liang Xinnian; Tian Yuan; Al-Dhelaan Abdullah; Al-Dhelaan Mohammed
Author affiliations

Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China;

Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China;

Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China;

King Saud Univ, Coll Comp & Informat Sci, Comp Sci Dept, Riyadh 11362, Saudi Arabia;

King Saud Univ, Coll Comp & Informat Sci, Comp Sci Dept, Riyadh 11362, Saudi Arabia;

King Saud Univ, Coll Comp & Informat Sci, Comp Sci Dept, Riyadh 11362, Saudi Arabia;

Indexing information
Original format: PDF
Language of text: English
Chinese Library Classification: Physics
Keywords
Topic discovery; Short texts; Aggregation; Time-series
Date added to database: 2022-08-19 18:16:37

Similar literature

1. A time-series based aggregation scheme for topic detection in Weibo short texts [J]. Ma Tinghuai, Li Jing, Liang Xinnian. Physica, A. Statistical mechanics and its applications. 2019
2. A Robust User Sentiment Biterm Topic Mixture Model Based on User Aggregation Strategy to Avoid Data Sparsity for Short Text [J]. Nimala K., Jebakumar R. Journal of medical systems. 2019, issue 4
3. Experimental explorations on short text topic mining between LDA and NMF based Schemes [J]. Yong Chen, Hui Zhang, Rui Liu. Knowledge-Based Systems. 2019, issue JANa1
4. Short and Sparse Text Topic Modeling via Self-Aggregation [C]. Xiaojun Quan, Chunyu Kit, Yong Ge. International Joint Conference on Artificial Intelligence. 2015
5. Topic Modeling and Spam Detection for Short Text Segments in Web Forums [D]. Sun, Yingcheng. 2020
6. Using Topic Modeling Methods for Short-Text Data: A Comparative Analysis [O]. Rania Albalawi, Tet Hin Yeap, Morad Benyoucef. 2020
7. A Novel Hot Topic Detection Framework With Integration of Image and Short Text Information From Twitter [O]. Chengde Zhang, Shaozhen Lu, Chengming Zhang. 2019
