【24h】

A Topic Detection Method for Chinese Microblog

机译:中文微博主题检测方法

获取原文
获取原文并翻译 | 示例

摘要

A model for topic detection in Chinese microblog is proposed. Based on the traditional vector space model, a feature selection and weight computation method are introduced to express messages in microblog due to their special characteristics. We also introduce a scoring method for the tweets which can filter out most topic-unrelated tweets at first, in order to minimize the impact of noise. Then a topic detection algorithm is proposed, using a new vector distance computation method. The results show that our method can filter out almost all the topic-unrelated tweets and identify topics in microblog accurately and efficiently. The study of topic detection method in microblog can help users and governments to find out hot topics dynamically.
机译:提出了中文微博中话题检测的模型。基于传统的矢量空间模型,引入了特征选择和权重计算方法,以利用微博中的特征来表达信息。我们还为推文引入了一种评分方法,该方法可以首先过滤掉大多数主题无关的推文,以最大程度地减少噪声的影响。然后,提出了一种使用新的矢量距离计算方法的主题检测算法。结果表明,我们的方法可以过滤掉几乎所有与主题无关的推文,并准确,有效地在微博中识别主题。微博中话题检测方法的研究可以帮助用户和政府动态发现热点话题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号