首页> 外文会议>International Conference on Information Systems Engineering >Labelling Topics in Weibo Using Word Embedding and Graph-Based Method
【24h】

Labelling Topics in Weibo Using Word Embedding and Graph-Based Method

机译:使用Word Embedding和基于图形方法的Weibo标记主题

获取原文

摘要

Nowadays, in China, Weibo is becoming an increasingly popular way for people to know what is happening in the world. Labelling topics is of much importance for better understanding the semantics of topics. Existing works mainly focus on deriving candidate labels by exploring the use of external knowledge, which may be more appropriate for well formatted and static documents. Recently, it has been a new trend to generate labels for sparse and dynamic microblogging environment using summarization method. The challenges of labelling topics are how to obtain coherent candidate labels and how to rank the labels. In this paper, based on the latest research work in deep learning, we propose a novel and unified model for labelling topics in Weibo, which firstly adopts word embedding and clustering method to learn dense semantic representation of topic words and mine the coherent candidate topic labels, then, generates interpretable labels using a graph-based model. Experimental results show that topics labels discovered by our model not only have high topic coherence, but also are meaningful and interpretable.
机译:如今,在中国,微博正在成为人们了解世界发生的事情的越来越受欢迎的方式。标签主题对于更好地理解主题的语义来说是非常重要的。现有的作品主要专注于通过探索外部知识的使用来派生候选标签,这可能更适合格式化和静态文件。最近,使用摘要方法为生成稀疏和动态微博环境生成标签的新趋势。标签主题的挑战是如何获得一致候选标签以及如何对标签进行排名。在本文的基础上,基于深入学习的最新研究工作,我们提出了一种新颖和统一的模型,用于在微博中标记主题,首先采用单词嵌入和聚类方法来学习主题词的密集语义表示,即连贯的候选主题标签然后,使用基于图形的模型生成可解释标签。实验结果表明,我们的模型发现的主题标签不仅具有高度的主题连贯性,而且是有意义和可解释的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号