首页> 外文期刊>Procedia Computer Science >Using Tweets Embeddings For Hashtag Recommendation in Twitter
【24h】

Using Tweets Embeddings For Hashtag Recommendation in Twitter

机译:在Twitter中使用推特嵌入进行标签推荐

获取原文
           

摘要

Social microblogging platforms such as Twitter have become hugely popular forms of this latest sort of blogging. Twitter users make and use hashtags in their tweets to categorize them according to topic or theme, likewise to make them ascertainable to other bloggers through search. However, the liberated hashtag creation policy make a wide hardness for bloggers to find appropriates hashtags for their posts. Indeed, the task of recommending hashtags has many benefits to afford; notably it assists users to choose relevant hashtags for their posts in real time, which will save them from a supplementary stress. Actually, the achieve success of several models of neural networks for calculating word embeddings, has driven approaches for generating syntactic and semantic embeddings for long and noisy text, such as paragraphs, sentences and micro-blogs. On the parallel lines, our aim is to develop a hashtag recommender system to assist users to choose relevant hashtags for their posts in real time, based on using semantic embeddings representation of tweets, which we can subsequently use to capture semantic similarity or relatedness between tweets. In the current paper, we introduce an approach to hashtag recommendation in Twitter that is based on the following proceedings: Using a pre-trained word embeddings on a large corpus such as Google News applying one of the famous embeddings methods, Representing a given tweet by a weighted averaging value of its word embeddings, Then combining these features with the DBSCAN (density-based spatial clustering of applications with noise) clustering algorithm, to divide the heterogeneous collection of tweets into clusters that contain syntactically and semantically similar tweets. Afterwards, Recommending the top-K suitable hashtags to the user after computing the similarity between the entered tweet and the centroids of obtained clusters. Our system achieved promising results which demonstrate the effectiveness of our approach.
机译:Twitter之类的社交微博平台已成为这种最新博客的一种非常流行的形式。 Twitter用户在推文中制作并使用主题标签,以根据主题或主题对主题进行分类,同样可以通过搜索确定其他博客作者的身份。但是,解放的主题标签创建政策使博主很难找到适合其帖子的主题标签。确实,推荐主题标签的任务有很多好处。值得注意的是,它可以帮助用户实时为其帖子选择相关的主题标签,这将使他们免于承受额外的压力。实际上,几种用于计算单词嵌入的神经网络模型的成功实现,推动了为长而嘈杂的文本(如段落,句子和微博)生成语法和语义嵌入的方法。在平行线上,我们的目标是开发一个标签推荐器系统,以基于使用推文的语义嵌入表示,帮助用户实时为其帖子选择相关的标签,随后我们可以用来捕获推文之间的语义相似性或相关性。 。在当前的论文中,我们基于以下过程介绍Twitter中的标签推荐方法:在大型语料库(例如Google新闻)上使用经过预先训练的单词嵌入,使用一种著名的嵌入方法,通过对其词嵌入的加权平均值进行计算,然后将这些功能与DBSCAN(基于噪声的应用程序的基于密度的空间聚类)聚类算法相结合,将异类推文集合划分为包含句法和语义上相似的推文的集群。然后,在计算输入的tweet和获得的聚类的质心之间的相似度后,向用户推荐前K个合适的主题标签。我们的系统取得了令人鼓舞的结果,证明了我们方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号