首页> 外文期刊>清华大学学报(英文版) >Modeling Chinese Microblogs with Five Ws for Topic Hashtags Extraction
【24h】

Modeling Chinese Microblogs with Five Ws for Topic Hashtags Extraction

机译:五W的主题链接标签抽取的中文微博建模。

获取原文
获取原文并翻译 | 示例
       

摘要

Hashtags are important metadata in microblogs and are used to mark topics or index messages.However,statistics show that hashtags are absent from most microblogs.This poses great challenges for the retrieval and analysis of these tagless microblogs.In this paper,we summarize the similarity between microblogs and shortmessage-style news,and then propose an algorithm,named 5WTAG,for detecting microblog topics based on a model of five Ws (When,Where,Who,What,hoW).As five-W attributes are the core components in event description,it is guaranteed theoretically that 5WTAG can properly extract semantic topics from microblogs.We introduce the detailed procedure of the algorithm in this paper including spam microblog identification,microblog segmentation,and candidate hashtag construction.In addition,we propose a novel recommendation computing method for ranking candidate hashtags,which combines syntax and semantic analysis and observes the distribution of artificial topic hashtags.Finally,we conduct comprehensive experiments to verify the semantic correctness and completeness of the candidate hashtags,as well as the accuracy of the recommendation method using real data from Sina Weibo.
机译:标记是微博客中重要的元数据,用于标记主题或索引消息。但是,统计数据表明,大多数微博客中都没有标记。这对无标记微博客的检索和分析提出了巨大挑战。本文总结了相似性在微博和短消息风格的新闻之间,然后提出一种名为5WTAG的算法,该算法基于五个W的模型(When,Where,Who,What,hoW)来检测微博主题。由于5 W属性是事件描述,从理论上保证5WTAG可以正确地从微博中提取语义主题。本文介绍了该算法的详细过程,包括垃圾邮件微博识别,微博分段和候选标签构建。此外,我们提出了一种新颖的推荐计算方法结合语法和语义分析并观察人工主题标签的分布的候选标签标签排名方法。最后,我们进行了全面的实验,以验证候选主题标签的语义正确性和完整性,以及使用新浪微博的真实数据来推荐方法的准确性。

著录项

  • 来源
    《清华大学学报(英文版)》 |2017年第2期|135-148|共14页
  • 作者单位

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

    College of Computer Science and Engineering,Northeastern University,Shenyang110819,China;

  • 收录信息 中国科学引文数据库(CSCD);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2024-01-27 06:19:53
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号