首页> 外文期刊>World Wide Web >Query-based unsupervised learning for improving social media search
【24h】

Query-based unsupervised learning for improving social media search

机译:基于查询的无监督学习改进社交媒体搜索

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In the current information era over the internet, social media has become one of the essential information sources for users. While the text is the primary information representation, finding relevant information is a challenging mission for researchers due to its nature (e.g., short length, sparseness). Acquiring high-quality search results from massive data, such as social media needs a set of representative query terms that are not always available. In this paper, we propose a novel query-based unsupervised learning model to represent the implicit relationships in the short text from social media. This bridges the gap of the lack of word co-occurrences without requiring many parameters to be estimated and external evidence to be collected. To confirm the proposed model effectiveness, we compare the proposed model with state-of-the-art lexical, topic model and temporal models on the large-scale TREC microblog 2011-2014 collections. The experimental results show that the proposed model significantly improved overall state-of-the-art lexical, topic model and temporal models with the maximum percentage of increase reaching 33.97% based on MAP value and 21.38% based on Precision at top 30 documents. The proposed model can improve the social media search effectiveness in potential closely retrieval tasks, such as question answering and timeline summarisation.
机译:在目前通过互联网的信息时代,社交媒体已成为用户的基本信息源之一。虽然该文本是主要信息表示,但由于其性质(例如,短而稀疏),查找相关信息是研究人员有挑战性的使命。从大规模数据获取高质量的搜索结果,例如社交媒体需要一组不始终可用的代表性查询术语。在本文中,我们提出了一种新的基于查询的无监督学习模型,以代表社交媒体的短文本中的隐含关系。这弥补了缺乏单词共同发生的差距,而无需估计许多参数和要收集的外部证据。为了确认提出的模型效率,我们将提出的模型与最先进的词汇,主题模型和时间模型进行比较2011-2014集合。实验结果表明,拟议的模型显着提高了整体最先进的词汇,主题模型和时间模型,基于地图值的最大增加的增加33.97%,基于前30名文件的精度增加21.38%。拟议的模型可以提高社交媒体搜索有效性,潜在的密切检索任务,例如问题应答和时间表总结。

著录项

  • 来源
    《World Wide Web》 |2020年第3期|1791-1809|共19页
  • 作者单位

    School of EECS Queensland University of Technology (QUT) Brisbane Australia Umm Al-Qura University Makkah Saudi Arabia;

    School of EECS Queensland University of Technology (QUT) Brisbane Australia;

    School of EECS Queensland University of Technology (QUT) Brisbane Australia;

    School of Economy and Management Hubei University of Technology Wuhan 430064 Hubei China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Information retrieval; Text mining; Microblog retrieval; Pseudo-relevance feedback;

    机译:信息检索;文字挖掘;微博检索;伪相关性反馈;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号