...
首页> 外文期刊>Information retrieval >Those were the days: learning to rank social media posts for reminiscence
【24h】

Those were the days: learning to rank social media posts for reminiscence

机译:那是过去的日子:学习对社交媒体帖子进行排名以使人联想到

获取原文
获取原文并翻译 | 示例
           

摘要

Social media posts are a great source for life summaries aggregating activities, events, interactions and thoughts of the last months or years. They can be used for personal reminiscence as well as for keeping track with developments in the lives of not-so-close friends. One of the core challenges of automatically creating such summaries is to decide which posts are memorable, i.e., should be considered for retention and which ones to forget. To address this challenge, we design and conduct user evaluation studies and construct a corpus that captures human expectations towards content retention. We analyze this corpus to identify a small set of seed features that are most likely to characterize memorable posts. Next, we compile a broader set of features that are leveraged to build general and personalized machine-learning models to rank posts for retention. By applying feature selection, we identify a compact yet effective subset of these features. The models trained with the presented feature sets outperform the baseline models exploiting an intuitive set of temporal and social features.
机译:社交媒体帖子是汇总过去几个月或几年中的活动,事件,互动和思想的生活摘要的重要来源。它们可以用于个人回忆,以及跟踪不太亲密的朋友的生活发展。自动创建此类摘要的核心挑战之一是确定哪些帖子值得纪念,即应考虑保留哪些帖子以及忘记哪些帖子。为了应对这一挑战,我们设计并进行了用户评估研究,并构建了一个语料库,以捕捉人类对内容保留的期望。我们分析该语料库,以确定一小组最有可能刻画难忘职位的种子特征。接下来,我们将编译一组更广泛的功能,这些功能可用于构建通用的和个性化的机器学习模型来对职位进行排名以进行保留。通过应用特征选择,我们确定了这些特征的紧凑而有效的子集。利用呈现的特征集训练的模型在利用直观的时间和社交特征集方面优于基线模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号