首页> 外文会议>Proceedings of 23rd ACM conference on hypertext and social media >Predictina Semantic Annotations on the Real-Time Web
【24h】

Predictina Semantic Annotations on the Real-Time Web

机译:实时Web上的Predictina语义注释

获取原文
获取原文并翻译 | 示例

摘要

The explosion of the real-time web has spurred a growing need for new methods to organize, monitor, and distill relevant information from these large-scale social streams. One especially encouraging development is the self-curation of the real-time web via user-driven linking, in which users annotate their own status updates with lightweight semantic annotations - or hashtags. Unfortunately, there is evidence that hashtag growth is not keeping pace with the growth of the overall real-time web. In a random sample of 3 million tweets, we find that only 10.2% contain at least one hashtag. Hence, in this paper we explore the possibility of predicting hashtags for un-annotated status updates. Toward this end, we propose and evaluate a graph-based prediction framework. Three of the unique features of the approach are: (i) a path aggregation technique for scoring the closeness of terms and hashtags in the graph; (ii) pivot term selection, for identifying high value terms in status updates; and (iii) a dynamic sliding window for recommending hashtags reflecting the current status of the real-time web. Experimentally we find encouraging results in comparison with Bayesian and data mining-based approaches.
机译:实时网络的爆炸性增长,激发了对新方法进行组织,监控和提取这些大规模社交流中的相关信息的需求。一种特别令人鼓舞的发展是通过用户驱动的链接对实时Web进行自我管理,其中用户使用轻量级的语义注释(或井号标签)注释自己的状态更新。不幸的是,有证据表明,主题标签的增长跟不上整个实时网络的增长。在300万条推文的随机样本中,我们发现只有10.2%的人至少包含一个标签。因此,在本文中,我们探索了预测未注释状态更新的标签的可能性。为此,我们提出并评估了基于图的预测框架。该方法的三个独特特征是:(i)一种路径聚合技术,用于对图中术语和标签的紧密程度进行评分; (ii)关键术语选择,用于在状态更新中识别高价值术语; (iii)动态滑动窗口,用于推荐反映实时Web当前状态的标签。在实验上,与贝叶斯方法和基于数据挖掘的方法相比,我们发现令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号