首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures
【24h】

Assessment of Effectiveness of Content Models for Approximating Twitter Social Connection Structures

机译:评估近似Twitter社交连接结构的内容模型的有效性

获取原文

摘要

This paper explores the social quality (goodness) of community structures formed across Twitter users, where social links within the structures are estimated based upon semantic properties of user-generated content (corpus). We examined the overlap of the community structures of the constructed graphs, and followership-based social communities, to find the social goodness of the links constructed. Unigram, bigram and LDA content models were empirically investigated for evaluation of effectiveness, as approximators of underlying social graphs, such that they maintain the community social property. Impact of content at varying granularities, for the purpose of predicting links while retaining the social community structures, was investigated. 100 discussion topics, spanning over 10 Twitter events, were used for experiments. The unigram language model performed the best, indicating strong similarity of word usage within deeply connected social communities. This observation agrees with the phenomenon of evolution of word usage behavior, that transform individuals belonging to the same community tending to choose the same words, made by [1], and raises a question on the literature that use, without validation, LDA for content-based social link prediction over other content models. Also, semantically finer-grained content was observed to be more effective compared to coarser-grained content.
机译:本文探讨了Twitter用户中形成的社区结构的社会素质(善良),其中基于用户生成的内容(语料库)的语义属性估计了结构内的社交链接。我们检查了构建的图表和基于追随者的社会社区的社区结构的重叠,以找到所构建的联系的社会优美。 uniagram,Bigram和LDA内容模型是经验研究的,以评估有效性,作为潜在的社会图表的近似值,使他们保持社区的社会财产。调查了含量在不同粒度下的影响,以便在保留社会界结构的同时预测链接。 100次讨论主题,跨越10次推特事件,用于实验。 UNIGRAM语言模型表现了最佳,表明在深层连接的社交社区中的单词使用情况强烈相似。此观察结果同意单词使用行为的演变现象,使属于同一社区的个人倾向于选择相同的单词,以选择相同的单词,并在没有验证的情况下提出了关于使用的文献的问题基于其他内容模型的社交链路预测。此外,与粗糙粒含量相比,观察到语义上细粒含量更有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号