首页> 外文期刊>Procedia Computer Science >Socio-demographic features meet interests: on subscription patterns and attention distribution in online social media
【24h】

Socio-demographic features meet interests: on subscription patterns and attention distribution in online social media

机译:社会人口统计特征符合兴趣:关于在线社交媒体的订阅模式和注意力分布

获取原文
获取外文期刊封面目录资料

摘要

This research is aimed to gain a better understanding of underlying connections between different demographic and social factors and interests as well as ways that can help to determine them. In contrast to existing studies of such correlations we focus on attention to specific topics of different socio-demographic classes. Interests are represented by topics that can be assigned to user’s subscriptions. As a measure of involvement in topics, we analyse interests heterogeneity and determine the most influential factors, associated with particular interests. Topic modelling is performed by ARTM; user’s attention to interests is measured by Gini Index and then related to socio-demographic factors. To investigate the influence of features on specific topics we trained an interpretable regression model (XGBoost and SHAP) and built a corresponding graph with clusters to analyze the results. To investigate further we scattered topics according to their socio-demographic profile and coloured according to clusters. Results show that patterns of user’s attention differ depending on socio-demographical features. We notice a shift in attention depending on age, and different patterns of attention for genders. Topics connected to gender mostly have a male audience, while age is more influential among topics with mostly female and mostly age-homogeneous audiences. We also suggest ways that can be used to improve interest prediction.
机译:该研究旨在更好地了解不同人口统计和社会因素和兴趣之间的基础联系以及可以帮助确定它们的方式。与对这种相关性的现有研究相比,我们关注对不同社会人口统计学课程的特定主题的关注。利益由可以分配给用户订阅的主题来表示。作为议题的衡量标题,我们分析了兴趣异质性并确定最有影响力的因素,特别是特别利益。主题建模由ARTM执行;用户对兴趣的关注是由Gini指数衡量的,然后与社会人口统计因素有关。为了调查特定主题对特定主题的影响,我们培训了可解释的回归模型(XGBoost和Shav),并用群集构建了相应的图表来分析结果。进一步调查我们根据其社会人口统计学配置文件分散主题,并根据集群着色。结果表明,用户的注意模式因社会人群特征而异。我们注意到根据年龄的关注,以及对性别的不同关注模式。与性别相连的主题主要有一个男性受众,而年龄在主要是女性和大多数年龄均等受众的主题中更具影响力。我们还建议提供可用于改善兴趣预测的方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号