首页> 外文期刊>The Electronic Library >Predicting users' demographic characteristics in a Chinese social media network
【24h】

Predicting users' demographic characteristics in a Chinese social media network

机译:在中国社交媒体网络中预测用户的人口统计特征

获取原文
获取原文并翻译 | 示例
       

摘要

Purpose - Based on user-generated content from a Chinese social media platform, this paper aims to investigate multiple methods of constructing user profiles and their effectiveness in predicting their gender, age and geographic location. Design/methodology/approach - This investigation collected 331, 634 posts from 4, 440 users of Sina Weibo. The data were divided into two parts, for training and testing. First, a vector space model and topic models were applied to construct user profiles. A classification model was then learned by a support vector machine according to the training data set. Finally, we used the classification model to predict users' gender, age and geographic location in the testing data set. Findings - The results revealed that in constructing user profiles, latent semantic analysis performed better on the task of predicting gender and age. By contrast, the method based on a traditional vector space model worked better in making predictions regarding the geographic location. In the process of applying a topic model to construct user profiles, the authors found that different prediction tasks should use different numbers of topics. Originality/value - This study explores different user profile construction methods to predict Chinese social media network users' gender, age and geographic location. The results of this paper will help to improve the quality of personal information gathered from social media platforms, and thereby improve personalized recommendation systems and personalized marketing.
机译:目的-基于来自中国社交媒体平台的用户生成的内容,本文旨在研究构建用户资料的多种方法及其在预测其性别,年龄和地理位置方面的有效性。设计/方法/方法-该调查收集了来自新浪微博的4,440位用户的331条634条帖子。数据分为两个部分,分别用于培训和测试。首先,将向量空间模型和主题模型应用于构建用户资料。然后由支持向量机根据训练数据集学习分类模型。最后,我们使用分类模型来预测测试数据集中用户的性别,年龄和地理位置。结果-结果显示,在构建用户个人资料时,潜在的语义分析在预测性别和年龄的任务上表现更好。相比之下,基于传统矢量空间模型的方法可以更好地预测地理位置。在应用主题模型构建用户资料的过程中,作者发现不同的预测任务应使用不同数量的主题。原创性/价值-这项研究探索了不同的用户资料构建方法来预测中国社交媒体网络用户的性别,年龄和地理位置。本文的结果将有助于提高从社交媒体平台收集的个人信息的质量,从而改善个性化推荐系统和个性化营销。

著录项

  • 来源
    《The Electronic Library》 |2017年第4期|758-769|共12页
  • 作者单位

    Department of Information Management, Nanjing University of Science and Technology, Nanjing, China;

    Department of Information Management, Nanjing University of Science and Technology, Nanjing, China;

    Department of Information Management, Nanjing University of Science and Technology, Nanjing, China,Jiangsu Key Laboratory of Data Engineering, Knowledge Service Nanjing University, Nanjing, China,Fujian Provincial Key Laboratory of Information Processing, Intelligent Control Minjiang University, Fuzhou, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Machine learning; Social media; Text classification; Users' profile;

    机译:机器学习;社交媒体;文字分类;用户资料;
  • 入库时间 2022-08-17 23:21:34

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号