首页> 外文会议>International conference on neural information processing >Inferring Users' Gender from Interests: A Tag Embedding Approach
【24h】

Inferring Users' Gender from Interests: A Tag Embedding Approach

机译:从兴趣推断用户性别:标签嵌入方法

获取原文

摘要

This paper studies the problem of gender prediction of users in social media using their interest tags. The challenge is that the tag feature vector is extremely sparse and short, i.e., less than 10 tags for each user. We present a novel conceptual class based method which enriches and centralizes the feature space. We first identify the discriminating tags based on the tag distribution. We then build the initial conceptual class by taking the advantage of the generalization and specification operations on these tags. For example, 'Kobe' is a specialized instance of 'basketball'. Finally, we model class expansion as a problem of computing the similarity between one tag and a set of tags in one conceptual class in the embedding space. We conduct extensive experiments on a real dataset from Sina Weibo. Results demonstrate that our proposed method significantly enhances the quality of the feature space and improves the performance of gender classification. Its accuracy reaches 82.25% while that for the original tag vector is only 62.75%.
机译:本文研究了社交媒体中用户使用其兴趣标签进行性别预测的问题。挑战在于标签特征向量非常稀疏且短,即每个用户少于10个标签。我们提出了一种新颖的基于概念类的方法,可以丰富和集中特征空间。我们首先根据标签分布来识别可区别的标签。然后,我们利用这些标签的归纳和规范操作优势来构建初始概念类。例如,“神户”是“篮球”的专门实例。最后,我们将类扩展建模为计算嵌入空间中一个概念类中一个标签和一组标签之间的相似性的问题。我们对来自新浪微博的真实数据集进行了广泛的实验。结果表明,我们提出的方法显着提高了特征空间的质量,并提高了性别分类的性能。其准确率达到82.25%,而原始标签矢量的准确率仅为62.75%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号