首页> 外文期刊>Future generation computer systems >Matching user accounts based on user generated content across social networks
【24h】

Matching user accounts based on user generated content across social networks

机译:根据用户在社交网络上生成的内容来匹配用户帐户

获取原文
获取原文并翻译 | 示例
       

摘要

Matching user accounts can help us build better users’ profiles and benefit many applications. It has attracted much attention from both industry and academia. Most of existing works are mainly based on rich user profile attributes. However, in many cases, user profile attributes are unavailable, incomplete or unreliable, either due to the privacy settings or just because users decline to share their information. This makes the existing schemes quite fragile. Users often share their activities on different social networks. This provides an opportunity to overcome the above problem. We aim to address the problem of user identification based on User Generated Content (UGC). We first formulate the problem of user identification based on UGCs and then propose a UGC-based user identification model. A supervised machine learning based solution is presented. It has three steps: firstly, we propose several algorithms to measure the spatial similarity, temporal similarity and content similarity of two UGCs; secondly, we extract the spatial, temporal and content features to exploit these similarities; afterwards, we employ the machine learning method to match user accounts, and conduct the experiments on three ground truth datasets. The results show that the proposed method has given excellent performance with F1 values reaching 89.79%, 86.78% and 86.24% on three ground truth datasets, respectively. This work presents the possibility of matching user accounts with high accessible online data.
机译:匹配用户帐户可以帮助我们建立更好的用户个人资料,并使许多应用程序受益。它已经引起了工业界和学术界的广泛关注。现有的大多数作品主要基于丰富的用户配置文件属性。但是,在许多情况下,由于隐私设置或仅由于用户拒绝共享其信息,用户配置文件属性不可用,不完整或不可靠。这使得现有方案非常脆弱。用户经常在不同的社交网络上分享他们的活动。这提供了克服上述问题的机会。我们旨在解决基于用户生成内容(UGC)的用户标识问题。我们首先提出基于UGC的用户识别问题,然后提出基于UGC的用户识别模型。提出了一种基于监督的机器学习解决方案。它包括三个步骤:首先,我们提出了几种算法来测量两个UGC的空间相似性,时间相似性和内容相似性。其次,我们提取空间,时间和内容特征以利用这些相似性。之后,我们采用机器学习方法来匹配用户帐户,并在三个地面真实数据集上进行实验。结果表明,该方法在三个地面真实数据集上具有优异的性能,F1值分别达到89.79%,86.78%和86.24%。这项工作提出了使用户帐户与可访问性高的在线数据匹配的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号