首页> 外文会议>IEEE International Conference on Communications >Affinity: A System for Latent User Similarity Comparison on Texting Data
【24h】

Affinity: A System for Latent User Similarity Comparison on Texting Data

机译:亲和力:用于文本数据潜在用户相似性比较的系统

获取原文

摘要

In the field of social networking services, finding similar users based on profile data is common practice. Smartphones harbor sensor and personal context data that can be used for user profiling. Yet, one vast source of personal data, that is text messaging data, has hardly been studied for user profiling. We see three reasons for this: First, private text messaging data is not shared due to their intimate character. Second, the definition of an appropriate privacy-preserving similarity measure is nontrivial. Third, assessing the quality of a similarity measure on text messaging data representing a potentially infinite set of topics is non-trivial. In order to overcome these obstacles we propose affinity, a system that assesses the similarity between text messaging histories of users reliably and efficiently in a privacypreserving manner. Private texting data stays on user devices and data for comparison is compared in a latent format that neither allows to reconstruct the comparison words nor any original private plain text. We evaluate our approach by calculating similarities between Twitter histories of 60 US senators. The resulting similarity network reaches an average 85.0% accuracy on a political party classification task.
机译:在社交网络服务领域,基于个人资料数据寻找相似用户是常见的做法。智能手机包含可用于用户配置文件的传感器和个人上下文数据。但是,几乎没有研究过用于个人资料分析的大量个人数据源,即文本消息传递数据。我们看到以下三个原因:首先,由于私人短信消息的私密性,因此无法共享。其次,适当的保护隐私的相似性度量的定义是不平凡的。第三,评估表示潜在的无限主题集的文本消息传递数据的相似性度量的质量并非易事。为了克服这些障碍,我们提出了亲和力,该系统以隐私保护的方式可靠且有效地评估用户的文本消息历史之间的相似性。私人短信数据保留在用户设备上,用于比较的数据以一种潜在的格式进行比较,既不允许重构比较词,也不允许任何原始的私人纯文本。我们通过计算60位美国参议员的Twitter历史之间的相似性来评估我们的方法。由此产生的相似性网络在政党分类任务中的平均准确度达到85.0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号