【24h】

Cross-Domain Age Inference Framework Based on Common-Topics

机译:基于共同主题的跨域年龄推断框架

获取原文

摘要

Privacy leakage is one of the raising serious concerns in public. Some online service systems cope with this problem bypassing gathering user privacy information, age, for instance. Without collecting and studying these information, is it possible to infer user age or not in such systems? In this paper, we proposed a cross-domain age inference framework based on common-topics to show the possibility of age disclosure with aid of transferred knowledge from an auxiliary domain. Specifically, we take the online book and movie systems, i.e., BookCrossing and Movielens systems, as target and auxiliary domains respectively. First, we extract common-topics from item titles and descriptions to aggregate user behavior. Such aggregation behaviors of Movielens system are regarded as transferred knowledge, and bridge BookCrossing system to build age inference module. Moreover, various classical algorithms are compared. And targeting age imbalance distribution, balance evaluation metrics are proposed to evaluate our algorithm. Experiment results show that for BookCrossing system, age inference performance of our proposed cross-domain age inference framework based on common-topics is even better than that of system with available age information, which means that the privacy risk even exist in a supposedly safe environment.
机译:隐私泄露是引起公众严重关注的问题之一。例如,一些在线服务系统绕过了收集用户隐私信息(例如年龄)来解决此问题。在不收集和研究这些信息的情况下,是否可以推断出此类系统中的用户年龄?在本文中,我们提出了一个基于共同主题的跨域年龄推断框架,以借助辅助域中转移的知识来展示年龄披露的可能性。具体来说,我们将在线图书和电影系统(即BookCrossing和Movielens系统)分别作为目标域和辅助域。首先,我们从商品标题和描述中提取常见主题,以汇总用户行为。 Movielens系统的这种聚合行为被视为已转移的知识,并桥接BookCrossing系统以构建年龄推断模块。此外,比较了各种经典算法。针对年龄不平衡分布,提出平衡评估指标对算法进行评估。实验结果表明,对于BookCrossing系统,我们提出的基于公共主题的跨域年龄推断框架的年龄推断性能甚至优于具有可用年龄信息的系统的年龄推断性能,这意味着隐私风险甚至存在于假定的安全环境中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号