Affinity: A System for Latent User Similarity Comparison on Texting Data

机译：亲和力：用于文本数据潜在用户相似性比较的系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the field of social networking services, finding similar users based on profile data is common practice. Smartphones harbor sensor and personal context data that can be used for user profiling. Yet, one vast source of personal data, that is text messaging data, has hardly been studied for user profiling. We see three reasons for this: First, private text messaging data is not shared due to their intimate character. Second, the definition of an appropriate privacy-preserving similarity measure is nontrivial. Third, assessing the quality of a similarity measure on text messaging data representing a potentially infinite set of topics is non-trivial. In order to overcome these obstacles we propose affinity, a system that assesses the similarity between text messaging histories of users reliably and efficiently in a privacypreserving manner. Private texting data stays on user devices and data for comparison is compared in a latent format that neither allows to reconstruct the comparison words nor any original private plain text. We evaluate our approach by calculating similarities between Twitter histories of 60 US senators. The resulting similarity network reaches an average 85.0% accuracy on a political party classification task.

机译：在社交网络服务领域，基于个人资料数据寻找相似用户是常见的做法。智能手机包含可用于用户配置文件的传感器和个人上下文数据。但是，几乎没有研究过用于个人资料分析的大量个人数据源，即文本消息传递数据。我们看到以下三个原因：首先，由于私人短信消息的私密性，因此无法共享。其次，适当的保护隐私的相似性度量的定义是不平凡的。第三，评估表示潜在的无限主题集的文本消息传递数据的相似性度量的质量并非易事。为了克服这些障碍，我们提出了亲和力，该系统以隐私保护的方式可靠且有效地评估用户的文本消息历史之间的相似性。私人短信数据保留在用户设备上，用于比较的数据以一种潜在的格式进行比较，既不允许重构比较词，也不允许任何原始的私人纯文本。我们通过计算60位美国参议员的Twitter历史之间的相似性来评估我们的方法。由此产生的相似性网络在政党分类任务中的平均准确度达到85.0％。

著录项

来源
《IEEE International Conference on Communications》|2019年|1-7|共7页
会议地点
作者
Tobias Eichinger; Felix Beierle; Sumsam U. Khan; Robin Middelanis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
History; Electronic messaging; Task analysis; Twitter; Smart phones; Vocabulary;

机译：历史;电子消息;任务分析; Twitter;智能手机;词汇;

相似文献

外文文献
中文文献
专利

1. A Cross-Database Comparison to Discover Potential Product Opportunities Using Text Mining and Cosine Similarity [J] . Yung-Chi Shen, Grace T R Lin, Jan-Ruei Lin, Journal of Scientific & Industrial Research . 2017,第1期

机译：使用文本挖掘和余弦相似度进行跨数据库比较以发现潜在的产品机会
2. Analyzing Unstructured Text Data: Using Latent Categorization To Identify Intellectual Communities In Information Systems [J] . Kai R, Larsen, David E, Decision support systems . 2008,第4期

机译：分析非结构化文本数据：使用潜在分类识别信息系统中的知识社区
3. Design issues for socially intelligent user interfaces. A discourse analysis of a data-to-text system for summarizing clinical data. [J] . McKinlay A, McVittie C, Reiter E, Methods of information in medicine . 2010,第4期

机译：社交智能用户界面的设计问题。对用于汇总临床数据的数据到文本系统的话语分析。
4. Affinity: A System for Latent User Similarity Comparison on Texting Data [C] . Tobias Eichinger, Felix Beierle, Sumsam U. Khan, IEEE International Conference on Communications . 2019

机译：亲和力：发短信数据的潜在用户相似性比较系统
5. Mining texts and social users using time series and latent topics [D] . Yang, Tao. 2014

机译：使用时间序列和潜在主题挖掘文本和社会用户
6. Faulty Feeder Identification Based on Data Analysis and Similarity Comparison for Flexible Grounding System in Electric Distribution Networks [O] . Kangli Liu, Sen Zhang, Baorun Li, 2021

机译：基于数据分析和相似性比较的电力分配网络中的柔性接地系统的相似性识别故障
7. Affinity: A System for Latent User Similarity Comparison on Texting Data [O] . Tobias Eichinger, Felix Beierle, Sumsam U. Khan, 2019

机译：亲和力：发短信数据的潜在用户相似性比较系统
8. AN ON-LINE CONVERSATIONAL RETRIEVAL SYSTEM FOR ORCHIS TEXT-ORIENTED DATA BASES USER'S MANUAL [R] . V. A. Singletary 1975

机译：用于ORCHIs文本的数据库用户手册的在线对话检索系统

Affinity: A System for Latent User Similarity Comparison on Texting Data

摘要

著录项

相似文献

相关主题

期刊订阅