Classification of Private Tweets Using Tweet Content

机译：使用推文内容分类私人推文

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Online social networks (OSNs) like Twitter provide an open platform for users to easily convey their thoughts and ideas from personal experiences to breaking news. With the increasing popularity of Twitter and the explosion of tweets, we have observed large amounts of potentially sensitive/private messages being published to OSNs inadvertently or voluntarily. The owners of these messages may become vulnerable to online stalkers or adversaries, and they often regret posting such messages. Therefore, identifying tweets that reveal private/sensitive information is critical for both the users and the service providers. However, the definition of sensitive information is subjective and different from person to person. To develop a privacy protection mechanism that is customizable to fit the needs of diverse audiences, it is essential to accurately and automatically classify potentially sensitive tweets. In this paper, we make the first attempt to classify private tweets into 14 categories, such as alcohol & drugs, family information, etc. We model tweet semantic with term distribution features as well as users' topic-preferences based on personal tweet history. Experiments show that our method can boost classification accuracy compared with the well-known Bag-of-Words and tf-idf methods.

机译：在线社交网络（OSN）如Twitter，为用户提供了一个开放的平台，以便在个人经验中轻松地传达他们的思想和想法。随着Twitter的普及和推文的爆炸，我们已经观察到大量潜在的敏感/私人消息被无意或自愿地发布给OSNS。这些消息的所有者可能变得容易受到在线追踪者或对手的影响，他们经常后悔发布此类信息。因此，识别揭示私有/敏感信息的推文对于用户和服务提供商至关重要。然而，敏感信息的定义是主观的，与人的相同。要开发隐私保护机制，可定制以满足不同受众的需求，必须准确和自动分类潜在敏感的推文。在本文中，我们首次尝试将私人推文分为14个类别，例如酒精和药物，家庭信息等。我们根据个人推文历史记录为推文分发功能以及用户的主题首选项。实验表明，与众所周知的单词和TF-IDF方法相比，我们的方法可以提高分类精度。

著录项

来源
《IEEE International Conference on Semantic Computing》|2017年|535p|共4页
会议地点
作者
Qiaozhi Wang; Jaisneet Bhandal; Shu Huang; Bo Luo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301-53;
关键词
Twitter; Privacy; Drugs; Labeling; Boosting; Semantics; Data collection;

机译：Twitter;隐私;药物;标签;升压;语义;数据收集;

相似文献

外文文献
中文文献
专利

1. A novel domain and event adaptive tweet augmentation approach for enhancing the classification of crisis related tweets [J] . Ramachandran Dharini, Parvathi R. Data & Knowledge Engineering . 2021,第Sepa期

机译：一种新的域名和事件自适应推文增强方法，用于加强危机相关推文的分类
2. Semantic Abstraction for Generalization of Tweet Classification: An Evaluation on Incident-Related Tweets [J] . Schulz Axel, Guckelsberger Christian, Janssen Frederik Semantic web . 2017,第3期

机译：推文分类概括的语义抽象：对与事件相关的推文的评估
3. Tweeting on dementia: A snapshot of the content and sentiment of tweets associated with dementia [J] . David Robertshaw, Ivana Babicova First Monday . 2021,第6期

机译：关于痴呆症的推特：与痴呆症相关的推文的内容和情绪的快照
4. Classification of Private Tweets Using Tweet Content [C] . Qiaozhi Wang, Jaisneet Bhandal, Shu Huang, IEEE International Conference on Semantic Computing . 2017

机译：使用推文内容对私人推文进行分类
5. Competing by Tweeting: A Content Analysis of University Presidents’ Tweets [D] . Walton, Susan Balcom. 2019

机译：推特竞争：大学总统推文的内容分析
6. Qualitative Twitter analysis of participants tweet strategies and tweet content at a major urologic conference [O] . Hendrik Borgmann, Jan-Henning Woelm, Axel Merseburger, 2016

机译：在一次大型泌尿科会议上对参与者推特策略和推特内容进行定性Twitter分析
7. Tweets Classification and Sentiment Analysis for Personalized Tweets Recommendation [O] . Asad Masood Khattak, Rabia Batool, Fahad Ahmed Satti, 2020

机译：Tweets个性化推文推荐的分类和情感分析

Classification of Private Tweets Using Tweet Content

摘要

著录项

相似文献

相关主题

期刊订阅