Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

机译：通过基于注意力的相似性学习在社交媒体中可解释的作者身份验证

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not. The analysis is traditionally performed by experts who consider linguistic features, which include spelling mistakes, grammatical inconsistencies, and stylistics for example. Machine learning algorithms, on the other hand, can be trained to accomplish the same, but have traditionally relied on so-called stylometric features. The disadvantage of such features is that their reliability is greatly diminished for short and topically varied social media texts. In this interdisciplinary work, we propose a substantial extension of a recently published hierarchical Siamese neural network approach, with which it is feasible to learn neural features and to visualize the decision-making process. For this purpose, a new large-scale corpus of short Amazon reviews for text comparison research is compiled and we show that the Siamese network topologies outperform state-of-the-art approaches that were built up on stylometric features. Our linguistic analysis of the internal attention weights of the network shows that the proposed method is indeed able to latch on to some traditional linguistic categories.

机译：作者身份验证是分析两个或更多文本的语言模式以确定它们是否由同一作者撰写的任务。传统上，分析是由考虑语言特征的专家执行的，这些语言特征包括拼写错误，语法不一致和风格。另一方面，可以对机器学习算法进行训练以完成相同的任务，但是传统上一直依赖于所谓的测音特征。这种功能的缺点是，对于简短且局部变化的社交媒体文本，其可靠性会大大降低。在这项跨学科的工作中，我们提出了对最新发布的分层暹罗神经网络方法的实质性扩展，利用该方法可以学习神经特征并可视化决策过程。为此，我们编写了一个新的大规模的简短亚马逊评论文集，用于文本比较研究，我们证明了暹罗网络拓扑的性能优于基于样式功能的最新方法。我们对网络内部注意力权重的语言分析表明，所提出的方法确实能够锁定某些传统的语言类别。

著录项

来源
《IEEE International Conference on Big Data》|2019年|36-45|共10页
会议地点
作者
Benedikt Boenninghoff; Steffen Hessler; Dorothea Kolossa; Robert M. Nickel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Linguistics; Social network services; Forensics; Writing; Feature extraction; Task analysis; Big Data;

机译：语言学;社会网络服务;取证;写作;特征提取;任务分析;大数据;

相似文献

外文文献
中文文献
专利

1. Correction to: Novel authorship verification model for social media accounts compromised by a human [J] . Alterkavi Suleyman, Erbay Hasan Multimedia Tools and Applications . 2021,第9期

机译：惩戒：人类损害社交媒体账户的新型作者验证模型
2. Novel authorship verification model for social media accounts compromised by a human [J] . Alterkavi Suleyman, Erbay Hasan Multimedia Tools and Applications . 2021,第9期

机译：人类妥协的社交媒体账户的新型作者验证模式
3. Our Social Media Pioneer explains why social media isn't all about getting your message 'out there', it's about learning stuff too [J] . Chris Moore Highways . 2016,第5appa期

机译：我们的社交媒体先驱者解释了为什么社交媒体不仅仅意味着将信息“传递出去”，还在于学习东西
4. Explainable Authorship Verification in Social Media via Attention-based Similarity Learning [C] . Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, IEEE International Conference on Big Data . 2019

机译：通过基于关注的相似性学习可解释社交媒体的作者验证
5. From 'Social' Media to Collaborative Media: Cooperative Inquiry for Shoulder-To-Shoulder Youth Video Authorship Technologies [D] . ?McRoberts, Sarah 2020

机译：从 “社会” 向媒体合作媒体：合作探究的肩对肩青年视频原创技术
6. Learning the Language of Social Media: A Comparison of Engagement Metrics and Social Media Strategies Used by Food and Nutrition-Related Social Media Accounts [O] . Amy M. Barklamb, Annika Molenaar, Linda Brennan, 2020

机译：学习社交媒体的语言：食物和营养相关的社交媒体账户的参与度量和社交媒体策略的比较
7. Explainable Authorship Verification in Social Media via Attention-based Similarity Learning [O] . Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, 2019

机译：通过基于关注的相似性学习可解释社交媒体的作者验证

Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

摘要

著录项

相似文献

相关主题

期刊订阅