Early author profiling on Twitter using profile features with multi-resolution

Pastor Lopez-Monroy A.; Gonzalez Fabio A.; Solorio Thamar

首页> 外文期刊>Expert Systems with Application >Early author profiling on Twitter using profile features with multi-resolution

【24h】

Early author profiling on Twitter using profile features with multi-resolution

机译：使用多分辨率个人资料功能在Twitter上进行早期作者分析

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Author Profiling (AP) task aims to predict demographic characteristics about the authors from documents (e.g., age, gender, native language). The research so far has focused only on forensic scenarios by performing post-analysis using all the available text evidence. This paper introduces the task of Early Author Profiling (EAP) in Twitter. The goal is to effectively recognize profiles using as few tweets as possible from the user history. The task is highly relevant to support social media analysis and different problems related to security and marketing, where prevention and anticipation is crucial. This work proposes a novel strategy that combines a state of the art representation for early text classification and specialized word-vectors for author profiling tasks. In this strategy we build prototypical features called Profile based Meta-Words, which allow us to model AP information at different levels of granularity. Our evaluation shows that the proposed methodology is well suited for profiling little text evidence (e.g., a handful of tweets) in early stages, but as more tweets become available other granularities better encode larger amounts of text in late stages. We evaluated the proposed ideas on gender and language variety identification for English and Spanish, and showed that the proposal outperforms state of the art methodologies. (C) 2019 Elsevier Ltd. All rights reserved.

机译：作者分析（AP）任务旨在根据文档（例如年龄，性别，母语）预测作者的人口统计特征。迄今为止，该研究仅通过使用所有可用的文本证据执行后分析，仅将重点放在法医场景上。本文介绍了Twitter中的早期作者分析（EAP）的任务。目标是使用尽可能少的来自用户历史记录的推文来有效识别配置文件。这项任务与支持社交媒体分析以及与安全和营销有关的各种问题非常相关，在这些问题中，预防和预期至关重要。这项工作提出了一种新颖的策略，该策略结合了用于早期文本分类的最先进的表示方法和用于作者概要分析任务的专用词向量。在这种策略中，我们构建了称为基于配置文件的元词的原型功能，该功能使我们可以在不同的粒度级别上对AP信息进行建模。我们的评估表明，所提出的方法非常适合在早期阶段分析少量文本证据（例如，少数推文），但是随着可用的推文越多，其他粒度可以在后期更好地编码大量文本。我们评估了关于英语和西班牙语的性别和语言多样性识别的提议思想，并表明该提议优于最新的方法论。（C）2019 Elsevier Ltd.保留所有权利。

著录项

来源
《Expert Systems with Application》 |2020年第2期|112909.1-112909.10|共10页
作者
Pastor Lopez-Monroy A.; Gonzalez Fabio A.; Solorio Thamar;
展开▼
作者单位

Math Res Ctr CIMAT Dept Comp Sci Jalisco S-N Guanajuato 36023 Gto Mexico;

Univ Nacl Colombia Comp Syst & Ind Engn Dept MindLab Cra 30 45 03 Ciudad Univ Bogota DC Colombia;

Univ Houston Dept Comp Sci 4800 Calhoun Rd Houston TX 77004 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Early text classification; Author profiling; Social media analysis; Text mining;

机译：早期文本分类;作者简介;社交媒体分析;文字挖掘;

相似文献

外文文献
中文文献
专利

1. 'Less is more': Mining useful features from Twitter user profiles for Twitter user classification in the public health domain [J] . Online Information Review . 2020,第1期

机译：“少即是多”：从Twitter用户配置文件中挖掘有用的功能，以在公共卫生领域中对Twitter用户进行分类
2. An evolutionary approach to mining robust multi-resolution web profiles and context sensitive URL associations [J] . Olfa Nasraoui, Raghu Krishnapuram International Journal of Computational Intelligence and Applications . 2002,第3期

机译：挖掘健壮的多分辨率Web配置文件和上下文相关的URL关联的进化方法
3. BROCELIANDE: A COMPARATIVE STUDY OF ATTRIBUTE PROFILES AND FEATURE PROFILES FROM DIFFERENT ATTRIBUTES [J] . F. Merciol, M.-T. Pham, D. Santana Maia, International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences . 2020,第4期

机译：Broceliande：来自不同属性的属性配置文件和特征概况的比较研究
4. Overview of PAN 2020: Authorship Verification, Celebrity Profiling, Profiling Fake News Spreaders on Twitter, and Style Change Detection [C] . Janek Bevendorff, Bilal Ghanem, Anastasia Giachanou, International Conference of the CLEF Association . 2020

机译：PAN 2020概述：作者身份验证，名人分析，在Twitter上分析假新闻传播者以及样式更改检测
5. The state of women's sports on the web: Content analyses of international sports news websites and athletes' Twitter profiles. [D] . Coche, Roxane. 2013

机译：网络上的女子体育状况：国际体育新闻网站的内容分析和运动员的Twitter个人资料。
6. Who is mentally healthy? Mental health profiles of Japanese social networking service users with a focus on LINE Facebook Twitter and Instagram [O] . Ryota Sakurai, Yuta Nemoto, Hiroko Mastunaga, 2021

机译：谁是精神健康的？日本社交网络服务用户的心理健康概况专注于线路FacebookTwitter和Instagram
7. A Rede Globo no ecossistema da Social TV: uma análise sobre as postagens do perfil @redeglobo no Twitter / Rede Globo in the Social TV ecosystem: an analysis of @redeglobo profile posts on TwitterRede Globo in the Social TV ecosystem: an analysis of @redeglobo profile posts on Twitter [O] . Daiana Sigiliano, Gabriela Borges 2016

机译：社交电视生态系统中的网络Globo：在社交电视生态系统中对Twitter / Rede Globo上的简介帖子@redlobo分析：在社交电视生态系统中的Twitterrede Globo上的分析：@redeglobo配置文件分析在推特上的帖子

Early author profiling on Twitter using profile features with multi-resolution

摘要

著录项

相似文献

相关主题

期刊订阅