首页> 外文会议>Workshop on Computational Modeling of PEople's Opinions, PersonaLity, and Emotions in Social media >Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality
【24h】

Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

机译:匹配理论和数据与个人信息:意大利语youtube评论的语料库揭示了个性

获取原文

摘要

As a contribution to personality detection in languages other than English, we rely on distant supervision to create Personal-ITY, a novel corpus of YouTube comments in Italian, where authors are labelled with personality traits. The traits are derived from one of the mainstream personality theories in psychology research, named MBT1. Using personality prediction experiments, we (ⅰ) study the task of personality prediction in itself on our corpus as well as on TwiSty, a Twitter dataset also annotated with MBTI labels; (ⅱ) carry out an extensive, in-depth analysis of the features used by the classifier, and view them specifically under the light of the original theory that we used to create the corpus in the first place. We observe that no single model is best at personality detection, and that while some traits are easier than others to detect, and also to match back to theory, for other, less frequent traits the picture is much more blurred.
机译:作为英语以外的语言的人格检测的贡献,我们依靠遥远的监督来创建个人信息,这是意大利人的youtube评论的新语料库,其中作者用个性特征标记。 该特征来自于心理学研究中的主流人格理论之一,名为MBT1。 使用个性预测实验,我们(Ⅰ)研究人格预测本身在我们的语料库以及Twisty上的任务,Twitter数据集也用MBTI标签注释; (Ⅱ)对分类器使用的特征进行广泛,深入的分析,并在原始理论的光线下专门对其进行观察,以便首先创建语料库。 我们观察到任何单一模型都是最好的人格检测,而一些特征比其他特征比其他特征更容易检测,而且还要匹配返回理论,对于其他,较少的频繁的特征图像更加模糊。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号