【24h】

Computational analysis to explore authors' depiction of characters

机译:计算分析探索作者描绘的人物

获取原文

摘要

This study involves automatically identifying the sociolinguistic characteristics of fictional characters in plays by analyzing their written "speech". We discuss three binary classification problems: predicting the characters' gender (male vs. female), age (young vs. old), and socio-economic standing (upper-middle class vs. lower class). The text corpus used is an annotated collection of August Strind-berg and Henrik Ibsen plays, translated into English, which are in the public domain. These playwrights were chosen for their known attention to relevant socio-economic issues in their work. Linguistic and textual cues are extracted from the characters' lines (turns) for modeling purposes. We report on the dataset as well as the performance and important features when predicting each of the sociolinguistic characteristics, comparing intra- and inter-author testing.
机译:本研究涉及通过分析他们的书面“语音”自动识别戏剧中的虚构人物的社会语言学特征。 我们讨论了三个二进制分类问题:预测人物的性别(男性与女性与女性),年龄(年轻与旧的)和社会经济地位(中产阶级与下层阶级)。 所用的文本语料库是八月中博格和Henrik Ibsen的注释集合,翻译成英文,在公共领域。 这些剧作家被选为他们对其工作相关的社会经济问题的关注。 语言和文本线索从字符的线路(转弯)中提取,以进行建模目的。 我们在预测每个社会语言语言特征时报告数据集以及性能和重要特征,比较作者间测试和互联测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号