【24h】

Narrative Schema Stability in News Text

机译:新闻文本中的叙事架构稳定性

获取原文
获取外文期刊封面目录资料

摘要

We investigate the stability of narrative schemas (Chambers & Jurafsky, 2009) automatically induced from a news corpus, representing recurring narratives in a corpus. If such techniques produce meaningful results, we should expect that small changes to the corpus will result in only small changes to the induced schemas. We describe experiments involving successive ablation of a corpus and cross-validation at each stage of ablation, on schemas generated by three different techniques over a general news corpus and topically-specific subcorpora. We also develop a method for evaluating the similarity between sets of narrative schemas, and thus the stability of the schema induction algorithms. This stability analysis affirms the heterogeneous/homogeneous document category hypothesis first presented in Simonson & Davis (2016). whose technique is problematically limited. Additionally, increased ablation leads to increasing stability, so the smaller the remaining corpus, the more stable schema generation appears to be. We surmise that as a corpus grows larger, novel and more varied narratives continue to appear and stability declines, though at some point this decline levels off as new additions to the corpus consist essentially of "more of the same."
机译:我们调查从新闻语料库中自动诱导的叙事模式(Chambers&Jurafsky,2009)的稳定性,代表了一种语料库中的经常性叙述。如果这些技术会产生有意义的结果,我们应该期望对语料库的小变化将导致对诱导模式的较小变化。我们描述了涉及连续消融的实验,在一次消融的每个阶段的次阶段,在一般新闻语料库和局部特定的Subcorpora上由三种不同技术产生的模式。我们还开发了一种评估叙事模式集之间的相似性的方法,从而实现了模式归纳算法的稳定性。这种稳定性分析肯定了Simonson&Davis(2016年)首次介绍的异构/同质文件类别假设。其技术有限有限。另外,增加的消融导致稳定性的增加,因此剩余的骨库越小,似乎越稳定的模式。我们猜测,随着毒品毒品表现大,新颖和更具不同的叙述继续出现,稳定性下降,但在某些时候,这种下降水平随着核心的新增的差异,基本上由“更多相同”。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号