Modeling intra-textual variation with entropy and surprisal: topical vs. stylistic patterns

机译：用熵和惊奇来建模文本内变化：主题模式与风格模式

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a data-driven approach to investigate intra-textual variation by combining entropy and surprisal. With this approach we detect linguistic variation based on phrasal lexico-grammatical patterns across sections of research articles. Entropy is used to detect patterns typical of specific sections. Surprisal is used to differentiate between more and less informationally-loaded patterns as well as types of information (topical vs. stylistic). While we here focus on research articles in biology/genetics, the methodology is especially interesting for digital humanities scholars, as it can be applied to any text type or domain and combined with additional variables (e.g. time, author or social group) to obtain insights on intra-textual variation.

机译：我们提出了一种数据驱动的方法，通过结合熵和惊奇来研究文本内变异。通过这种方法，我们可以跨研究文章的各个部分，根据短语词典语法模式检测语言变异。熵用于检测特定部分的典型模式。 Surprusal用于区分越来越多的信息加载模式以及信息类型（主题与风格）。虽然我们在这里专注于生物学/遗传学方面的研究文章，但是该方法对于数字人文学科的学者特别感兴趣，因为它可以应用于任何文本类型或领域，并可以与其他变量（例如时间，作者或社会团体）结合使用以获得见解文字内变化。

著录项

来源
《Joint SIGHUM workshop on computational linguistics for cultural heritage, social sciences, humanities and literature 2017》|2017年|68-77|共10页
会议地点 Vancouver(CA)
作者
Stefania Degaetano-Ortlieb; Elke Teich;
展开▼
作者单位

Saarland University Campus A2.2 66123 Saarbriicken;

Saarland University Campus A2.2 66123 Saarbriicken;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Conservative Models: Parametric Entropy vs. Temporal Entropy in Outcomes [J] . Lumeng Huang, Robert W. Ritzi Jr., Ramya Ramanathan Ground water . 2012,第2期

机译：保守模型：结果中的参数熵与时间熵
2. Variations in patterns of concurrent androgen deprivation therapy use based on dose escalation with external beam radiotherapy vs. brachytherapy boost for prostate cancer [J] . Mohiuddin Jahan J., Narayan Vivek, Venigalla Sriram, Brachytherapy . 2019,第3期

机译：基于剂量升级与外梁放射疗法对近辐射治疗前列腺癌的近距离放射治疗的变异
3. Variation of dendritic cells distribution patterns inmycosis fungoides vs. inflammatory dermatosis [J] . Costras C., Constantin C., Nichita L., Virchows Archiv: an international journal of pathology . 2018,第Suppla1期

机译：树突细胞分布模式的变异嵌入诱发诱发皮肤病
4. Modeling intra-textual variation with entropy and surprisal: topical vs. stylistic patterns [C] . Stefania Degaetano-Ortlieb, Elke Teich Annual meeting of the Association for Computational Linguistics . 2017

机译：用熵和惊喜模拟文本内变异：主题与风格模式
5. Integrating Camera Trap Data and Occupancy Modeling to Estimate Seasonal Variations in Occurrence, Detection, and Activity Patterns of Mesocarnivores in Southcentral Oklahoma [D] . Premathilake, E.M. Dineesha. 2018

机译：集成相机陷阱数据和居住模型以估计俄克拉荷马州中南部食肉动物的发生，检测和活动模式的季节性变化
6. From the Cover: Quantitative patterns of stylistic influence in the evolution of literature [O] . James M. Hughes, Nicholas J. Foti, David C. Krakauer, 2012

机译：从封面：文学演变中的风格影响定量模式
7. Modeling intra-textual variation with entropy and surprisal: topical vs. stylistic patterns [O] . Stefania Degaetano-Ortlieb, Elke Teich 2017

机译：用熵和惊喜模拟文本内变异：主题与风格模式

Modeling intra-textual variation with entropy and surprisal: topical vs. stylistic patterns

摘要

著录项

相似文献

相关主题

期刊订阅