【24h】

Life still goes on: Analysing Australian WW1 Diaries through Distant Reading

机译:生活仍然持续下去:通过遥远的阅读分析澳大利亚WW1日记

获取原文

摘要

An increasing amount of historic data is now available in digital (text) formats. This gives quantitative researchers an opportunity to use distant reading techniques, as opposed to traditional close reading, in order to analyse larger quantities of historic data. Distant reading allows researchers to view overall patterns within the data and reduce researcher bias. One such data set that has recently been transcribed is a collection of over 500 Australian World War I (WW1) diaries held by the State Library of New South Wales. Here we apply distant reading techniques to this corpus to understand what soldiers wrote about and how they felt over the course of the war. Extracting dates accurately is important as it allows us to perform our analysis over time, however, it is very challenging due to the variety of date formats and abbreviations diarists use. But with that data, topic modelling and sentiment analysis can then be applied to show trends, for instance, that despite the horrors of war, Australians in WW1 primarily wrote about their everyday routines and experiences. Our results detail some of the challenges likely to be encountered by quantitative researchers intending to analyse historical texts, and provide some approaches to these issues.
机译:现在可以使用数字(文本)格式的越来越多的历史数据。这使定量研究人员有机会使用远处阅读技术,而不是传统的密切阅读,以分析更大的历史数据。遥远的阅读允许研究人员在数据中查看整体模式并减少研究人员偏见。最近被转录的一种数据集是由新南威尔士州的国家图书馆持有的超过500名澳大利亚第一次世界大战的一系列(WW1)日记。在这里,我们将遥远的阅读技巧应用于此语料库,以了解士兵的写作以及如何在战争过程中感受到的方式。准确提取日期非常重要,因为它允许我们随着时间的推移进行分析,因此由于各种日期格式和缩写术语使用,这是非常具有挑战性的。但是,通过该数据,可以应用主题建模和情感分析来显示趋势,例如,尽管战争的恐怖,但WW1的澳大利亚人主要是关于他们日常生活和经验。我们的结果详细介绍了有意分析历史文本的量化研究人员可能会遇到一些挑战,并为这些问题提供一些方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号