首页> 美国卫生研究院文献>PLoS Clinical Trials >Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library
【2h】

Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library

机译:HathiTrust数字图书馆中跨学科研究的多层计算方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philosophy of scientific work on animal mind and cognition. The methods can be generalized to other research areas and ultimately support a system for semi-automatic identification of argument structures. We provide a case study for the application of the methods to the problem of identifying and extracting arguments about anthropomorphism during a critical period in the development of comparative psychology. We show how a combination of classification systems and mixed-membership models trained over large digital libraries can inform resource discovery in this domain. Through a novel approach of “drill-down” topic modeling—simultaneously reducing both the size of the corpus and the unit of analysis—we are able to reduce a large collection of fulltext volumes to a much smaller set of pages within six focal volumes containing arguments of interest to historians and philosophers of comparative psychology. The volumes identified in this way did not appear among the first ten results of the keyword search in the HathiTrust digital library and the pages bear the kind of “close reading” needed to generate original interpretations that is the heart of scholarly work in the humanities. Zooming back out, we provide a way to place the books onto a map of science originally constructed from very different data and for different purposes. The multilevel approach advances understanding of the intellectual and societal contexts in which writings are interpreted.
机译:我们展示了使用传统分类系统和混合成员主题模型的组合进行的面搜索如何能够超越关键字搜索来为跨学科研究提供资源发现,假设表述和论点提取的信息。我们的测试领域是动物思想和认知方面的科学工作的历史和哲学。这些方法可以推广到其他研究领域,并最终支持用于论点结构的半自动识别的系统。我们提供了一个案例研究,说明了该方法在比较心理学发展的关键时期识别和提取有关拟人化论点的问题中的应用。我们展示了通过大型数字图书馆训练的分类系统和混合成员模型的组合如何在此领域中发现资源。通过一种新颖的“向下钻取”主题建模方法-同时减小了语料库的大小和分析单位的大小-我们能够将大量的全文本减少到六个焦点册中包含更少内容的一组页面,其中比较心理学的历史学家和哲学家感兴趣的论点。在HathiTrust数字图书馆中,以这种方式确定的书卷未出现在关键字搜索的前十个结果中,并且页面带有生成原始解释所需的“近距离阅读”,这是人文学科学术工作的核心。缩小范围,我们提供了一种方法,可以将书籍放置在最初由非常不同的数据和不同目的构成的科学地图上。多层方法可以增进对解释著作的思想和社会背景的理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号