首页> 外文会议>Annual meeting of the Association for Computational Linguistics >An Open Source Toolkit for Quantitative Historical Linguistics
【24h】

An Open Source Toolkit for Quantitative Historical Linguistics

机译:定量历史语言学的开源工具包

获取原文

摘要

Given the increasing interest and development of computational and quantitative methods in historical linguistics, it is important that scholars have a basis for documenting, testing, evaluating, and sharing complex workflows. We present a novel open-source toolkit for quantitative tasks in historical linguistics that offers these features. This toolkit also serves as an interface between existing software packages and frequently used data formats, and it provides implementations of new and existing algorithms within a homogeneous framework. We illustrate the toolkit's functionality with an exemplary workflow that starts with raw language data and ends with automatically calculated phonetic alignments, cognates and borrowings. We then illustrate evaluation metrics on gold standard datasets that are provided with the toolkit.
机译:鉴于历史语言学对计算和定量方法的兴趣与日俱增,因此重要的是,学者们必须具有记录,测试,评估和共享复杂工作流的基础。我们提出了一种新颖的开源工具包,用于历史语言学中的定量任务,提供了这些功能。该工具包还可以用作现有软件包和常用数据格式之间的接口,并且可以在同类框架内提供新算法和现有算法的实现。我们以示例性工作流程为例来说明该工具包的功能,该工作流程以原始语言数据开始,以自动计算的语音对齐方式,同音字和借位结尾。然后,我们说明了随工具包提供的黄金标准数据集的评估指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号