Workshop on Resources for African Indigenous Languages; Language Resources and Evaluation Conference

Mobilizing Metadata: Open Data Kit (ODK) for Language Resource Development in East Africa

Abstract

Linguistic fieldworkers collect and archive metadata as part of the language resources (LRs) that they create, but they often work in resource-constrained environments that prevent them from using computers for data entry. In such situations, linguists must complete time-consuming and error-prone digitization tasks that limit the quantity and quality of the resources and metadata that they produce (Thieberger & Berez 2012; Margetts & Margetts 2012). This paper describes a method for entering linguistic metadata into mobile devices using the Open Data Kit (ODK) platform, a suite of open source tools designed for mobile data collection. The method was incorporated into two community-based language documentation projects in Tanzania, involving twelve researchers simultaneously collecting data in four administrative regions (Griscom & Harvey 2019). Through the identification of project-specific data dependencies and redundancies, a number of efficiencies were built into the metadata entry system. These include the use of closed vocabularies, unique data entry forms for distinct data collector categories, and separate forms for entering participant and resource metadata. The resulting system serves as the basis for the ongoing development of general purpose bilingual English-Swahili metadata entry tools, to be made available for use by other researchers working in East Africa.
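Two of the efficiencies named above, closed vocabularies and separate forms for participant and resource metadata, can be illustrated with a minimal sketch. This is plain Python, not an ODK form definition and not the authors' implementation. Every name and vocabulary value in it (`CLOSED_VOCABULARIES`, `Participant`, `Resource`, `register_resource`, the listed categories and resource types) is a hypothetical assumption, since the abstract does not give the projects' actual fields.

```python
from dataclasses import dataclass

# Hypothetical closed vocabularies: the abstract does not list the real values.
CLOSED_VOCABULARIES = {
    "collector_category": {"linguist", "community_researcher"},
    "resource_type": {"audio", "video", "text"},
}


@dataclass(frozen=True)
class Participant:
    """Participant metadata, entered once on its own form."""
    participant_id: str
    name: str


@dataclass(frozen=True)
class Resource:
    """Resource metadata, which refers to participants by ID only."""
    resource_id: str
    resource_type: str
    participant_ids: tuple


def check_vocabulary(field_name, value):
    """Reject any value that is not in the closed vocabulary for the field."""
    allowed = CLOSED_VOCABULARIES[field_name]
    if value not in allowed:
        raise ValueError(
            f"{value!r} is not a valid {field_name}; expected one of {sorted(allowed)}"
        )
    return value


def register_resource(resource, participants):
    """Validate a resource record against the vocabularies and known participants."""
    check_vocabulary("resource_type", resource.resource_type)
    missing = [pid for pid in resource.participant_ids if pid not in participants]
    if missing:
        raise ValueError(f"unknown participant IDs: {missing}")
    return resource
```

Under these assumptions, a participant's details are entered once and each later resource record only cites the participant ID, which removes one source of redundant typing. A value outside the closed vocabulary is rejected at entry time instead of being corrected during later digitization.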