首页> 外文会议>International conference on language resources and evaluation >The Language Archive - a new hub for language resources
【24h】

The Language Archive - a new hub for language resources

机译:语言档案馆-语言资源的新中心

获取原文

摘要

This contribution presents "The Language Archive" (TLA), a new unit at the MPI for Psycholinguistics, discussing the current developments in management of scientific data, considering the need for new data research infrastructures. Although several initiatives worldwide in the realm of language resources aim at the integration, preservation and mobilization of research data, the state of such scientific data is still often problematic. Data are often not well organized and archived and not described by metadata - even unique data such as field-work observational data on endangered languages is still mostly on perishable carriers. New data centres are needed that provide trusted, quality-reviewed, persistent services and suitable tools and that take legal and ethical issues seriously. The CLARIN initiative has established criteria for suitable centres. TLA is in a good position to be one of such centres. It is based on three essential pillars: (1) A data archive; (2) management, access and annotation tools; (3) archiving and software expertise for collaborative projects. The archive hosts mostly observational data on small languages worldwide and language acquisition data, but also data resulting from experiments.
机译:本文稿介绍了“语言档案库”(TLA),这是MPI心理语言学的新部门,讨论了对科学数据管理的最新发展,并考虑了对新数据研究基础架构的需求。尽管世界范围内语言资源领域的一些举措都旨在整合,保存和调动研究数据,但此类科学数据的状态仍然经常有问题。数据通常没有很好的组织和归档,也没有用元数据来描述-甚至诸如濒危语言的实地观察数据之类的独特数据仍大多处于易腐的载体上。需要新的数据中心,以提供可信赖的,经过质量审查的持久服务和合适的工具,并认真对待法律和道德问题。 CLARIN倡议为合适的中心建立了标准。 TLA处于成为此类中心之一的良好位置。它基于三个基本支柱:(1)数据存档; (2)管理,访问和注释工具; (3)协作项目的归档和软件专业知识。该档案馆主要存储有关全球小型语言的观测数据和语言习得数据,还包含实验所得的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号