首页> 外文会议>International Conference on Language Resources and Evaluation >RKorAPClient: An R Package for Accessing the German Reference Corpus DEREKO via KorAP
【24h】

RKorAPClient: An R Package for Accessing the German Reference Corpus DEREKO via KorAP

机译:rkorapclient:通过korap访问德语参考语料库德里科的R包

获取原文

摘要

Making corpora accessible and usable for linguistic research is a huge challenge in view of (too) big data, legal issues and a rapidly evolving methodology. This does not only affect the design of user-friendly graphical interfaces to corpus analysis tools, but also the availability of programming interfaces supporting access to the functionality of these tools from various analysis and development environments. RKorAPClient is a new research tool in the form of an R package that interacts with the Web API of the corpus analysis platform KorAP, which provides access to large annotated corpora, including the German reference corpus DEREKO with 45 billion tokens. In addition to optionally authenticated KorAP API access, RKorAPClient provides further processing and visualization features to simplify common corpus analysis tasks. This paper introduces the basic functionality of RKorAPClient and exemplifies various analysis tasks based on DEREKO, that are bundled within the R package and can serve as a basic framework for advanced analysis and visualization approaches.
机译:使Corpora可访问和可用于语言研究是一个巨大的挑战,鉴于(TOO)的大数据,法律问题和快速发展的方法论是一项巨大的挑战。这不仅影响到语料库分析工具的用户友好的图形接口设计,还影响支持访问这些工具功能的编程接口,从各种分析和开发环境中介绍。 Rkorapclient是一种新的研究工具,其形式是一个R包的形式,它与语料库分析平台korap的Web API交互,这提供了对大型注释的语料库的访问,包括德国参考语料库德涅克德涅克,具有450亿令牌。除了可选的经过身份验证的Korap API Access之外,RkorApClient还提供了进一步的处理和可视化功能,以简化常用的语料库分析任务。本文介绍了RkorApClient的基本功能,并列举了基于德里卡的各种分析任务,该任务在R包内捆绑在一起,可以作为高级分析和可视化方法的基本框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号