首页> 外文会议>Workshop on Domain Adaptation for NLP >Dependency Parsing Evaluation for Low-resource Spontaneous Speech
【24h】

Dependency Parsing Evaluation for Low-resource Spontaneous Speech

机译:低资源自发性语音的依赖解析评估

获取原文
获取外文期刊封面目录资料

摘要

How well can a state-of-the-art parsing system, developed for the written domain, perform when applied to spontaneous speech data involving different interlocutors? This study addresses this question in a low-resource setting using child-parent conversations from the CHILDES databse. Specifically, we focus on dependency parsing evaluation for utterances of one specific child (18 - 27 months) and her parents. We first present a semi-automatic adaption of the dependency annotation scheme in CHILDES to that of the Universal Dependencies project, an annotation style that is more commonly applied in dependency parsing. Our evaluation demonstrates that an out-of-domain biaffine parser trained only on written texts performs well with parent speech. There is, however, much room for improvement on child utterances, particularly at 18 and 21 months, due to cases of omission and repetition that are prevalent in child speech. By contrast, parsers trained or fine-tuned with in-domain spoken data on a much smaller scale can achieve comparable results for parent speech and improve the weak parsing performance for child speech at these earlier ages.
机译:如何为书面域开发的最先进的解析系统,在应用于涉及不同对话者的自发语音数据时执行?本研究在使用来自Childes Databse中使用子父对话的低资源设置中解决了这个问题。具体而言,我们专注于对一个特定儿童(18 - 27个月)和父母的话语的依赖评价。我们首先在彼此依赖性项目中依赖依赖注释方案的半自动调整,是依赖于解析中更常用的注释样式。我们的评估表明,仅在书面文本上培训的域名双基因解析器对父母言语进行良好。然而,由于儿童言论中普遍存在和重复的情况,儿童话语的改善,特别是在18和21个月内有很多空间。相比之下,培训或微调域名口语数据的解析器可以实现更小的规模,可以实现父言论的可比结果,并在这些早期的年龄提高儿童演讲的弱解析性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号