首页> 外文会议>Workshop on Domain Adaptation for NLP >Genres, Parsers, and BERT: The Interaction Between Parsers and BERT Models in Cross-Genre Constituency Parsing in English and Swedish
【24h】

Genres, Parsers, and BERT: The Interaction Between Parsers and BERT Models in Cross-Genre Constituency Parsing in English and Swedish

机译:流派,解析剂和伯特:在英语和瑞典语中交叉类型选区中的解析器和伯特模型之间的相互作用

获取原文

摘要

Genre and domain are often used interchangeably, but are two different properties of a text. Successful parser adaptation requires both cross-domain and cross-genre sensitivity (Rehbein and Bildhauer, 2017). While the impact domain differences have on parser performance degradation is more easily measurable in respect to lexical differences, impact of genre differences can be more nu-anced. With the predominance of pre-trained language models (LMs; e.g. BERT (Devlin et al., 2019)), there are now additional complexities in developing cross-genre sensitive models due to the infusion of linguistic characteristics derived from, usually, a third genre. We perform a systematic set of experiments using two neural constituency parsers to examine how different parsers behave in combination with different BERT models with varying source and target genres in English and Swedish. We find that there is extensive difficulty in predicting the best source due to the complex interactions between genres, parsers, and LMs. Additionally, the influence of the data used to derive the underlying BERT model heavily inlluences how best to create more robust and effective cross-genre parsing models.
机译:类型和域通常可互换使用,但是文本的两个不同的属性。成功的解析器适应需要跨域和交叉类型敏感度(rehbein和bildhauer,2017)。虽然对解析器性能下降的影响域差异更容易可测量,但在词汇差异方面更容易衡量,而流派差异的影响可能会更加nu-and。具有预先训练的语言模型的优势(LMS;例如BERT(Devlin等,2019)),现在由于衍生自源自的语言特征而发展交叉类型敏感模型,现在存在额外的复杂性。通常是第三个类型。我们使用两个神经组件解析器进行系统的一组实验,以检查不同的解析者如何与不同的伯特模型组合,具有不同的源和瑞典语的不同来源和靶系列。我们发现由于类型,解析器和LMS之间的复杂相互作用,在预测最佳来源方面存在广泛的困难。此外,用于导出底层BERT模型的数据的影响重大Inlluences如何最好地创建更强大和有效的交叉类型解析模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号