首页> 外文会议>Workshop on Knowledge Extraction and Integration for Deep Learning Architectures >Low Anisotropy Sense Retrofitting (LASeR) : Towards Isotropic and Sense Enriched Representations
【24h】

Low Anisotropy Sense Retrofitting (LASeR) : Towards Isotropic and Sense Enriched Representations

机译:低各向异性感官改造(激光):朝向各向同性和感官丰富的表达

获取原文

摘要

Contextual word representation models have shown massive improvements on a multitude of NLP tasks, yet their word sense disambiguation capabilities remain poorly explained. To address this gap, we assess whether contextual word representations extracted from deep pretrained language models create distinguishable representations for different senses of a given word. We analyze the representation geometry and find that most layers of deep pretrained language models create highly anisotropic representations, pointing towards the existence of representation degeneration problem in contextual word representations. After accounting for anisotropy. our study further reveals that there is variability in sense learning capabililies across different language models. Finally, we propose LASeR, a 'Low Anisotropy Sense Retrofitting' approach that renders off-the-shelf representations isotropic and semantically more meaningful, resolving the representation degeneration problem as a post-processing step, and conducting sense-enrichment of contextualized representations extracted from deep neural language models.
机译:上下文词表示模型在许多NLP任务上都有了巨大的改进,但它们的词义消歧能力仍然没有得到很好的解释。为了弥补这一差距,我们评估了从深度预训练语言模型中提取的语境词表征是否能为给定词的不同意义创建可区分的表征。我们分析了表示几何,发现大多数深层预训练语言模型都会产生高度各向异性的表示,这表明上下文词表示中存在表示退化问题。在考虑各向异性之后。我们的研究进一步揭示,在不同的语言模式中,感官学习能力存在差异。最后,我们提出了LASeR,这是一种“低各向异性意义改造”方法,使现成的表示具有各向同性,语义上更有意义,将表示退化问题作为后处理步骤加以解决,并对从深层神经语言模型中提取的语境化表示进行意义丰富。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号