首页> 外文会议>Conference on empirical methods in natural language processing >Outta Control: Laws of Semantic Change and Inherent Biases in Word Representation Models
【24h】

Outta Control: Laws of Semantic Change and Inherent Biases in Word Representation Models

机译:失控:单词表示模型中的语义变化规律和固有偏见

获取原文

摘要

This article evaluates three proposed laws of semantic change. Our claim is that in order to validate a putative law of semantic change, the effect should be observed in the genuine condition but absent or reduced in a suitably matched control condition, in which no change can possibly have taken place. Our analysis shows that the effects reported in recent literature must be substantially revised. (i) the proposed negative correlation between meaning change and word frequency is shown to be largely an artefact of the models of word representation used; (ii) the proposed negative correlation between meaning change and pro-totypicality is shown to be much weaker than what has been claimed in prior art; and (iii) the proposed positive correlation between meaning change and polysemy is largely an artefact of word frequency. These empirical observations are corroborated by analytical proofs that show that count representations introduce an inherent dependence on word frequency, and thus word frequency cannot be evaluated as an independent factor with these representations.
机译:本文评估了三种提议的语义变化定律。我们的主张是,为了验证推定的语义变化定律,应在真实条件下观察效果,​​而在适当匹配的控制条件下则不存在或降低效果,在这种条件下可能不会发生任何变化。我们的分析表明,必须对近期文献中报道的影响进行实质性修订。 (i)拟议的意义变化和词频之间的负相关关系在很大程度上显示出是所使用的词表示模型的假象; (ii)与现有技术相比,拟议的意义变化与原型之间的负相关性被证明弱得多; (iii)所提出的含义变化与多义性之间的正相关主要是词频的人工产物。这些经验性观察得到分析性证明的证实,这些证明表明,计数表示引入了对词频的固有依赖性,因此,不能用这些表示将词频作为独立因素进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号