【24h】

Learning Diachronic Analogies to Analyze Concept Change

机译:学习历史模糊类比分析概念变化

获取原文

摘要

We propose to study the evolution of concepts by learning to complete diachronic analogies between lists of terms which relate to the same concept at different points in time. We present a number of models based on operations on word embedddings that correspond to different assumptions about the characteristics of diachronic analogies and change in concept vocabularies. These are tested in a quantitative evaluation for nine different concepts on a corpus of Dutch newspapers from the 1950s and 1980s. We show that a model which treats the concept terms as analogous and learns weights to compensate for diachronic changes (weighted linear combination) is able to more accurately predict the missing term than a learned transformation and two baselines for most of the evaluated concepts. We also find that all models tend to be coherent in relation to the represented concept, but less discriminative in regard to other concepts. Additionally, we evaluate the effect of aligning the time-specific embedding spaces using orthogonal Procrustes, finding varying effects on performance, depending on the model, concept and evaluation metric. For the weighted linear combination, however, results improve with alignment in a majority of cases. All related code is released publicly.
机译:我们建议研究概念的演变,通过学习完成与在不同点的相同概念相关的术语之间完成历史记录。我们提出了许多基于Word Embedddings的操作的模型,所述操作对应于关于关于探讨探讨类别的特征的不同假设和概念词汇表的变化。这些在20世纪50年代和20世纪80年代的荷兰报纸的九种不同概念的定量评估中进行了测试。我们表明,一种将概念术语视为类似的概念术语的模型,并学习权重以补偿历时的变化(加权线性组合)能够更准确地预测缺失的术语,而是用于大多数评估概念的来自学习的转换和两个基线。我们还发现,所有型号都往往与代表的概念相干,但在其他概念方面的歧视较小。此外,我们评估了使用正交汇总对准时间特定的嵌入空间的效果,从而为性能产生不同影响,具体取决于模型,概念和评估度量。然而,对于加权线性组合,结果改善了大多数情况下的对准。所有相关代码都公开发布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号