【24h】

Analyzing Bayesian Crosslingual Transfer in Topic Models

机译:在主题模型中分析贝叶斯跨语言迁移

获取原文

摘要

We introduce a theoretical analysis of crosslin-gual transfer in probabilistic topic models. By formulating posterior inference through Gibbs sampling as a process of language transfer, we propose a new measure that quantifies the loss of knowledge across languages during this process. This measure enables us to derive a PAC-Bayesian bound that elucidates the factors affecting model quality, both during training and in downstream applications. We provide experimental validation of the analysis on a diverse set of five languages, and discuss best practices for data collection and model design based on our analysis.
机译:在概率主题模型中,我们介绍了交叉蛋白-瓜尔转移的理论分析。通过将通过吉布斯采样的后验推理公式化为语言转移的过程,我们提出了一种新的方法,可以量化在此过程中跨语言的知识损失。此度量使我们能够得出PAC-贝叶斯界线,该界线阐明了在训练过程中和在下游应用程序中影响模型质量的因素。我们通过五种语言提供了实验验证性分析,并根据分析结果讨论了数据收集和模型设计的最佳实践。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号