【24h】

Analyzing Bayesian Crosslingual Transfer in Topic Models

机译:分析主题模型中的贝叶斯奇数转移

获取原文

摘要

We introduce a theoretical analysis of crosslin-gual transfer in probabilistic topic models. By formulating posterior inference through Gibbs sampling as a process of language transfer, we propose a new measure that quantifies the loss of knowledge across languages during this process. This measure enables us to derive a PAC-Bayesian bound that elucidates the factors affecting model quality, both during training and in downstream applications. We provide experimental validation of the analysis on a diverse set of five languages, and discuss best practices for data collection and model design based on our analysis.
机译:我们介绍了概率主题模型中的横梁 - 仪式转移的理论分析。通过通过GIBBS抽样制定后部推理作为语言转移的过程,我们提出了一种新的度量,这些措施在此过程中量化了跨语言的知识丧失。这项措施使我们能够推导出一种PAC-Bayesian的约束,阐明在训练和下游应用期间影响模型质量的因素。我们提供了对多种五种语言的分析的实验验证,并根据我们分析讨论数据收集和模型设计的最佳实践。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号