首页> 外文会议>Workshop on NLP for similar languages, varieties and dialects >How Many Languages Can a Language Model Model? (invited talk)
【24h】

How Many Languages Can a Language Model Model? (invited talk)

机译:语言模型可以模拟几种语言? (特邀演讲)

获取原文

摘要

One of the purposes of the VarDial workshop series is to encourage research into NLP methods that treat human languages as a continuum, by designing models that exploit the similarities between languages and variants. In my work, I am using a continuous vector representation of languages that allows modeling and exploring the language continuum in a very direct way. The basic tool for this is a character-based recurrent neural network language model conditioned on language vectors whose values are learned during training. By feeding the model Bible translations in a thousand languages, not only does the learned vector space capture language similarity, but by interpolating between the learned vectors it is possible to generate text in unattested intermediate forms between the training languages.
机译:VarDial研讨会系列的目的之一是通过设计利用语言和变体之间相似性的模型来鼓励研究将人类语言视为连续体的NLP方法。在我的工作中,我使用的是语言的连续向量表示形式,它允许以非常直接的方式建模和探索语言连续体。为此的基本工具是一个基于字符的递归神经网络语言模型,其条件是在训练过程中学习其值的语言向量。通过以一千种语言提供示范圣经翻译,不仅学习的向量空间捕获了语言的相似性,而且通过在学习的向量之间进行插值,有可能生成训练语言之间未经验证的中间形式的文本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号