Venue: Annual Meeting of the Association for Computational Linguistics

Grapheme-to-Phoneme Models for (Almost) Any Language

Abstract

Grapheme-to-phoneme (g2p) models are rarely available in low-resource languages, as the creation of training and evaluation data is expensive and time-consuming. We use Wiktionary to obtain more than 650k word-pronunciation pairs in more than 500 languages. We then develop phoneme and language distance metrics based on phonological and linguistic knowledge; applying those, we adapt g2p models for high-resource languages to create models for related low-resource languages. We provide results for models for 229 adapted languages.
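The adaptation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' metric or pipeline. It assumes a toy table of binary articulatory features (`FEATURES`, with hypothetical values), uses a plain Hamming distance as the phoneme distance, and replaces each phoneme produced by a high-resource model with the closest phoneme in a low-resource language's inventory. The function names `phoneme_distance`, `nearest_phoneme`, and `adapt_pronunciation` are illustrative only.

```python
# Toy articulatory features (hypothetical values, for illustration only):
# (voiced, nasal, labial, coronal, dorsal, continuant)
FEATURES = {
    "p": (0, 0, 1, 0, 0, 0),
    "b": (1, 0, 1, 0, 0, 0),
    "t": (0, 0, 0, 1, 0, 0),
    "d": (1, 0, 0, 1, 0, 0),
    "k": (0, 0, 0, 0, 1, 0),
    "g": (1, 0, 0, 0, 1, 0),
    "m": (1, 1, 1, 0, 0, 0),
    "n": (1, 1, 0, 1, 0, 0),
    "s": (0, 0, 0, 1, 0, 1),
    "z": (1, 0, 0, 1, 0, 1),
}


def phoneme_distance(a, b):
    """Hamming distance between the feature vectors of two phonemes."""
    return sum(x != y for x, y in zip(FEATURES[a], FEATURES[b]))


def nearest_phoneme(source, target_inventory):
    """Return the phoneme in the target inventory closest to `source`.

    Sorting the inventory first makes tie-breaking deterministic.
    """
    return min(sorted(target_inventory),
               key=lambda t: phoneme_distance(source, t))


def adapt_pronunciation(phonemes, target_inventory):
    """Map a high-resource model's output onto the target inventory.

    Phonemes already in the inventory are kept; others are replaced
    by their nearest neighbour under `phoneme_distance`.
    """
    return [p if p in target_inventory
            else nearest_phoneme(p, target_inventory)
            for p in phonemes]


if __name__ == "__main__":
    # Hypothetical low-resource inventory with no voiced obstruents.
    inventory = {"p", "t", "k", "s"}
    print(adapt_pronunciation(["b", "z", "d", "g"], inventory))
```

A real system would likely use a richer feature set and a weighted distance, and would also need the language distance metric to choose which high-resource model to adapt; none of that is modelled here.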
