Venue: SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology

One Model to Pronounce Them All: Multilingual Grapheme-to-Phoneme Conversion With a Transformer Ensemble



Abstract

The task of grapheme-to-phoneme (G2P) conversion is important for both speech recognition and synthesis. As with other speech and language processing tasks, learning G2P models is challenging when only small training datasets are available. We describe a simple approach that exploits model ensembles, based on multilingual Transformers and self-training, to develop a highly effective G2P solution for 15 languages. Our models were developed as part of our participation in SIGMORPHON 2020 Shared Task 1, which focused on G2P. Our best models achieve a 14.99 word error rate (WER) and a 3.30 phoneme error rate (PER), a sizeable improvement over the shared task's competitive baselines.
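For illustration only (this is not the paper's actual decoding procedure, and `ensemble_vote` is a hypothetical helper), the simplest way to combine an ensemble's outputs for a word is a majority vote over the phoneme sequences proposed by each member:

```python
from collections import Counter

def ensemble_vote(predictions):
    """Pick the phoneme sequence proposed by the most ensemble members.

    predictions: one candidate phoneme sequence (as a tuple) per model.
    Ties are broken in favor of the sequence that appears first in the list.
    """
    counts = Counter(predictions)
    return max(counts, key=lambda seq: (counts[seq], -predictions.index(seq)))

# Three models transcribe "cat"; two agree on /k æ t/, so it wins the vote.
print(ensemble_vote([("k", "æ", "t"), ("k", "æ", "t"), ("k", "a", "t")]))
```

In practice, ensembles over sequence models often combine per-step output distributions rather than voting on whole hypotheses, but hypothesis-level voting is a reasonable sketch of the idea.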
