Grapheme-to-Phoneme Models for (Almost) Any Language

机译：几乎所有语言的音素到音素模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Grapheme-to-phoneme (g2p) models are rarely available in low-resource languages, as the creation of training and evaluation data is expensive and time-consuming. We use Wiktionary to obtain more than 650k word-pronunciation pairs in more than 500 languages. We then develop phoneme and language distance metrics based on phonological and linguistic knowledge; applying those, we adapt g2p models for high-resource languages to create models for related low-resource languages. We provide results for models for 229 adapted languages.

机译：音素到音素（g2p）模型很少以低资源语言提供，因为创建训练和评估数据既昂贵又费时。我们使用Wiktionary来获取500多种语言中的650k以上的单词发音对。然后，我们根据语音和语言知识开发音素和语言距离度量;应用这些，我们将g2p模型用于高资源语言，以创建用于相关低资源语言的模型。我们提供了229种适应语言的模型结果。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2016年|399-408|共10页
会议地点
作者
Aliya Deri; Kevin Knight;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy [J] . Bellegarda JR Speech Communication . 2005,第2期

机译：通过潜在类比进行无监督，语言无关的音素到音素转换
2. Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework [J] . JOSEF ROBERT NOVAK, NOBUAKI MINEMATSU, KEIKICHI HIROSE Natural language engineering . 2016,第pta6期

机译：Phonetisaurus：使用WFST框架中的联合n-gram模型探索音素到音素的转换
3. Arabic grapheme-to-phoneme conversion based on joint multi-gram model [J] . El-Hadi Cherifi, Mhania Guerti International journal of speech technology . 2021,第1期

机译：基于联合多克模型的阿拉伯语图形到 - 音素转换
4. Grapheme-to-phoneme model generation for Indo-European languages [C] . Schlippe Tim IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP . 2012

机译：印欧语言的音素到音素模型生成
5. Grapheme-to-Phoneme Mapping in L2 and L3: Lexical and Sublexical Processing in Reading Aloud. [D] . Andino, Emily Alicia. 2016

机译：L2和L3中的音素到音素映射：朗读中的词法和子词法处理。
6. Enhancing African low-resource languages: Swahili data for language modelling [O] . Casper S. Shikali, Refuoe Mokhosi 2020

机译：增强非洲低资源语言：语言建模的斯瓦希里语数据
7. Multimodal, Multilingual Grapheme-to-Phoneme Conversion for Low-Resource Languages [O] . James Route, Steven Hillis, Isak Czeresnia Etinger, 2019

机译：用于低资源语言的多模式，多语言石墨对 - 音素转换

Grapheme-to-Phoneme Models for (Almost) Any Language

摘要

著录项

相似文献

相关主题

期刊订阅