

Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation



Abstract

In a recent work, we proposed an acoustic data-driven grapheme-to-phoneme (G2P) conversion approach, where the probabilistic relationship between graphemes and phonemes learned through acoustic data is used along with the orthographic transcription of words to infer the phoneme sequence. In this paper, we extend our studies to the under-resourced lexicon development problem. More precisely, given a small amount of transcribed speech data consisting of a few words along with its pronunciation lexicon, the goal is to build a pronunciation lexicon for unseen words. In this framework, we compare our G2P approach with the standard letter-to-sound (L2S) rule-based conversion approach. We evaluated the generated lexicons on the PhoneBook 600-word task in terms of pronunciation errors and ASR performance. The G2P approach yields a best ASR performance of 14.0% word error rate (WER), while the L2S approach yields a best ASR performance of 13.7% WER. A combination of the G2P and L2S approaches yields a best ASR performance of 9.3% WER.
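The three WER figures in the abstract can be put on a common footing by computing the relative WER reduction of the combined system over each individual approach. The sketch below is only illustrative arithmetic on the numbers quoted in the abstract; the function name `relative_reduction` and the dictionary keys are hypothetical and do not come from the paper.

```python
def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction (in percent) of new_wer with respect to baseline_wer."""
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Best WERs (in percent) as quoted in the abstract; the keys are illustrative labels.
wer = {"G2P": 14.0, "L2S": 13.7, "G2P+L2S": 9.3}

for system in ("G2P", "L2S"):
    gain = relative_reduction(wer[system], wer["G2P+L2S"])
    print(f"{system} -> G2P+L2S: {gain:.1f}% relative WER reduction")
```

Under these figures, the combination corresponds to roughly a one-third relative reduction in WER compared with either approach on its own.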


Bibliographic details

Source
INTERSPEECH 2012 | 2012 | 4 pages
Authors
Ramya Rasipuram; Mathew Magimai Doss
Original format: PDF
Chinese Library Classification: 73.4136083
Keywords
Kullback-Leibler divergence based HMM; lexicon; grapheme; phoneme; grapheme-to-phoneme converter; letter-to-sound rules; multilayer perceptron

Date indexed: 2022-08-20 22:09:19

Similar literature

1. Combining rough sets and data-driven fuzzy learning for generation of classification rules [J]. Shen Qiang, Chouchoulas Alexios. Pattern Recognition: The Journal of the Pattern Recognition Society. 1999, No. 12
2. Rough set-based rule generation and Apriori-based rule generation from table data sets II: SQL-based environment for rule generation and decision support [J]. Hiroshi Sakai, Zhiwen Jian. CAAI Transactions on Intelligence Technology. 2019, No. 4
3. Data driven business rule generation based on fog computing [J]. Yifei Zhang, Hongming Cai, Boyi Xu. Future Generation Computer Systems. 2018, No. DECa
4. Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation [C]. Ramya Rasipuram, Mathew Magimai Doss. Annual Conference of the International Speech Communication Association. 2012
5. Integrating knowledge-driven and data-driven approaches in the derivation of clinical prediction rules [D]. Kwiatkowska, Bogumila. 2006
6. Tunable pulsatile chemical gradient generation via acoustically driven oscillating bubbles [O]. Daniel Ahmed, Chung Yu Chan, Sz-Chin Steven Lin
7. Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework [O]. Zhang, Xiaohui; Manohar, Vimal; Povey, Daniel. 2017
