Inequality Maximum Entropy Classifier with Character Features for Polyphone Disambiguation in Mandarin TTS Systems

Abstract

Grapheme-to-phoneme (G2P) conversion is an important component of TTS systems. The main difficulty in Chinese G2P conversion is disambiguating polyphones. In this paper, we formulate polyphone disambiguation as a classification problem and propose a language-independent classifier based on maximum entropy to address it. Furthermore, we introduce inequality smoothing to alleviate data sparseness and exploit language-independent character features as linguistic knowledge. Experimental results show that the character features perform as well as language-dependent features such as words and part-of-speech tags. Compared with the widely used Gaussian smoothing, inequality smoothing greatly reduces the number of active features used in the classifier and achieves better performance. Our classifier achieves an overall accuracy of 96.35%, far above the 81.22% obtained by using the high-frequency "pin-yin" (Romanization of Chinese phonemes). Finally, we explore merging all key polyphones into 6 groups and find that the overall accuracy decreases by only about 2%, while the number of active features is reduced by more than a further 33%.
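The approach summarized in the abstract can be illustrated with a minimal sketch, under several assumptions that are not taken from the paper. The feature template (characters at fixed offsets around the polyphone), the toy sentences for the character 行, the label names, and all function names are hypothetical. The abstract does not give the inequality smoothing formulation, so an L1 penalty optimized by proximal gradient steps is swapped in here as a stand-in that also drives many weights to exactly zero; it should not be read as the authors' method.

```python
import math
from collections import defaultdict

def char_features(sentence, idx, window=2):
    """Characters at relative offsets around position idx (hypothetical template)."""
    feats = []
    for off in range(-window, window + 1):
        if off == 0:
            continue
        j = idx + off
        ch = sentence[j] if 0 <= j < len(sentence) else "<PAD>"
        feats.append(f"c[{off:+d}]={ch}")
    return feats

def predict_proba(w, feats, classes):
    """Maximum entropy (softmax) distribution over pronunciations."""
    scores = {c: sum(w.get((f, c), 0.0) for f in feats) for c in classes}
    m = max(scores.values())  # subtract the max for numerical stability
    exps = {c: math.exp(s - m) for c, s in scores.items()}
    z = sum(exps.values())
    return {c: e / z for c, e in exps.items()}

def predict(w, classes, feats):
    probs = predict_proba(w, feats, classes)
    return max(probs, key=probs.get)

def train_maxent_l1(samples, labels, l1=0.01, lr=0.5, epochs=200):
    """Full-batch proximal gradient ascent on the L1-penalized log-likelihood.

    The L1 penalty is an assumption standing in for inequality smoothing.
    """
    classes = sorted(set(labels))
    w = defaultdict(float)  # (feature, label) -> weight
    n = len(samples)
    for _ in range(epochs):
        grad = defaultdict(float)
        for feats, y in zip(samples, labels):
            probs = predict_proba(w, feats, classes)
            for f in feats:
                for c in classes:
                    # empirical count minus expected count, averaged over samples
                    grad[(f, c)] += ((1.0 if c == y else 0.0) - probs[c]) / n
        for key, g in grad.items():
            v = w[key] + lr * g
            # soft-thresholding: weights with small magnitude become exactly zero
            w[key] = math.copysign(max(abs(v) - lr * l1, 0.0), v)
    active = {k: v for k, v in w.items() if v != 0.0}
    return active, classes

# Toy data for the polyphone 行 (xing2 vs. hang2): (sentence, position, label).
data = [
    ("银行开门", 1, "hang2"),
    ("行走江湖", 0, "xing2"),
    ("他在银行", 3, "hang2"),
    ("不行了", 1, "xing2"),
    ("步行回家", 1, "xing2"),
    ("行业标准", 0, "hang2"),
]
X = [char_features(s, i) for s, i, _ in data]
y = [label for _, _, label in data]
weights, classes = train_maxent_l1(X, y)

for (s, i, label), feats in zip(data, X):
    print(s, label, predict(weights, classes, feats))
print("active features:", len(weights))
```

Only weights that survive the soft-thresholding step are returned, which is one way to count "active features" in the sense the abstract uses; raising the hypothetical `l1` parameter trades accuracy for a smaller model.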


Bibliographic details

Source: 2007, pp. 705-708 (4 pages)
Authors: Xinnian Mao; Yuan Dong; Jinyu Han; Dezhi Huang; Haila Wang
Original format: PDF
Keywords: feature extraction; linguistics; maximum entropy methods; smoothing methods; speech processing; speech synthesis; Chinese phoneme; Gaussian smoothing; Mandarin TTS systems; Romanization; character features; grapheme-to-phoneme conversion; inequality maximum entropy c;


