Learning Phrase Break Detection in Thai Text-to-Speech


Abstract

One of the crucial problems in developing high-quality Thai text-to-speech synthesis is detecting phrase breaks in Thai text. Unlike English, Thai has no word boundary delimiter and no punctuation mark at the end of a sentence, which makes the problem harder. An incorrectly detected phrase break not only produces unnatural speech but can also change the meaning. In this paper, we apply two machine learning algorithms, C4.5 and RIPPER, to phrase break detection. These algorithms can learn useful features for locating a phrase break position. The features investigated in our experiments are collocations in different window sizes and the number of syllables before and after the word in question relative to a phrase break position. We compare the results of C4.5 and RIPPER with a baseline method (a part-of-speech sequence model). The experiments show that C4.5 and RIPPER both appear to outperform the baseline, and that RIPPER achieves higher accuracy than C4.5.
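As a rough illustration of the two feature families named in the abstract (context words in a window, and syllable counts before and after the word in question), the following minimal sketch builds a feature dictionary for one candidate break position. It is not the authors' implementation: the function name `extract_features`, the `PAD` token, the feature key names, and the choice to count syllables from the previous break to the end of the sentence are all assumptions made for illustration. The input is assumed to be already segmented into words with known syllable counts.

```python
from typing import Dict, Sequence, Union

PAD = "<pad>"  # hypothetical padding token for positions outside the sentence


def extract_features(
    words: Sequence[str],
    syllables: Sequence[int],
    index: int,
    window: int = 2,
    last_break: int = 0,
) -> Dict[str, Union[str, int]]:
    """Build features for the candidate break right after words[index].

    Assumptions (not from the paper): `last_break` is the index of the
    first word after the previous break, and "syllables after" is counted
    up to the end of the sentence.
    """
    if len(words) != len(syllables):
        raise ValueError("words and syllables must have the same length")
    if not 0 <= index < len(words):
        raise IndexError("index out of range")

    features: Dict[str, Union[str, int]] = {}

    # Context words around the candidate position, a simple stand-in
    # for collocation features at a given window size.
    for offset in range(-window + 1, window + 1):
        pos = index + offset
        inside = 0 <= pos < len(words)
        features[f"w[{offset:+d}]"] = words[pos] if inside else PAD

    # Syllables from the previous break up to and including this word.
    features["syl_before"] = sum(syllables[last_break:index + 1])
    # Syllables remaining after this word.
    features["syl_after"] = sum(syllables[index + 1:])
    return features


if __name__ == "__main__":
    # Placeholder tokens instead of real Thai words.
    demo_words = ["w1", "w2", "w3", "w4"]
    demo_syllables = [1, 2, 1, 3]
    print(extract_features(demo_words, demo_syllables, index=1))
```

Feature dictionaries of this kind could then be passed to any decision-tree or rule learner; the abstract names C4.5 and RIPPER, but the learner itself is outside the scope of this sketch.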


Bibliographic details

Source
European Conference on Speech Communication and Technology - EUROSPEECH 2003 (INTERSPEECH 2003), vol. 1; September 1-4, 2003; Geneva (CH) | 2003 | pp. 325-328 | 4 pages
Conference location: Geneva (CH)
Authors
Virongrong Tesprasit; Paisarn Charoenpornsawat; Virach Sornlertlamvanich
Affiliation

Information Research and Development Division, National Electronics and Computer Technology Center, Thailand
Original format: PDF
Language: English
Chinese Library Classification: automatic information theory
Date indexed: 2022-08-26 13:48:54

