Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

机译：Proteno：文本归一化与有限的数据，用于语音系统文本的快速部署

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Developing Text Normalization (TN) systems for Text-to-Speech (TTS) on new languages is hard. We propose a novel architecture to facilitate it for multiple languages while using data less than 3% of the size of the data used by the state of the art results on English. We treat TN as a sequence classification problem and propose a granular tok-enization mechanism that enables the system to learn majority of the classes and their normalizations from the training data itself. This is further combined with minimal pre-coded linguistic knowledge for other classes. We publish the first results on TN for TTS in Spanish and Tamil and also demonstrate that the performance of the approach is comparable with the previous work done on English.

机译：开发用于新语言的文本语音（TTS）的文本归一化（TN）系统很难。我们提出了一种新颖的架构，以方便多种语言，同时使用少于艺术状态的数据尺寸的数据占英语的数据的大小的3％。我们将TN视为序列分类问题，提出了一种粒度的TOK统治机制，使系统能够从培训数据本身中学习大多数类别和他们的常规程度。这与其他类的最小预编码语言知识相结合。我们在西班牙语和泰米尔中发布TN的第一个结果，也表明该方法的性能与英语上以前的工作相当。

著录项

来源
《Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies》|2021年|72-79|共8页
会议地点
作者
Shubhi Tyagi; Antonio Bonafonte; Jaime Lorenzo-Trueba; Javier Latorre;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Neural Text Normalization in Speech-to-Text Systems with Rich Features [J] . Tran Oanh Thi, Bui Viet The Applied Artificial Intelligence . 2021,第1a4期

机译：具有丰富功能的语音到文本系统中的神经文本规范化
2. LINGUISTIC ASPECTS OF TEXT NORMALIZATION IN A POLISH TEXT-TO-SPEECH SYSTEM [J] . Filip Gralinski, Krzysztof Jassem, Agnieszka Wagner, Systems Science . 2006,第4期

机译：波兰语文本到语音系统中文本规范化的语言方面
3. Feature Extraction and Analysis of Speech Quality for Tamil Text-To-Speech Synthesis System using Fast Fourier Transform [J] . Dr.P.Uma Maheswari, K.C.Rajeswari Australian Journal of Basic and Applied Sciences . 2015,第2015期

机译：快速傅里叶变换的泰米尔语语音合成系统特征提取与语音质量分析
4. A THREE-STAGE TEXT NORMALIZATION STRATEGY FOR MANDARIN TEXT-TO-SPEECH SYSTEMS [C] . Tao Zhou, Yuan Dong, Dezhi Huang, International Symposium on Chinese Spoken Language Processing . 2008

机译：普通话文本到语音系统的三阶段正常化策略
5. Building a prosodically sensitive diphone database for a Korean text-to-speech synthesis system. [D] . Yoon, Kyuchul. 2005

机译：为韩国文字转语音合成系统建立一个对韵律敏感的diphone数据库。
6. Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task [O] . Abeed Sarker, Maksim Belousov, Jasper Friedrichs, 2018

机译：Twitter上与药物有关的文本分类和概念归一化的数据和系统：来自社交媒体健康促进会（SMM4H）-2017的共享任务的见解
7. Text normalization with varied data sources for conversational speech language modeling [O] . Sarah Schwarm, Mari Ostendorf 2002

机译：使用各种数据源进行文本规范化，以进行对话语音建模
8. PROTEUS (PROtotype TExt Understanding System) and PUNDIT (Prolog UNDerstander of Integrated Text): Research in Text Understanding at the Department of Computer Science, New York University and System Development Corporation--A Bur [R] . Grishman, R., Hirschman, L. 1986

机译：pROTEUs（pROtotype TExt理解系统）和pUNDIT（综合文本的prolog UNDerstander）：纽约大学计算机科学系和系统开发公司的文本理解研究 - Bur

Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

摘要

著录项

相似文献

相关主题

期刊订阅