首页> 外国专利> SYSTEM-EFFECTED TEXT ANNOTATION FOR EXPRESSIVE PROSODY IN SPEECH SYNTHESIS AND RECOGNITION

SYSTEM-EFFECTED TEXT ANNOTATION FOR EXPRESSIVE PROSODY IN SPEECH SYNTHESIS AND RECOGNITION

机译：语音合成与识别中表达性语体的系统影响文本标注

页面导航

摘要
著录项
相似文献

摘要

The inventive system can automatically annotate the relationship of text and acoustic units for the purposes of: (a) predicting how the text is to be pronounced as expressively synthesized speech, and (b) improving the proportion of expressively uttered speech as correctly identified text representing the speaker's message. The system can automatically annotate text corpora for relationships of uttered speech for a particular speaking style and for acoustic units in terms of context and content of the text to the utterances. The inventive system can use kinesthetically defined expressive speech production phonetics that are recognizable and controllable according to kinesensic feedback principles. In speech synthesis embodiments of the invention, the text annotations can specify how the text is to be expressively pronounced as synthesized speech. Also, acoustically-identifying features for dialects or mispronunciations can be identified so as to expressively synthesize alternative dialects or stylistic mispronunciations for a speaker from a given text. In speech recognition embodiments of the invention, each text annotation can be uniquely identified from the corresponding acoustic features of a unit of uttered speech to correctly identify the corresponding text. By employing a method of rules-based text annotation, the invention enables expressiveness to be altered to reflect syntactic, semantic, and/or discourse circumstances found in text to be synthesized or in an uttered message.

机译：本发明的系统可以出于以下目的自动注释文本和声学单元的关系：（a）预测如何将文本发音为表达性合成语音，以及（b）提高表达性语音作为正确识别的文本表示的比例演讲者的信息。该系统可以自动注释文本语料库，以用于特定说话风格和语音单位的上下文关系和文本内容与发声的关系。本发明的系统可以使用根据运动感觉反馈原理可识别和控制的运动学定义的表达语音产生语音。在本发明的语音合成实施例中，文本注释可以指定如何将文本作为合成语音表达地发音。同样，可以识别针对方言或发音的声学识别特征，以便从给定的文本表达性地合成说话者的替代方言或风格错误的发音。在本发明的语音识别实施例中，每个文本注释可以从一口语音单元的相应声学特征中唯一地识别出来，以正确地识别相应的文本。通过采用基于规则的文本注释的方法，本发明使表达能力得以改变以反映在要合成的文本或说出的消息中发现的句法，语义和/或话语环境。

著录项

公开/公告号US2009048843A1

专利类型
公开/公告日2009-02-19

原文格式PDF
申请/专利权人 RATTIMA NITISAROJ;GARY MARPLE;NISHANT CHANDRA;
展开▼

申请/专利号US20080188763
发明设计人 RATTIMA NITISAROJ;GARY MARPLE;NISHANT CHANDRA;
展开▼

申请日2008-08-08
分类号G10L13/08;
国家 US
入库时间 2022-08-21 19:33:39

相似文献

专利
外文文献
中文文献