首页> 外国专利> Low-dimensional real-time concatenative speech synthesizer

Low-dimensional real-time concatenative speech synthesizer

机译:低维实时级联语音合成器

摘要

A method of providing real-time speech synthesis based on user input includes presenting a graphical user interface having a low-dimensional representation of a multi-dimensional phoneme space, a first dimension representing degree of vocal tract constriction and voicing, a second dimension representing location in a vocal tract. One example employs a disk-shaped layout. User input is received via the interface and translated into a sequence of phonemes that are rendered on an audio output device. Additionally, a synthesis method includes maintaining a library of prerecorded samples of diphones organized into diphone groups, continually receiving a time-stamped sequence of phonemes to be synthesized, and selecting a sequence of diphone groups with their time stamps. A best diphone within each group is identified and placed into a production buffer from which diphones are rendered according to their time stamps.
机译:一种基于用户输入提供实时语音合成的方法,包括呈现图形用户界面,该界面具有多维音素空间的低维表示,第一维表示声道收缩和发声的程度,第二维表示位置在声带中。一个示例采用盘形布局。通过接口接收用户输入,并将其转换为在音频输出设备上呈现的音素序列。另外,一种合成方法包括:维护被组织成双音素组的双音素的预记录样本的库;连续接收要合成的音素的时间戳序列;以及选择具有其时间戳的双音素组的序列。识别出每个组中最佳的diphone,并将其放入生产缓冲区中,并根据其时间戳渲染diphone。

著录项

  • 公开/公告号US10553199B2

    专利类型

  • 公开/公告日2020-02-04

    原文格式PDF

  • 申请/专利权人 TRUSTEES OF BOSTON UNIVERSITY;

    申请/专利号US201615570889

  • 申请日2016-05-20

  • 分类号G10L13;G10L13/04;G10L13/06;G10L19;G10L13/08;G10L15;G06F3/0482;G06F3/0484;G06F3/048;G10L13/027;

  • 国家 US

  • 入库时间 2022-08-21 11:24:47

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号