Published in: 電子情報通信学会技術研究報告. 音声. Speech

An evaluation of automatic phoneme segmentation for concatenative speech synthesis

Abstract

This paper studies the performance of automatic phoneme segmentation from three viewpoints: (1) temporal precision, (2) effects on segment selection, and (3) effects on the naturalness of synthetic speech. The absolute errors of the phoneme beginning time for the best 90% and the worst 10% were 4.6 ms and 25.9 ms, respectively, which are comparable to discrepancies among human labelers. Our segment selection algorithm was found to be able to eliminate waveform segments with large temporal errors, although not perfectly. In a perception test in which naturalness was pair-compared between synthetic speech generated from hand-labeled data and from auto-labeled data, the difference was found to be marginal in practice, although the latter is statistically inferior.
