首页> 外文会议>European Signal Processing Conference >Cross-entropic comparison of the effects of accent, speaker and database recording on spectral features of English accents
【24h】

Cross-entropic comparison of the effects of accent, speaker and database recording on spectral features of English accents

机译:口音,说话者和数据库记录对英语口音频谱特征的影响的跨熵比较

获取原文
获取外文期刊封面目录资料

摘要

This paper investigates the use of cross-entropy information measure for quantification and comparison of the impact of the variations of accents, speaker groups and recordings on the probability models of spectral features of phonetic units of speech. Cross-entropy measure can be used in applications such as accent identification, improved speech recognition, cross-accent phonetic-tree analysis and analysis of the influence of accents on different sets of speech parameters and models. For the purpose of this study the focus is on British English, Australian English and two different databases of American English accents (namely WSJ and TIMIT). Comparison of the cross entropies of formants and cepstrum features indicate that cepstrum features are less indicative of accents compared to formants. In particular it appears that the measurements of differences in formants across accents are less sensitive to different recording or databases. It is found that the cross entropies of the same phonemes across different accents (inter-accent distances) are significantly greater than the cross entropies of the same phonemes across different speaker groups of the same accent (intra-accent distances). The cross entropy measure is also used to construct cross-accent phonetic trees, which serve to show the structural similarities and differences of the phonetic systems across accents.
机译:本文研究使用交叉熵信息量度来量化和比较口音,说话者组和录音变化对语音的语音单位频谱特征概率模型的影响。交叉熵度量可用于重音识别,改进的语音识别,重音语音树分析以及重音对不同语音参数和模型集的影响分析等应用中。出于本研究的目的,重点是英式英语,澳大利亚英语和两个不同的美式英语口音数据库(即WSJ和TIMIT)。共振峰和倒频谱特征的交叉熵的比较表明,与共振峰相比,倒频谱特征对口音的指示较少。尤其是,跨口音的共振峰差异的测量似乎对不同的记录或数据库不太敏感。发现相同音素在不同重音之间的交叉熵(重音间距离)明显大于相同音素在不同说话人组之间的重音的交叉熵(重音内距离)。交叉熵测度还用于构建交叉口音的语音树,该语音树用于显示跨口音的语音系统的结构相似性和差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号