Conference: International Symposium on Chinese Spoken Language Processing
Perceivable information structure in discourse prosody-Detecting prominent prosodic words in spoken discourse using F0 contour

Abstract

The present study attempts to show how information structure and discourse structure are represented through prosodic patterns in continuous speech, using F0 features. The complementary relationship between information structure and discourse structure, as reflected by the prosodic feature F0, can account for the contribution of prosody to speech understanding. We assume that perceived emphases or foci (prominence) are important information assigned by information structure and marked within prosodic units by hump-shaped peaks in the F0 contour. These F0 peaks are first compared with the locations of linguistic units, namely lexical entries (words), and of prosodic units, namely prosodic words (PWs), to decide which unit best represents the allocation of key information. Although the PW is defined as the perceptually identifiable unit at the lowest level of the prosodic hierarchy of spoken discourse, the higher-level consistency in location between PWs and information arrangement operates through prosodic units larger than words, suggesting that the PW is a plausible unit from which to derive key information. Information foci in PW units are then detected and compared before and after the higher-level context of discourse prosody is taken into account. Detection accuracy of information foci improves significantly after contributions from discourse context are removed, across different speech genres and languages (English and Mandarin), showing specifically how F0 peaks are correlated with key information content. The findings thus shed light on how and why prosodic features contribute significantly to speech understanding, and also suggest how such findings could be applied to technology development.
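The idea of comparing PW-level F0 peaks before and after removing higher-level context can be illustrated with a small sketch. This is not the authors' procedure: the abstract does not say how the discourse contribution is modelled, so the sketch assumes a straight-line fit over each phrase as a stand-in. The function names (`detrend_by_phrase`, `pw_peak_heights`, `prominent_pws`, `demo_contour`), the threshold, and the synthetic contour are all hypothetical.

```python
import numpy as np

def detrend_by_phrase(f0, phrases):
    """Subtract a least-squares line from each phrase span.

    ASSUMPTION: a per-phrase linear trend stands in for the
    higher-level discourse contribution; the abstract does not
    specify the actual model.
    """
    residual = np.asarray(f0, dtype=float).copy()
    for start, end in phrases:
        t = np.arange(start, end)
        slope, intercept = np.polyfit(t, residual[start:end], 1)
        residual[start:end] -= slope * t + intercept
    return residual

def pw_peak_heights(f0, pw_spans):
    """Maximum F0 value inside each prosodic word (half-open frame spans)."""
    f0 = np.asarray(f0, dtype=float)
    return np.array([f0[s:e].max() for s, e in pw_spans])

def prominent_pws(f0, pw_spans, threshold):
    """Indices of PWs whose peak exceeds a (hypothetical) threshold."""
    peaks = pw_peak_heights(f0, pw_spans)
    return [i for i, p in enumerate(peaks) if p > threshold]

def demo_contour():
    """Synthetic contour: one declining 40-frame phrase, four PWs,
    with a 15 Hz hump placed in the last PW."""
    t = np.arange(40)
    f0 = 200.0 - 2.0 * t + 15.0 * np.exp(-((t - 35) ** 2) / (2 * 2.0 ** 2))
    phrases = [(0, 40)]
    pw_spans = [(0, 10), (10, 20), (20, 30), (30, 40)]
    return f0, phrases, pw_spans

f0, phrases, pw_spans = demo_contour()
raw_best = int(np.argmax(pw_peak_heights(f0, pw_spans)))
residual = detrend_by_phrase(f0, phrases)
detrended_best = int(np.argmax(pw_peak_heights(residual, pw_spans)))
print("highest raw peak in PW", raw_best)
print("highest detrended peak in PW", detrended_best)
```

On this synthetic contour the raw peak heights favour the phrase-initial PW simply because the phrase starts high, while the detrended residual picks out the PW that carries the hump. That is the kind of before/after contrast the abstract describes, though real data would need measured F0, annotated PW boundaries, and a properly motivated context model.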
