首页> 外文期刊>Computational linguistics >Generating Tailored, Comparative Descriptions with Contextually Appropriate Intonation
【24h】

Generating Tailored, Comparative Descriptions with Contextually Appropriate Intonation

机译:生成具有上下文相关语调的量身定制的比较描述

获取原文
       

摘要

Generating responses that take user preferences into account requires adaptation at all levels of the generation process. This article describes a multi-level approach to presenting user-tailored information in spoken dialogues which brings together for the first time multi-attribute decision models, strategic content planning, surface realization that incorporates prosody prediction, and unit selection synthesis that takes the resulting prosodic structure into account. The system selects the most important options to mention and the attributes that are most relevant to choosing between them, based on the user model. Multiple options are selected when each offers a compelling trade-off. To convey these trade-offs, the system employs a novel presentation strategy which straightforwardly lends itself to the determination of information structure, as well as the contents of referring expressions. During surface realization, the prosodic structure is derived from the information structure using Combinatory Categorial Grammar in a way that allows phrase boundaries to be determined in a flexible, data-driven fashion. This approach to choosing pitch accents and edge tones is shown to yield prosodic structures with significantly higher acceptability than baseline prosody prediction models in an expert evaluation. These prosodic structures are then shown to enable perceptibly more natural synthesis using a unit selection voice that aims to produce the target tunes, in comparison to two baseline synthetic voices. An expert evaluation and f0 analysis confirm the superiority of the generator-driven intonation and its contribution to listeners' ratings.
机译:生成考虑用户偏好的响应需要在生成过程的所有级别进行调整。本文介绍了一种在语音对话中呈现用户量身定制的信息的多级方法,该方法首次将多属性决策模型,战略内容规划,结合韵律预测的表面实现以及采用最终韵律的单元选择综合在一起结构考虑在内。系统根据用户模型选择要提及的最重要的选项以及与它们最相关的属性。当每个选项都具有令人信服的权衡时,将选择多个选项。为了传达这些折衷,该系统采用了一种新颖的表示策略,该策略可以直接确定信息结构以及引用表达式的内容。在表面实现过程中,韵律结构是使用组合类别语法从信息结构派生的,该方式允许以灵活的数据驱动方式确定短语边界。在专家评估中,这种选择音高重音和边缘音调的方法显示出比基线韵律预测模型具有更高可接受性的韵律结构。然后,与两个基准合成声音相比,这些韵律结构可以使用旨在产生目标乐曲的单元选择声音来实现更自然的合成。专家评估和f0分析证实了发生器驱动音调的优越性及其对听众评级的贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号