首页> 外文会议>International Symposium on Chinese Spoken Language Processing >Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network
【24h】

Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network

机译:使用多通道卷积双向长短期记忆网络的语音对话应用重点检测

获取原文

摘要

Emphasis detection is important for user intention understanding in human-computer interaction scenario. Techniques have been developed to detect the emphatic words in speech, but challenges still exist in Voice Dialogue Applications (VDAs): the tremendous non-specific speakers and their various expressions. In this work, we present a novel approach to automatically detect emphasis in VDAs by using multi-channel convolutional bi -directional long short-term memory neural networks (MC-BLSTM), which can learn various expressions of large amounts of speakers and long span temporal dependencies across speech trajectories. In particular, we first use a multi-channel convolutional component in the proposed approach to extract high-level representation of input acoustic features for emphasis detection. The experimental results on a 3400 real-world dataset collected from Sogou 11http://yy.sogou.com Voice Assistant outper-form current state-of-the-art baseline systems (+6.2% in terms of F1-measure on average).
机译:重点检测对于理解人机交互场景中的用户意图很重要。已经开发了检测语音中有重点的单词的技术,但是语音对话应用程序(VDA)仍然存在挑战:巨大的非特定说话者及其各种表达方式。在这项工作中,我们提出了一种新颖的方法,该方法通过使用多通道卷积双向长短期记忆神经网络(MC-BLSTM)自动检测VDA中的重点,该方法可以学习大量说话者和长跨度的各种表达跨语音轨迹的时间依赖性。特别是,我们首先在提出的方法中使用多通道卷积分量,以提取输入声学特征的高级表示,以进行重点检测。从搜狗收集的3400个真实世界数据集上的实验结果 11 http://yy.sogou.com语音助手的性能优于当前最先进的基准系统(按F1度量平均增加6.2%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号