首页> 外文会议>International conference on advances in speech and language technologies for Iberian languages >Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls
【24h】

Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls

机译:自动语音功能学习可连续预测联络中心电话中的客户满意度

获取原文

摘要

Speech related processing tasks have been commonly tackled using engineered features, also known as hand-crafted descriptors. These features have usually been optimized along years by the research community that constantly seeks for the most meaningful, robust, and compact audio representations for the specific domain or task. In the last years, a great interest has arisen to develop architectures that are able to learn by themselves such features, thus by-passing the required engineering effort. In this work we explore the possibility to use Convo-lutional Neural Networks (CNN) directly on raw audio signals to automatically learn meaningful features. Additionally, we study how well do the learned features generalize for a different task. First, a CNN-based continuous conflict detector is trained on audios extracted from televised political debates in French. Then, while keeping previous learned features, we adapt the last layers of the network for targeting another concept by using completely unrelated data. Concretely, we predict self-reported customer satisfaction from call center conversations in Spanish. Reported results show that our proposed approach, using raw audio, obtains similar results than those of a CNN using classical Mel-scale filter banks. In addition, the learning transfer from the conflict detection task into satisfaction prediction shows a successful generalization of the learned features by the deep architecture.
机译:语音相关的处理任务通常使用工程特征(也称为手工描述符)来解决。这些功能通常经过研究团体多年来的优化,不断寻求针对特定领域或任务的最有意义,最强大和最紧凑的音频表示。在过去的几年中,人们对开发能够自行学习这些功能从而绕过所需的工程工作的体系结构产生了浓厚的兴趣。在这项工作中,我们探索了直接在原始音频信号上使用卷积神经网络(CNN)来自动学习有意义的功能的可能性。此外,我们研究了学习到的功能对于其他任务的概括效果如何。首先,对基于CNN的连续冲突检测器进行了从法语的电视政治辩论中提取的音频训练。然后,在保留以前学习的功能的同时,我们通过使用完全不相关的数据来调整网络的最后一层,以定位另一个概念。具体而言,我们通过西班牙语的呼叫中心对话来预测自我报告的客户满意度。报告的结果表明,我们提出的使用原始音频的方法所获得的结果与使用经典梅尔级滤波器组的CNN所获得的结果相似。此外,从冲突检测任务到满意度预测的学习转移表明,深层架构成功地概括了学习到的特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号