Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls



Abstract

Speech-related processing tasks have commonly been tackled using engineered features, also known as hand-crafted descriptors. These features have usually been optimized over the years by the research community, which constantly seeks the most meaningful, robust, and compact audio representations for a specific domain or task. In recent years, great interest has arisen in developing architectures that can learn such features by themselves, thus bypassing the required engineering effort. In this work we explore the possibility of using Convolutional Neural Networks (CNN) directly on raw audio signals to automatically learn meaningful features. Additionally, we study how well the learned features generalize to a different task. First, a CNN-based continuous conflict detector is trained on audio extracted from televised political debates in French. Then, while keeping the previously learned features, we adapt the last layers of the network to target another concept using completely unrelated data. Concretely, we predict self-reported customer satisfaction from call center conversations in Spanish. Reported results show that our proposed approach, using raw audio, obtains results similar to those of a CNN using classical Mel-scale filter banks. In addition, the transfer of learning from the conflict detection task to satisfaction prediction shows that the features learned by the deep architecture generalize successfully.
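The two-stage idea in the abstract (keep the convolutional features learned on one task, retrain only the output layers on another) can be illustrated with a toy sketch. Everything here is an assumption made for illustration and not the authors' architecture: the two hand-set sinusoidal kernels merely stand in for filters that would have been learned on the source task, the waveforms and satisfaction scores are synthetic, and the "last layers" are reduced to a single linear head trained by plain gradient descent.

```python
import math
import random

def sine(cycles, length, amp=1.0, phase=0.0):
    """Return `cycles` sine periods sampled at `length` points."""
    return [amp * math.sin(2 * math.pi * cycles * t / length + phase)
            for t in range(length)]

def conv1d(signal, kernel):
    """Valid 1-D cross-correlation of a raw waveform with one filter."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]

def extract_features(signal, kernels):
    """Frozen extractor: convolution, ReLU, then global max pooling per filter."""
    return [max(max(v, 0.0) for v in conv1d(signal, k)) for k in kernels]

def predict(x, w, b):
    """Linear output layer."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def mse(features, targets, w, b):
    """Mean squared error of the linear head over a data set."""
    return sum((predict(x, w, b) - y) ** 2
               for x, y in zip(features, targets)) / len(targets)

def train_head(features, targets, lr=0.05, epochs=200):
    """Fit only the linear head with per-sample gradient descent on squared error."""
    w, b = [0.0] * len(features[0]), 0.0
    for _ in range(epochs):
        for x, y in zip(features, targets):
            err = predict(x, w, b) - y
            w = [wi - lr * err * xi for wi, xi in zip(w, x)]
            b -= lr * err
    return w, b

# Hypothetical stand-ins for filters learned on the source task; never updated below.
FROZEN_KERNELS = [sine(1, 16, amp=1 / 8), sine(4, 16, amp=1 / 8)]

# Synthetic target-task data: slow waveforms get score 0.2, fast ones 0.8 (arbitrary).
rng = random.Random(0)
signals, targets = [], []
for n in range(20):
    cycles, score = ((4, 0.2), (16, 0.8))[n % 2]
    signals.append(sine(cycles, 64, phase=rng.uniform(0, 2 * math.pi)))
    targets.append(score)

features = [extract_features(s, FROZEN_KERNELS) for s in signals]
mse_before = mse(features, targets, [0.0, 0.0], 0.0)
w, b = train_head(features, targets)
mse_after = mse(features, targets, w, b)
print(f"MSE before: {mse_before:.3f}, after: {mse_after:.3f}")
```

Because the kernels are only read and never written during training, the adaptation step touches the head alone. In a real deep learning framework the same effect is usually obtained by excluding the early layers from the optimizer's parameter updates.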


Bibliographic record

Source
International Conference on Advances in Speech and Language Technologies for Iberian Languages, 2016, pp. 255-265 (11 pages)
Authors
Carlos Segura; Daniel Balcells; Marti Umbert; Javier Arias; Jordi Luque
Original format: PDF
Keywords
Feature learning; End-to-end learning; Convolutional neural networks; Conflict speech retrieval; Automatic tagging;


Similar literature

1. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J]. Mohammad Abushariah, Raja Ainon, Roziati Zainuddin. The International Arab Journal of Information Technology, 2012, No. 1
2. Customer Satisfaction Estimation in Contact Center Calls Based on a Hierarchical Multi-Task Model [J]. Atsushi Ando, Ryo Masumura, Hosana Kamiyama. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
3. Robust phoneme classification for automatic speech recognition using hybrid features and an amalgamated learning model [J]. Mohammed Kamal Khwaja, Peddakota Vikash, P. Arulmozhivarman. International Journal of Speech Technology, 2016, No. 4
4. Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls [C]. Carlos Segura, Daniel Balcells, Marti Umbert. International Conference on Advances in Speech and Language Technologies for Iberian Languages, 2016
5. Contact Center Employee Characteristics Associated with Customer Satisfaction [D]. Pow, Lara. 2017
6. Emergence of an Action Repository as Part of a Biologically Inspired Model of Speech Processing: The Role of Somatosensory Information in Learning Phonetic-Phonological Sound Features [O]. Bernd J. Kröger, Tanya Bafna, Mengxue Cao. 2005
7. On customer contact centers with a call-back option: Customer decisions, routing rules, and system design [O]. Mor Armony, Constantinos Maglaras. 2002
