首页> 外文会议>IEEE International Conference on Data Science in Cyberspace >Toward Predicting Communication Effectiveness
【24h】

Toward Predicting Communication Effectiveness

机译:预测沟通效果

获取原文

摘要

Voice plays an important role in effective communication. This paper investigates communication effectiveness using call center audio records, where each record contains a conversation between a sales representative and a customer. The outcome is considered positive if the conversation ended with an appointment being made and negative otherwise. Two methods are proposed to predict the outcomes. The first method uses traditional machine learning algorithms on a small set of commonly-used, well-known acoustic features of voice in psychology research (i.e. prosodic features). A data processing pipeline has been developed that consists of three major steps: 1) an audio record is split into segments by speaker turns; 2) acoustic features are extracted from each segment; and 3) classification models are trained on the acoustic features to predict conversation outcomes. The second method uses deep learning on a larger set of vocal features. Each audio segment is split into fixed-interval pieces, from which acoustic feature vectors are computed and are stacked in temporal order to form a feature matrix as input to deep convolution neural networks. Experimental results based on real call center data shows that acoustic features, such as Mel-frequency cepstral coefficients (MFCCs), timbre and Chroma features, are good indicators of call success, much better than prosodic features. Temporal information in an audio record can be captured by deep convolutional neural networks for improved prediction accuracy, up to 82.6%.
机译:语音在有效的沟通中起着重要的作用。本文研究了使用呼叫中心音频记录的沟通效果,其中每个记录都包含销售代表和客户之间的对话。如果对话以约会结束而被认为是积极的,否则被否定。提出了两种方法来预测结果。第一种方法是在心理学研究中的一小套常用的,众所周知的声音声学特征上使用传统的机器学习算法。已经开发了一个数据处理管道,该管道包括三个主要步骤:1)音频记录通过扬声器的转动被分成几段; 2)从每个片段中提取声学特征; 3)在声学特征上训练分类模型,以预测对话结果。第二种方法是在较大的语音特征集上使用深度学习。每个音频片段被分成固定间隔的片段,从中计算出声学特征向量,并按时间顺序对其进行堆叠,以形成特征矩阵,作为深度卷积神经网络的输入。基于真实呼叫中心数据的实验结果表明,声学特征(例如梅尔频率倒谱系数(MFCC),音色和色度特征)是呼叫成功的良好指标,比韵律特征要好得多。音频记录中的时间信息可以通过深度卷积神经网络捕获,以提高预测精度,最高可达82.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号