首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Speaker diarization: A perspective on challenges and opportunities from theory to practice
【24h】

Speaker diarization: A perspective on challenges and opportunities from theory to practice

机译:扬声器日益改估:从理论到实践的挑战和机遇的视角

获取原文

摘要

This paper discusses some challenges and opportunities in developing a speaker diarization system for operation on real world call center telephony data. We contrast some of the differences between a standard data set akin to NIST evaluations and those found in call centers. In exploring these differences we discovered vulnerabilities and proposed changes to address them. In moving from theory into practice we introduce two tasks in which speaker diarization and recognition can be leveraged. First, we show that speaker diarization and recognition systems can be integrated to find the common speaker (the call center agent) across multiple calls and consequently their role. Furthermore, once the role is determined the corresponding speech recognition output can be analyzed to determine the type of support call.
机译:本文讨论了在现实世界呼叫中心电话数据上开发扬声器深度化系统方面的一些挑战和机遇。我们对比标准数据设置类似于NIST评估的一些差异以及在呼叫中心中找到的数据之间的差异。在探索这些差异时,我们发现漏洞并提出了解决它们的变化。在从理论转向实践中,我们介绍了两个任务,其中可以利用扬声器日益衰减和识别。首先,我们表明可以集成扬声器深度和识别系统,以便在多个呼叫中找到公共扬声器(呼叫中心代理),因此它们的角色。此外,一旦确定了角色,可以分析相应的语音识别输出以确定支持呼叫的类型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号