首页> 外文会议>International Conference on speech and computer >LSTM Neural Network for Speaker Change Detection in Telephone Conversations
【24h】

LSTM Neural Network for Speaker Change Detection in Telephone Conversations

机译:LSTM神经网络用于电话通话中说话人变化的检测

获取原文

摘要

In this paper, we analyze an approach to speaker change detection in telephone conversations based on recurrent Long Short-Term Memory Neural Networks. We compare this approach to speaker change detection via Convolutional Neural Networks. We show that by finetun-ing the architecture and using suitable input data in the form of spectrograms, we obtain better results relatively by 2%. We have discovered that a smaller architecture performs better on unseen data. Also, we found out that using stateful LSTM layers that try to remember whole conversations is much worse than using recurrent networks that memorize only small sequences of speech.
机译:在本文中,我们分析了一种基于循环长期短时记忆神经网络的电话交谈中说话人变化检测的方法。我们将这种方法与通过卷积神经网络进行的说话人变化检测进行了比较。我们表明,通过对体系结构进行微调并以频谱图的形式使用合适的输入数据,相对而言,我们可以获得更好的结果2%。我们发现较小的体系结构在看不见的数据上表现更好。此外,我们发现使用有状态的LSTM层尝试记住整个对话比使用仅存储少量语音序列的循环网络要糟糕得多。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号