首页> 外文会议>Pacific-Rim Conference on Multimedia >A Combined Feature Approach for Speaker Segmentation Using Convolution Neural Network
【24h】

A Combined Feature Approach for Speaker Segmentation Using Convolution Neural Network

机译:卷积神经网络扬声器分割的组合特征方法

获取原文

摘要

In this paper, a speaker segmentation algorithm is proposed based on a Combined feature approach using the Convolution Neural Network (CNN), which is used to deal with the speaker segmentation problem of dialogue speech with partial prior knowledge in the CALL_CENTER environment. For the first time, the Mel-Frequency Cepstral Coefficients (MFCC) feature and the SPEC-TROGRAM feature are combined as the input of CNN to train the speakers' voice feature model and to estimate the change point. In the experiments, a real database about the dialogue voice related to insurance sales and real estate sales industry is used to compare our proposed approach with Bayesian Information Criterion (BIC) approach using different acoustic features sets. The results show that the synthetical performance is improved, and our algorithm has a better segmentation.
机译:在本文中,基于使用卷积神经网络(CNN)的组合特征方法提出了一种扬声器分割算法,其用于处理对话的语音语音的扬声器分段问题,其中包含Call_Center环境中的部分先验知识。首次,熔融频率谱系数(MFCC)特征和规范堆栈特征被组合为CNN的输入,以训练扬声器的语音特征模型并估计变化点。在实验中,关于与保险销售和房地产销售行业相关的对话语音的真实数据库用于比较我们使用不同声学功能集的贝叶斯信息标准(BIC)方法的提出方法。结果表明,综合性能得到改善,我们的算法具有更好的分割。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号