首页> 外文会议>European Signal Processing Conference >Data-driven and physical model-based designs of probabilistic spatial dictionary for online meeting diarization and adaptive beamforming
【24h】

Data-driven and physical model-based designs of probabilistic spatial dictionary for online meeting diarization and adaptive beamforming

机译:基于数据驱动和基于物理模型的概率空间字典的设计,用于在线会议数字化和自适应波束形成

获取原文

摘要

In this paper, we comparatively study alternative dictionary designs for recently proposed meeting diarization and adaptive beamforming based on a probabilistic spatial dictionary. This dictionary models the feature distribution for each possible direction of arrival (DOA) of speech signals and the feature distribution for background noise. The dictionary enables online DOA detection, which in turn enables online diarization. Here we describe data-driven and physical model-based designs of the dictionary. Experiments on a meeting dataset showed that a physical model-based dictionary gave a word error rate (WER) of 24.9 %, which is close to that for the best-performing data-driven dictionary (24.1%). Therefore, the former has a significant advantage over the latter that it allows us to bypass the cumbersome measurement of training data without much degrading the performance of the automatic speech recognition (ASR).
机译:在本文中,我们比较研究了基于概率空间字典的最近提出的满足会议化和自适应波束形成的替代字典设计。该词典对语音信号的每个可能到达方向(DOA)的特征分布以及背景噪声的特征分布进行建模。该词典启用在线DOA检测,进而启用在线数字化。在这里,我们描述了字典的基于数据驱动和基于物理模型的设计。对会议数据集的实验表明,基于物理模型的字典的单词错误率(WER)为24.9 \%,与性能最佳的数据驱动字典的错误率(24.1 \%)接近。因此,前者比后者具有显着的优势,它使我们能够绕开训练数据的繁琐测量,而不会大大降低自动语音识别(ASR)的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号