A Hybrid Generative-Discriminative Approach to Speaker Diarization

Abstract

In this paper we present a sound probabilistic approach to speaker diarization. We use a hybrid framework where a distribution over the number of speakers at each point of a multimodal stream is estimated with a discriminative model. The output of this process is used as input in a generative model that can adapt to a novel test set and perform high accuracy speaker diarization. We manage to deal efficiently with the less common, and therefore harder, segments like silence and multiple speaker parts in a principled probabilistic manner.
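
The abstract does not specify the models used, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical discriminative model that outputs, for each frame, a distribution over the number of active speakers (`count_probs`), and a hypothetical generative model that supplies a likelihood for each possible set of active speakers (`state_likelihoods`). The names `speaker_states`, `frame_posterior`, and `diarize`, the uniform split of the count probability across same-size speaker sets, and the frame-independent decoding are all assumptions made for illustration.

```python
from itertools import combinations


def speaker_states(speakers):
    """Enumerate every subset of speakers: silence, single speakers, overlaps."""
    return [frozenset(c)
            for k in range(len(speakers) + 1)
            for c in combinations(speakers, k)]


def frame_posterior(count_probs, state_likelihoods, speakers):
    """Combine a discriminative speaker-count distribution with generative
    per-state likelihoods into a posterior over active-speaker sets.

    Assumption: the probability of a given count is split uniformly across
    all speaker sets of that size.
    """
    states = speaker_states(speakers)
    n_with_count = {}
    for s in states:
        n_with_count[len(s)] = n_with_count.get(len(s), 0) + 1

    scores = {}
    for s in states:
        prior = count_probs.get(len(s), 0.0) / n_with_count[len(s)]
        scores[s] = prior * state_likelihoods[s]

    total = sum(scores.values())
    if total == 0:
        raise ValueError("all states have zero probability for this frame")
    return {s: v / total for s, v in scores.items()}


def diarize(frames, speakers):
    """Label each frame with its most probable set of active speakers.

    Assumption: frames are decoded independently, with no temporal model.
    """
    labels = []
    for count_probs, state_likelihoods in frames:
        post = frame_posterior(count_probs, state_likelihoods, speakers)
        labels.append(max(post, key=post.get))
    return labels


if __name__ == "__main__":
    speakers = ["A", "B"]
    states = speaker_states(speakers)
    # Made-up numbers for a single frame that looks like one speaker, likely A.
    likelihoods = dict(zip(states, [0.1, 0.8, 0.2, 0.3]))
    frames = [({0: 0.05, 1: 0.9, 2: 0.05}, likelihoods)]
    print(diarize(frames, speakers))
```

In this sketch, silence is the empty set and overlapping speech is any set with more than one speaker, so the count distribution acts as a prior that can favor or suppress those less common segment types before the generative likelihoods are applied.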

Bibliographic record

Source: International Workshop on Machine Learning for Multimodal Interaction, 2008, 12 pages
Authors: Athanasios K. Noulas; Tim van Kasteren; Ben J.A. Kroese
Original format: PDF
Chinese Library Classification: TP3-53
Indexed: 2022-08-20 21:20:25

Similar literature

1. Incomplete-Data-Driven Speaker Segmentation for Diarization Application; A Help-Training Approach [J]. Teimoori Farshad, Razzazi Farbod. Circuits, Systems, and Signal Processing. 2019, No. 6
2. A novel approach for speaker diarization system using TMFCC parameterization and Lion optimization [J]. V. Subba Ramaiah, R. Rajeswara Rao. Journal of Central South University (English Edition). 2017, No. 11
3. A Hybrid Generative-Discriminative Approach to Speaker Diarization [C]. Athanasios K. Noulas, Tim van Kasteren, Ben J.A. Kroese. International Workshop on Machine Learning for Multimodal Interaction; MLMI 2008. 2008
4. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D]. Shokouhi, Navid. 2017
5. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O]. Lukas Fürer, Nathalie Schenk, Volker Roth. 2020
6. Combining SGMM speaker vectors and KL-HMM approach for speaker diarization [O]. Srikanth Madikeri, Petr Motlicek, Herve Bourlard. 2015
7. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment [R]. Hansen, J. H. 2015
