Factor analysis-based approaches applied to the speaker diarization task of meetings: a preliminary study

Abstract

This paper presents a preliminary study of Factor Analysis (FA) methods in an automatic speaker diarization system for meeting-room recordings. The diarization system, based on the top-down E-HMM approach, integrates FA-based speaker modeling in an additional resegmentation step that refines the output segmentation. FA is classically applied in speaker recognition to deal with channel variability; here, two main schemes of applying it are proposed: modeling (1) the inter-speaker variability and (2) the inter-segment variability. Several kinds of experiments were conducted on the dataset of the most recent NIST/RT'09 evaluation campaign, with promising results. In particular, the two proposed schemes achieve performance competitive with the baseline system, despite the small amount of development data used for FA parameter estimation. Unexpectedly, the results also suggest that the inter-segment variability component can be helpful for speaker diarization.
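
For orientation, FA-based speaker modeling in speaker recognition is often written as a decomposition of a GMM mean supervector. The abstract does not state the formulation used in the paper, so the form and all symbols below are an assumption based on the classical channel-variability setup, not the authors' own notation:

```latex
% Assumed classical FA decomposition (not taken from the abstract).
% m           : speaker- and session-independent (UBM) mean supervector
% D y_s       : offset specific to speaker s
% U x_{(h,s)} : offset for session h of speaker s, confined to a low-rank subspace U
\[
  \mathbf{m}_{(h,s)} = \mathbf{m} + \mathbf{D}\,\mathbf{y}_{s} + \mathbf{U}\,\mathbf{x}_{(h,s)}
\]
```

How the two proposed schemes map onto these terms is not specified in the abstract.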

Bibliographic record

Source
Odyssey 2010: The Speaker and Language Recognition Workshop | 2010 | pp. 139-145 | 7 pages
Conference location: Brno (CS)
Authors
Pavel Tomasek; Corinne Fredouille; Driss Matrouf
Author affiliations

University of Avignon, CERI/LIA, Avignon France;

University of Avignon, CERI/LIA, Avignon France;

University of Avignon, CERI/LIA, Avignon France;

Original format: PDF
Text language: English
Chinese Library Classification: Speech signal processing

Similar literature

1. A Comparative Study of Bottom-Up and Top-Down Approaches to Speaker Diarization [J]. Evans N., Bozonnet S., Dong Wang. IEEE Transactions on Audio, Speech, and Language Processing, 2012, No. 2.
2. Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks [J]. Federico Landini, Jan Profant, Mireia Diez. Computer Speech and Language, 2022, Jan. issue.
3. Generalized Viterbi-based models for time-series segmentation and clustering applied to speaker diarization [J]. Itshak Lapidot, Alon Shoa, Tal Furmanov. Computer Speech and Language, 2017, Sep. issue.
4. Hybrid Speech/non-speech detector applied to Speaker Diarization of Meetings [C]. Xavier Anguera, Mateu Aguilo, Chuck Wooters. IEEE Odyssey-The Speaker and Language Recognition Workshop, 2006.
5. Use of speaker location features in meeting diarization [D]. Otterson, Scott. 2008.
6. How Do Movement Patterns in Weightlifting (Clean) Change When Using Lighter or Heavier Barbell Loads? A Comparison of Two Principal Component Analysis-Based Approaches to Studying Technique [O]. Inge Werner, Nicolai Szelenczy, Felix Wachholz. 2020.
7. Hybrid speech/non-speech detector applied to speaker diarization of meetings [O]. Xavier Anguera, Mateu Aguilo, Chuck Wooters. 2006.
8. Preliminary study of applied load factors in bumpy air [R]. Rhode, Richard V.; Lundquist, Eugene E. 1931.
