Approaches to speaker detection and tracking in conversational speech

Dunn RB.; Quatieri TF.; Reynolds DA.

首页> 外文期刊>Digital Signal Processing >Approaches to speaker detection and tracking in conversational speech

【24h】

Approaches to speaker detection and tracking in conversational speech

机译：会话语音中说话人检测和跟踪的方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Two approaches to detecting and tracking speakers in multispeaker audio are described. Both approaches use an adapted Gaussian mixture model, universal background model (GMM-UBM) speaker detection system as the core speaker recognition engine. In one approach, the individual log-likelihood ratio scores, which are produced on a frame-by-frame basis by the GMM-UBM system, are used to first partition the speech file into speaker homogenous regions and then to create scores for these regions. We refer to this approach as internal segmentation. Another approach uses an external segmentation algorithm, based on blind clustering, to partition the speech file into speaker homogenous regions. The adapted GMM-UBM system then scores each of these regions as in the single-speaker recognition case. We show that the external segmentation system outperforms the internal segmentation system for both detection and tracking. In addition, we show how different components of the detection and tracking algorithms contribute to the overall system performance. (C) 2000 Academic Press. [References: 15]

机译：描述了检测和跟踪多扬声器音频中的扬声器的两种方法。两种方法都使用自适应的高斯混合模型，通用背景模型（GMM-UBM）说话者检测系统作为核心说话者识别引擎。在一种方法中，由GMM-UBM系统逐帧生成的单个对数似然比得分用于首先将语音文件划分为说话者同质区域，然后为这些区域创建得分。我们将此方法称为内部细分。另一种方法是使用基于盲聚类的外部分段算法，将语音文件划分为多个说话者同质区域。然后，如在单讲话者识别情况下，适应的GMM-UBM系统对这些区域中的每一个进行评分。我们表明，外部分割系统在检测和跟踪方面都优于内部分割系统。此外，我们展示了检测和跟踪算法的不同组件如何对整体系统性能做出贡献。（C）2000学术出版社。 [参考：15]

著录项

来源
《Digital Signal Processing》 |2000年第3期|共20页
作者
Dunn RB.; Quatieri TF.; Reynolds DA.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Speaker recognition; Detection; Tracking; Multispeaker; Gaussian mixture model; Clustering;

机译：说话人识别;检测;跟踪;多说话者;高斯混合模型;聚类;

相似文献

外文文献
中文文献
专利

1. Approaches to speaker detection and tracking in conversational speech [J] . Dunn RB., Quatieri TF., Reynolds DA. Digital Signal Processing . 2000,第1a3期

机译：会话语音中说话人检测和跟踪的方法
2. Approaches to speaker detection and tracking in conversational speech [J] . Dunn RB., Quatieri TF., Reynolds DA. Digital Signal Processing . 2000,第1a3期

机译：会话语音中说话人检测和跟踪的方法
3. Overlapping Speech Detection Using Long-Term Conversational Features for Speaker Diarization in Meeting Room Conversations [J] . Yella S.H., Bourlard H. Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2014,第12期

机译：会议室会话中使用长期会话特征进行语音重叠的语音检测重叠
4. Speaker Change Detection for Conversational Speech using Synthesized Voice Embedding [C] . Mathangi Krishnathasan, C.R.J. Amalraj International Conference on Information Technology Research . 2019

机译：使用合成语音嵌入的会话语音的扬声器改变检测
5. Advanced machine learning approaches for target detection, tracking and recognition. [D] . Venkataraman, Vijay. 2010

机译：用于目标检测，跟踪和识别的高级机器学习方法。
6. Cognitive and Structural Correlates of Conversational Speech Timing in Mild Cognitive Impairment and Mild-to-Moderate Alzheimer’s Disease: Relevance for Early Detection Approaches [O] . Céline De Looze, Amir Dehsarvi, Lisa Crosby, 2021

机译：轻度认知障碍和温和至中等阿尔茨海默病中会话语音时序的认知和结构性相关性：早期检测方法的相关性
7. Approaches to Speaker Detection and Tracking in Conversational Speech, [O] . Robert B. Dunn, Douglas A. Reynolds, Thomas F. Quatieri, 2013

机译：会话语音中说话人检测和跟踪的方法，

Approaches to speaker detection and tracking in conversational speech

摘要

著录项

相似文献

相关主题

期刊订阅