An overview of automatic speaker diarization systems

Tranter S.E.; Reynolds D.A.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >An overview of automatic speaker diarization systems

【24h】

An overview of automatic speaker diarization systems

机译：扬声器自动扩音系统概述

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Audio diarization is the process of annotating an input audio channel with information that attributes (possibly overlapping) temporal regions of signal energy to their specific sources. These sources can include particular speakers, music, background noise sources, and other signal source/channel characteristics. Diarization can be used for helping speech recognition, facilitating the searching and indexing of audio archives, and increasing the richness of automatic transcriptions, making them more readable. In this paper, we provide an overview of the approaches currently used in a key area of audio diarization, namely speaker diarization, and discuss their relative merits and limitations. Performances using the different techniques are compared within the framework of the speaker diarization task in the DARPA EARS Rich Transcription evaluations. We also look at how the techniques are being introduced into real broadcast news systems and their portability to other domains and tasks such as meetings and speaker verification.

机译：音频二值化是用信息注释输入音频通道的过程，该信息将信号能量的时间区域（可能重叠）归因于其特定来源。这些源可以包括特定的扬声器，音乐，背景噪声源以及其他信号源/通道特征。 Diarization可用于帮助语音识别，促进音频档案的搜索和索引，以及增加自动转录的丰富程度，使其更具可读性。在本文中，我们概述了当前音频扩音关键领域（即扬声器扩音）中使用的方法，并讨论了它们的相对优缺点。在DARPA EARS Rich Transcription评估中，在说话人差异化任务的框架内比较了使用不同技术的演奏。我们还将研究如何将这些技术引入真实的广播新闻系统中，以及它们在其他领域和任务（例如会议和演讲者验证）中的可移植性。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2006年第5期|p.1557-1565|共9页
作者
Tranter S.E.; Reynolds D.A.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
audio signal processing; speaker recognition; DARPA EARS Rich Transcription evaluations; audio archives; audio diarization; automatic speaker diarization systems; automatic transcriptions; broadcast news systems; input audio channel annotation; speaker verificati;

机译：音频信号处理;说话人识别;DARPA EARS丰富的转录评估;音频档案;音频二值化;自动说话人二值化系统;自动转录;广播新闻系统;输入音频通道注释;扬声器验证;

相似文献

外文文献
中文文献
专利

1. Development of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study [J] . Mihelic France, Vesnicer Bostjan, Zibert Janez Journal of computing and information technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者区分系统的开发：一个案例研究
2. Development Of A Speaker Diarization System For Speaker Tracking In Audio Broadcast News: A Case Study [J] . Janez Zibert, Bostjan Vesnicer, France Mihelic Journal of Computing and Information Technology . 2008,第3期

机译：音频广播新闻中演讲者跟踪的演讲者差异化系统的开发：一个案例研究
3. A new architecture based VAD for speaker diarization/detection systems [J] . Ouassila Kenai, Siham Ouamour, Mhania Guerti, International journal of speech technology . 2019,第3期

机译：基于新架构的VAD，用于说话人区分/检测系统
4. Automatic named identification of speakers using diarization and ASR systems [C] . Jousse V., Petit-Renaud S., Meignier S., IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 . 2009

机译：使用差分和ASR系统自动命名演讲者
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. An overview of automatic speaker diarization systems [O] . Sue E. Tranter, Douglas A. Reynolds, Senior Member 2006

机译：自动扬声器二值化系统概述
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

An overview of automatic speaker diarization systems

摘要

著录项

相似文献

相关主题

期刊订阅