The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems

机译：2006年雅典信息技术演讲活动检测和扬声器日益改估系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes the Speech Activity Detection (SAD) and Speaker Diarization (SPKR) systems that were developed by the Athens Information Technology in the scope of the NIST RT-06S evaluations. The SAD system performs classification of recorded frames into speech and non-speech, using Linear Discriminant Analysis (LDA), while the SPKR one initially segments recordings into speech intervals based on the Bayesian Information Criterion (BIC), and then applies a two-step clustering strategy to group segments from the same speaker together. Following a discussion of the intrinsics of the two systems, we report and comment on our results on the RT-06S corpus [20].

机译：本文介绍了雅典信息技术在NIST RT-06S评估范围内开发的语音活动检测（SAD）和扬声器深度（SPKR）系统。 SAD系统使用线性判别分析（LDA）执行记录的帧的分类并非语音，而SPKR首先将录像为基于贝叶斯信息标准（BIC）的语音间隔，然后应用两步将策略与同一扬声器组合在一起进行分组。在讨论两个系统的内在机构之后，我们向RT-06S语料库进行报告和评论我们的结果[20]。

著录项

来源
《International workshop on machine learning for multimodal interaction》|2006年||共11页
会议地点
作者
Elias Rentzeperis; Andreas Stergiou; Christos Boukis; Aristodemos Pnevmatikakis; Lazaros C. Polymenakos;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言;
关键词
入库时间 2022-08-20 21:02:06

相似文献

外文文献
中文文献
专利

1. An Efficient Speaker Diarization using Privacy Preserving Audio Features Based of Speech/Non Speech Detection [J] . S.Sathyapriya, A.Indhumathi International Journal of Computer Trends and Technology . 2014,第4期

机译：基于语音/非语音检测的使用隐私保护音频功能的有效说话人区分
2. Overlapping Speech Detection Using Long-Term Conversational Features for Speaker Diarization in Meeting Room Conversations [J] . Yella S.H., Bourlard H. Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2014,第12期

机译：会议室会话中使用长期会话特征进行语音重叠的语音检测重叠
3. Simultaneous Speech Detection With Spatial Features for Speaker Diarization [J] . Zelenak M., Segura C., Luque J., Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第2期

机译：具有空间特征的同时语音检测，可实现说话人区分
4. The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems [C] . Elias Rentzeperis, Andreas Stergiou, Christos Boukis, Machine learning for multimodal interaction . 2006

机译：2006年雅典信息技术语音活动检测和说话者区分系统
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research [O] . Lukas Fürer, Nathalie Schenk, Volker Roth, 2020

机译：使用随机森林监督扬声器日期：一种心理治疗过程研究的工具
7. Speech overlap detection in a two-pass speaker diarization system [O] . Huijbregts M.A.H., Leeuwen D.A. van, Jong F. M. G de 2009

机译：两遍说话者区分系统中的语音重叠检测

The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems

摘要

著录项

相似文献

相关主题

期刊订阅