Conference: International Workshop on Machine Learning for Multimodal Interaction

The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems

Abstract

This paper describes the Speech Activity Detection (SAD) and Speaker Diarization (SPKR) systems developed by Athens Information Technology for the NIST RT-06S evaluations. The SAD system classifies recorded frames as speech or non-speech using Linear Discriminant Analysis (LDA). The SPKR system first segments recordings into speech intervals based on the Bayesian Information Criterion (BIC), and then applies a two-step clustering strategy to group segments from the same speaker. Following a discussion of the internals of the two systems, we report and comment on our results on the RT-06S corpus [20].
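
As a rough illustration of BIC-based segmentation, the sketch below scores one candidate change point by comparing a single Gaussian fitted to a whole window of feature frames against two Gaussians fitted to its two halves. The abstract does not specify the features, the covariance model, or the penalty weight, so the full-covariance Gaussians, the `penalty_weight` parameter, and the function name `delta_bic` are assumptions made here for illustration and are not the authors' implementation.

```python
import numpy as np


def delta_bic(frames, t, penalty_weight=1.0):
    """Score a candidate change point at index t in `frames` (N x d array).

    A positive value favors modeling the window as two segments
    (a change at t); a negative value favors a single segment.
    Assumes full-covariance Gaussian models (an illustrative choice).
    """
    frames = np.asarray(frames, dtype=float)
    n, d = frames.shape

    def logdet_cov(x):
        # Maximum-likelihood covariance, lightly regularized for stability.
        cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True))
        cov = cov + 1e-6 * np.eye(d)
        _, logdet = np.linalg.slogdet(cov)
        return logdet

    left, right = frames[:t], frames[t:]
    # Log-likelihood gain from using two Gaussians instead of one.
    gain = 0.5 * (
        n * logdet_cov(frames)
        - len(left) * logdet_cov(left)
        - len(right) * logdet_cov(right)
    )
    # Penalty for the extra mean vector and covariance matrix.
    n_params = d + d * (d + 1) / 2
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    return gain - penalty


# Synthetic demo: two "speakers" with different feature means.
rng = np.random.default_rng(0)
changed = np.vstack([rng.normal(0.0, 1.0, (200, 3)),
                     rng.normal(5.0, 1.0, (200, 3))])
homogeneous = rng.normal(0.0, 1.0, (400, 3))

score_changed = delta_bic(changed, 200)
score_homogeneous = delta_bic(homogeneous, 200)
```

In a segmenter of this kind, the score would be evaluated at many candidate positions inside a sliding window, and a boundary would be placed where it peaks above zero. Raising `penalty_weight` makes the detector more conservative.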