【24h】

Single-speaker/multi-speaker co-channel speech classification

机译:单口/多口同道语音分类

获取原文

摘要

The demand for content-based management and real-time manipulation of audio data is constantly increasing. This paper presents a method to identify temporal regions, in a segment of co-channel speech, as being either single-speaker or multi-speaker speech. The state of the art approach for this purpose is the kurtosis. In this paper, a set of complementary time-domain and frequency-domain features is studied. The employed classification scheme is the one-class SVM classifier. A recognition rate of 94.75 % is reached. The set of features providing the best performance is determined.
机译:基于内容的管理和音频数据的实时处理的需求在不断增长。本文提出了一种方法来识别同频道语音片段中的时间区域为单说话者语音还是多说话者语音。为此目的,最先进的方法是峰度。本文研究了一组互补的时域和频域特征。所采用的分类方案是一类SVM分类器。达到94.75%的识别率。确定提供最佳性能的功能集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号