首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >DYSFLUENT SPEECH DETECTION BY IMAGE FORENSICS TECHNIQUES
【24h】

DYSFLUENT SPEECH DETECTION BY IMAGE FORENSICS TECHNIQUES

机译:图像取证技术的渗透语音检测

获取原文
获取外文期刊封面目录资料

摘要

As speech recognition has become popular, the importance of dysfluency detection increased considerably. Once a dys-fluent event in spontaneous speech is identified, the speech recognition performance could be enhanced by eliminating its negative effect. Most existing techniques to detect such dysfluent events are based on statistical models. Sparse regularity of dysfluent events and complexity to describe such events in a speech recognition system makes its recognition rigorous. These problems are addressed by our algorithm inspired by image forensics. This paper suggests our algorithm developed to extract novel features of complex dysflu-encies. The common steps of classifier design were used to statistically evaluate the proposed features of complex dys-fluencies in spectral and cepstral domains. Support vector machines perform objective assessment of MFCC features, MFCC based derived features, PCA based derived features and kernel PCA based derived features of complex dysfluen-cies, where our derived features increased the performance by 46% opposite to MFCC.
机译:由于语音识别变得流行,但功能性检测的重要性大大增加。一旦识别出自发语音的患者流畅的事件,可以通过消除其负效应来提高语音识别性能。最多现有的检测这种漏洞事件的技术基于统计模型。漏洞事件和复杂性描述语音识别系统中此类事件的稀疏规律性使其认可严格。我们的算法通过Image取证的启发来解决这些问题。本文建议我们开发的算法,以提取复杂的Dysflu-Conuies的新功能。分类器设计的常用步骤用于统计评估谱和抗搏动结构域中复杂性能流量的拟议特征。支持向量机对MFCC特征,MFCC的派生功能,基于PCA的派生功能和基于复杂的Dysfluen-cies的派生功能进行客观评估,其中我们的衍生功能将性能提高了46%与MFCC相反的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号