首页> 外文会议>European Signal Processing Conference >Advances in Large Vocabulary Continuous Speech Recognition in Greek: Modeling and nonlinear features
【24h】

Advances in Large Vocabulary Continuous Speech Recognition in Greek: Modeling and nonlinear features

机译:希腊大词汇量连续语音识别的研究进展:建模和非线性特征

获取原文

摘要

The main goal of this work is the development of an improved Large Vocabulary Continuous Speech Recognition (LVCSR) framework in Greek. Language modeling is carried out in a collection of journalistic text and in the acoustic signal processing, a nonlinear approach is implemented for deriving features of the AM-FM type. Experimentation is carried out in both clean and simulated far-field speech offering insight about the acoustic modeling under adverse conditions with reverberation and additive ambient noise. Beyond the baseline implementation, a first step is made in exploring how standard (MFCCs and PLPs) and modulation features (AM-FM) behave in a LVCSR framework when the input speech is distant, like in real life home applications.
机译:这项工作的主要目标是开发改进的希腊语大词汇量连续语音识别(LVCSR)框架。语言建模在新闻文本集中进行,并且在声音信号处理中,采用非线性方法来推导AM-FM类型的功能。实验是在干净的和模拟的远场语音中进行的,可提供有关在不利条件下具有混响和附加环境噪声的声学建模的真知灼见。除了基线实现之外,第一步是探索当输入语音距离较远时,如在现实生活中的家庭应用中,标准(MFCC和PLP)和调制功能(AM-FM)在LVCSR框架中的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号