首页> 外文会议>IEEE International Symposium on Circuits and Systems >Open domain continuous filipino speech recognition with code-switching
【24h】

Open domain continuous filipino speech recognition with code-switching

机译:带代码切换的开放域连续菲律宾语音识别

获取原文

摘要

It is widely known that database quality has a huge impact on speech recognition system performance, most especially when the expected domain is well represented. In this paper, we use this idea as leverage for a data-driven solution to the problem of code-switching in Filipino. Practical Filipino conversations often contain English and other loan words in varying frequencies, demanding better training of parameters and models for its speech recognition system. We alleviate the underrepresentation of loan words through the development of a new speech database for training, and applied appropriate data analysis to make reliable evaluation results. The best system was searched via lattice rescoring from a cross-validation set containing almost three hours of unknown speech data. The description and results of our experiments serve as a new and competent baseline model for succeeding future developments.
机译:众所周知,数据库质量对语音识别系统的性能有很大的影响,尤其是当期望的域被很好地表示时。在本文中,我们将这种想法用作解决菲律宾代码转换问题的数据驱动解决方案的杠杆。实用的菲律宾对话通常包含不同频率的英语和其他借词,要求对其语音识别系统进行更好的参数和模型训练。我们通过开发新的语音数据库进行培训来减轻外来词不足的表现,并应用适当的数据分析来得出可靠的评估结果。最好的系统是通过从包含几乎三个小时的未知语音数据的交叉验证集中进行格点搜索来搜索的。我们的实验说明和结果可作为未来成功开发的新的,能胜任的基线模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号