首页> 外文会议>European Conference on Speech Communication and Technology >Compound decomposition in Dutch large vocabulary speech recognition
【24h】

Compound decomposition in Dutch large vocabulary speech recognition

机译:复合分解在荷兰大词汇语音识别

获取原文

摘要

This paper addresses compound splitting for Dutch in the context of broadcast news transcription. Language models were created using original text versions and text versions that were decomposed using a data-driven compound splitting algorithm. Language model performances were compared in terms of out-of-vocabulary rates and word error rates in a real-world broadcast news transcription task. It was concluded that compound splitting does improve ASR performance. Best results were obtained when frequent compounds were not decomposed.
机译:本文在广播新闻转录的背景下涉及荷兰语的复合拆分。使用使用数据驱动的复合拆分算法进行分解的原始文本版本和文本版本来创建语言模型。在现实世界广播新闻转录任务中的词汇流率和单词错误率方面进行了语言模型表演。结论是,复合分裂确实改善了ASR性能。当频繁化合物没有分解时获得了最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号