首页> 外文会议>Iberoamerican congress on pattern recognition >Selection of Lexical Units for Continuous Speech Recognition of Basque
【24h】

Selection of Lexical Units for Continuous Speech Recognition of Basque

机译:基于巴斯克连续语音识别的词汇单位的选择

获取原文

摘要

The selection of appropriate Lexical Units (LUs) is an important issue in the development of Continuous Speech Recognition (CSR) systems. Words have been used classically as the recognition unit in most of them. However, proposals of non-word units are beginning to arise. Basque is an agglutinative language with some structure inside words, for which non-word morpheme like units could be an appropriate choice. In this work a statistical analysis of units obtained after morphological segmentation has been carried out. This analysis shows a potential gain of confusion rates in CSR systems, due to the growth of the set of acoustically similar and short morphemes. Thus, several proposals of Lexical Units are analysed to deal with the problem. Measures of Phonetic Perplexity and Speech Recognition rates have been computed using different sets of units and, based on these measures, a set of alternative non-word units have been selected.
机译:选择适当的词汇单位(LUS)是开发连续语音识别(CSR)系统的重要问题。在大多数情况下,单词已被定期使用作为识别单元。但是,非词汇单位的建议开始出现。巴斯克是一种凝聚的语言,具有一些结构的语言,其中非词汇状况如此可能是一个适当的选择。在这项工作中,进行了形态分割后获得的单位的统计分析。由于声学类似和短的语素集的增长,该分析显示了CSR系统中的混淆率的潜在增益。因此,分析了词汇单位的若干建议以解决问题。使用不同的单位组计算了语音困惑和语音识别率的测量,并且基于这些措施,已经选择了一组替代的非字单元。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号