首页> 外文会议>International Workshop on Machine Learning for Multimodal Interaction >Speech Transcription and Spoken Document Retrieval in Finnish
【24h】

Speech Transcription and Spoken Document Retrieval in Finnish

机译:芬兰语中的语音转录和口头文献检索

获取原文

摘要

This paper presents a baseline spoken document retrieval system in Finnish that is based on unlimited vocabulary continuous speech recognition. Due to its agglutinative structure, Finnish speech can not be adequately transcribed using the standard large vocabulary continuous speech recognition approaches. The definition of a sufficient lexicon and the training of the statistical language models are difficult, because the words appear transformed by many inflections and compounds. In this work we apply the recently developed language model that enables n-gram models of morpheme-like subword units discovered in an unsupervised manner. In addition to word-based indexing, we also propose an indexing based on the subword units provided directly by our speech recognizer, and a combination of the both. In an initial evaluation of newsreading in Finnish, we obtained a fairly low recognition error rate and average document retrieval precisions close to what can be obtained from human reference transcripts.
机译:本文介绍了芬兰语中的基线口头文档检索系统,基于无限词汇连续语音识别。由于其凝集结构,使用标准的大词汇连续语音识别方法无法充分转录芬兰语演讲。足够的词典和统计语言模型的培训的定义很困难,因为这些词看起来被许多拐点和化合物转化。在这项工作中,我们应用最近开发的语言模型,该模型使得能够以无监督方式发现的类似语素样子字单元的n-gram模型。除了基于单词的索引之外,我们还提出了基于我们的语音识别器直接提供的子字单元的索引,以及两者的组合。在芬兰语中的新闻稿的初步评估中,我们获得了相当低的识别错误率和平均文档检索精度,接近可以从人类参考转录物中获得的内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号