Speech Transcription and Spoken Document Retrieval in Finnish

机译：芬兰语中的语音转录和口头文献检索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a baseline spoken document retrieval system in Finnish that is based on unlimited vocabulary continuous speech recognition. Due to its agglutinative structure, Finnish speech can not be adequately transcribed using the standard large vocabulary continuous speech recognition approaches. The definition of a sufficient lexicon and the training of the statistical language models are difficult, because the words appear transformed by many inflections and compounds. In this work we apply the recently developed language model that enables n-gram models of morpheme-like subword units discovered in an unsupervised manner. In addition to word-based indexing, we also propose an indexing based on the subword units provided directly by our speech recognizer, and a combination of the both. In an initial evaluation of newsreading in Finnish, we obtained a fairly low recognition error rate and average document retrieval precisions close to what can be obtained from human reference transcripts.

机译：本文介绍了芬兰语中的基线口头文档检索系统，基于无限词汇连续语音识别。由于其凝集结构，使用标准的大词汇连续语音识别方法无法充分转录芬兰语演讲。足够的词典和统计语言模型的培训的定义很困难，因为这些词看起来被许多拐点和化合物转化。在这项工作中，我们应用最近开发的语言模型，该模型使得能够以无监督方式发现的类似语素样子字单元的n-gram模型。除了基于单词的索引之外，我们还提出了基于我们的语音识别器直接提供的子字单元的索引，以及两者的组合。在芬兰语中的新闻稿的初步评估中，我们获得了相当低的识别错误率和平均文档检索精度，接近可以从人类参考转录物中获得的内容。

著录项

来源
《International Workshop on Machine Learning for Multimodal Interaction》|2005年||共10页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. SYLLABLE-BASED CHINESE TEXT/SPOKEN DOCUMENT RETRIEVAL USING TEXT/SPEECH QUERIES [J] . BO-REN BAI, BERLIN CHEN, HSIN-MIN WANG International Journal of Pattern Recognition and Artificial Intelligence . 2000,第5期

机译：基于文本/语音查询的基于音节的中文文本/语音文档检索
2. Word Topic Models for Spoken Document Retrieval and Transcription [J] . BERLIN CHEN ACM transactions on Asian language information processing . 2009,第1期

机译：语音文档检索和转录的Word主题模型
3. SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word [J] . Hansen J.H.L., Huang R., Zhou B., IEEE Transactions on Speech and Audio Proceessing . 2005,第5期

机译：SpeechFind：国家语言单词库的语音文档检索进展
4. Speech Transcription and Spoken Document Retrieval in Finnish [C] . International Workshop on Machine Learning for Multimodal Interaction . 2005

机译：芬兰语中的语音转录和口头文献检索
5. Audio parsing and rapid speaker adaptation in speech recognition for spoken document retrieval. [D] . Zhou, Bowen. 2003

机译：语音识别中的音频解析和快速的说话人自适应，可用于语音文档检索。
6. Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions [O] . Feifan Liu, Gokhan Tur, Dilek Hakkani-Tür, 2011

机译：走向口语临床问题的答案：针对口语临床问题评估和改编自动语音识别系统
7. Multimedia Fusion in Automatic Extraction of Studio Speech Segments for Spoken Document Retrieval [O] . Pui Yu Hui, Wai Kit Lo, Helen M. Meng 2009

机译：用于语音文档检索的Studio语音片段自动提取中的多媒体融合

Speech Transcription and Spoken Document Retrieval in Finnish

摘要

著录项

相似文献

相关主题

期刊订阅