Multilingual query by example spoken term detection for under-resourced languages


Abstract

We propose a query-by-example approach to multilingual Spoken Term Detection for under-resourced languages, based on Automatic Speech Recognition. The approach addresses the two main difficulties met under these conditions: it provides a new method for building multilingual acoustic models from little annotated data, and it searches approximate Automatic Speech Recognition transcriptions in a highly scalable way. The acoustic models are obtained by adapting well-trained phoneme models to the phonemes of the target languages. The mapping follows the International Phonetic Alphabet phoneme classification and a confusion matrix. To improve the search, weighting by query length and alignment spread is incorporated into the Dynamic Time Warping technique. Experimental validation was conducted on a standard data set consisting of 3 hours of mixed African languages. The recorded speech is of telephone quality and mixes read and spontaneous speech.
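The search step described in the abstract can be illustrated with a minimal sketch. It assumes the query and the searched recording are both available as phoneme strings (for example, from approximate ASR transcriptions) and runs a subsequence Dynamic Time Warping over them. The 0/1 substitution cost, the function names, the `alpha` weight, and the exact way query length and alignment spread enter the score are all assumptions made for illustration; the abstract does not give the authors' formulas.

```python
from typing import List, Tuple


def phone_cost(a: str, b: str) -> float:
    """Hypothetical substitution cost: 0 for identical phonemes, 1 otherwise.

    A cost derived from a phoneme confusion matrix could be used instead.
    """
    return 0.0 if a == b else 1.0


def subsequence_dtw(query: List[str], doc: List[str]) -> Tuple[float, int, int]:
    """Find the best match of `query` anywhere inside `doc`.

    Returns (accumulated cost, start index, end index), indices 0-based
    and inclusive, referring to positions in `doc`.
    """
    n, m = len(query), len(doc)
    if n == 0 or m == 0:
        raise ValueError("query and doc must be non-empty")

    cost = [[0.0] * m for _ in range(n)]   # accumulated alignment cost
    start = [[0] * m for _ in range(n)]    # where each path entered the doc

    # First query phoneme: the match may begin at any doc position for free.
    for j in range(m):
        cost[0][j] = phone_cost(query[0], doc[j])
        start[0][j] = j

    for i in range(1, n):
        # First doc column: only a vertical step is possible.
        cost[i][0] = cost[i - 1][0] + phone_cost(query[i], doc[0])
        start[i][0] = 0
        for j in range(1, m):
            prev_cost, prev_start = min(
                (cost[i - 1][j - 1], start[i - 1][j - 1]),  # diagonal
                (cost[i - 1][j], start[i - 1][j]),          # vertical
                (cost[i][j - 1], start[i][j - 1]),          # horizontal
                key=lambda t: t[0],
            )
            cost[i][j] = prev_cost + phone_cost(query[i], doc[j])
            start[i][j] = prev_start

    # The match may end at any doc position: pick the cheapest one.
    end = min(range(m), key=lambda j: cost[n - 1][j])
    return cost[n - 1][end], start[n - 1][end], end


def detection_score(query: List[str], doc: List[str], alpha: float = 0.5) -> float:
    """Hypothetical score (lower is better).

    The DTW cost is normalized by query length, and a penalty is added
    when the alignment spread differs from the query length.
    """
    dtw_cost, first, last = subsequence_dtw(query, doc)
    n = len(query)
    spread = last - first + 1
    return dtw_cost / n + alpha * abs(spread - n) / n


if __name__ == "__main__":
    print(subsequence_dtw(list("kat"), list("sikato")))
    print(detection_score(list("kat"), list("sikot")))
```

Normalizing by query length keeps short and long queries comparable, and the spread term penalizes matches that stretch or compress the query over an implausible span of the transcription. Both are one possible reading of the weighting mentioned above, not a reproduction of it.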

Bibliographic record

Source: Conference on Speech Technology and Human-Computer Dialogue, 2013, 6 pages
Authors: Buzo Andi; Cucu Horia; Safta Mihai; Burileanu Corneliu
Original format: PDF
CLC classification: Electroacoustic technology and speech signal processing
Keywords: multilingual acoustic model; spoken term detection; under-resourced languages

Similar literature

1. Multilingual query-by-example spoken term detection in Indian languages [J]. Abhimanyu Popli, Arun Kumar. International Journal of Speech Technology, 2019, no. 1.
2. Two-stage spoken term detection system for under-resourced languages [J]. Deekshitha G., Mary Leena. Signal Processing, IET, 2020, no. 9.
3. Comparison of Methods for Language-Dependent and Language-Independent Query-by-Example Spoken Term Detection [J]. Javier Tejedor, Michal Fapso, Igor Szoeke. ACM Transactions on Information Systems, 2012, no. 3.
4. Multilingual query by example spoken term detection for under-resourced languages [C]. Buzo Andi, Cucu Horia, Safta Mihai. 2013 7th Conference on Speech Technology and Human-Computer Dialogue, 2013.
5. Adaptation and Augmentation: Towards Better Rescoring Strategies for Automatic Speech Recognition and Spoken Term Detection [D]. Ma, Min. 2018.
6. Word Detection in Sung and Spoken Sentences in Children With Typical Language Development or With Specific Language Impairment [O]. Clément Planchou, Sylvain Clément, Renée Béland. 2015.
7. Multilingual Bottleneck Features for Query by Example Spoken Term Detection [O]. Dhananjay Ram, Lesly Miculicich, Herve Bourlard. 2019.
