Multilingual query by example spoken term detection for under-resourced languages

机译：通过示例口语词检测对资源不足的语言进行多语言查询

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a query-by-example approach to multilingual Spoken Term Detection for under-resourced languages based on Automatic Speech Recognition. The approach overcomes the main difficulties met under these conditions, i.e., providing a new method for building multilingual acoustic models with few annotated data and searching in approximate Automatic Speech Recognition transcriptions providing high scalability. The acoustic models are obtained by adapting well-trained phonemes to the ones from the envisaged languages. The mapping is made according to International Phonetic Alphabet phoneme classification and a confusion matrix. The weighting of query length and alignment spread are incorporated in the Dynamic Time Warping technique to improve the searching method. Experimental validation was conducted on a standard data set consisting of 3 hours of mixed African languages. The recorded speech has telephonic quality and it is a mix of read and spontaneous speech.

机译：我们提出了一种基于示例的方法，用于基于自动语音识别的资源不足语言的多语言口语检测。该方法克服了在这些条件下遇到的主要困难，即，提供了一种新的方法，该方法用于建立带有少量注释数据的多语言声学模型，并在近似的自动语音识别转录中进行搜索，以提供较高的可扩展性。声学模型是通过将训练有素的音素改编成设想的语言中的音素而获得的。根据国际语音字母音素分类和混淆矩阵进行映射。将查询长度的权重和对齐方式的扩展纳入动态时间规整技术中，以改进搜索方法。对包含3小时非洲混合语言的标准数据集进行了实验验证。录制的语音具有电话质量，是阅读语音和自发语音的混合体。

著录项

来源
《2013 7th Conference on Speech Technology and Human - Computer Dialogue》|2013年|1-6|共6页
会议地点 Cluj-Napoca(RO)
作者
Buzo Andi; Cucu Horia; Safta Mihai; Burileanu Corneliu;
展开▼
作者单位

Speech and Dialogue (SpeeD) Research Laboratory University Politehnica of Bucharest Bucharest, Romaniac;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
multilingual acoustic model; spoken term detection; under-resourced languages;

机译：多语言声学模型;口语检测;资源匮乏的语言;;

相似文献

外文文献
中文文献
专利

1. Multilingual query-by-example spoken term detection in Indian languages [J] . Abhimanyu Popli, Arun Kumar International journal of speech technology . 2019,第1期

机译：多语言示例查询印度语言中的口语术语检测
2. Two-stage spoken term detection system for under-resourced languages [J] . Deekshitha G., Mary Leena Signal Processing, IET . 2020,第9期

机译：资源低调语言的两级口语术语检测系统
3. Comparison of Methods for Language-Dependent and Language-Independent Query-by-Example Spoken Term Detection [J] . JAVIER TEJEDOR, MICHAL FAPSO, IGOR SZOEKE, ACM Transactions on Information Systems . 2012,第3期

机译：语言相关和语言独立示例查询口语查询方法的比较
4. Multilingual query by example spoken term detection for under-resourced languages [C] . Buzo Andi, Cucu Horia, Safta Mihai, Conference on Speech Technology and Human - Computer Dialogue . 2013

机译：多语言查询通过示例口语术语检测，用于资源介绍语言
5. Adaptation and Augmentation: Towards Better Rescoring Strategies for Automatic Speech Recognition and Spoken Term Detection [D] . Ma, Min. 2018

机译：适应和增强：寻求更好的自动语音识别和语音术语检测的评分策略
6. Word Detection in Sung and Spoken Sentences in Children With TypicalLanguage Development or With Specific Language Impairment [O] . Clément Planchou, Sylvain Clément, Renée Béland, 2015

机译：典型儿童口语和句子中的单词检测语言发展或有特定语言障碍
7. Multilingual Bottleneck Features for Query by Example Spoken Term Detection [O] . Dhananjay Ram, Lesly Miculicich, Herve Bourlard 2019

机译：通过示例说出术语检测的查询的多语言瓶颈特征

Multilingual query by example spoken term detection for under-resourced languages

摘要

著录项

相似文献

相关主题

期刊订阅