A Soundex-Based Approach for Spoken Document Retrieval

机译：基于Soundex的语音文档检索方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Current storage and processing facilities have caused the emergence of many multimedia repositories and, consequently, they have also triggered the necessity of new approaches for information retrieval. In particular, spoken document retrieval is a very complex task since existing speech recognition systems tend to generate several transcription errors (such as word substitutions, insertions and deletions). In order to deal with these errors, this paper proposes an enriched document representation based on a phonetic codification of the automatic transcriptions. This representation aims to reduce the impact of the transcription errors by representing words with similar pronunciations through the same phonetic code. Experimental results on the CL-SR corpus from the CLEF 2007 (which includes 33 test topics and 8,104 English interviews) are encouraging; our method achieved a mean average precision of 0.0795, outperforming all except one of the evaluated systems at this forum.

机译：当前的存储和处理设施已导致出现了许多多媒体存储库，因此，它们也触发了信息检索新方法的必要性。特别地，语音文档检索是一项非常复杂的任务，因为现有的语音识别系统往往会产生一些转录错误（例如单词替换，插入和删除）。为了解决这些错误，本文提出了一种基于自动转录的语音编码的丰富文档表示形式。这种表示的目的是通过通过相同的语音代码表示具有相似发音的单词，从而减少转录错误的影响。来自CLEF 2007的CL-SR语料库的实验结果（包括33个测试主题和8,104个英语访谈）令人鼓舞；我们的方法的平均平均精度为0.0795，优于本论坛上除评估系统之外的所有系统。

著录项

来源
《MICAI 2008: Advances in Artificial Intelligence》|2008年|204-211|共8页
会议地点 Atizapan de Zaragoza(MX);Atizapan de Zaragoza(MX)
作者
M. Alejandro Reyes-Barragan; Luis Villasenor-Pineda; Manuel Montes-y-Gomez;
展开▼
作者单位

Laboratorio de Tecnologias del Lenguaje, Instituto Nacional de Astrofisica, Optica y Electronica, Mexico;

Laboratorio de Tecnologias del Lenguaje, Instituto Nacional de Astrofisica, Optica y Electronica, Mexico;

Laboratorio de Tecnologias del Lenguaje, Instituto Nacional de Astrofisica, Optica y Electronica, Mexico;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词
入库时间 2022-08-26 14:09:58

相似文献

外文文献
中文文献
专利

1. A novel approach to perform context-based automatic spoken document retrieval of political speeches based on wavelet tree indexing [J] . Gupta Anishka, Yadav Divakar Multimedia Tools and Applications . 2021,第14期

机译：基于小波树索引的基于语境的自动口语文献检索的新方法
2. SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word [J] . Hansen J.H.L., Huang R., Zhou B., IEEE Transactions on Speech and Audio Proceessing . 2005,第5期

机译：SpeechFind：国家语言单词库的语音文档检索进展
3. Statistical language models for query-by-example spoken document retrieval [J] . Paula Lopez-Otero, Javier Parapar, Alvaro Barreiro Multimedia Tools and Applications . 2020,第11a12期

机译：逐个示例统计语言模型进行查询语音文档检索
4. A Soundex-Based Approach for Spoken Document Retrieval [C] . M. Alejandro Reyes-Barragan, Luis Villasenor-Pineda, Manuel Montes-y-Gomez Mexican International Conference on Artificial Intelligence . 2008

机译：基于Soundex的语言检索方法
5. Robust spoken document retrieval in multilingual and noisy acoustic environments. [D] . Akbacak, Murat. 2009

机译：在多语言和嘈杂的声学环境中进行可靠的语音文档检索。
6. The Role of Grammatical Category Information in Spoken Word Retrieval [O] . Carolina Palma Duràn, Agnesa Pillon 2011

机译：语法类别信息在口语检索中的作用
7. A Soundex-based Approach for Spoken Document Retrieval [O] . M. Alej, Ro Reyes-barragán, Luis Villaseñor-pineda, 2014

机译：基于soundex的语音文档检索方法

A Soundex-Based Approach for Spoken Document Retrieval

摘要

著录项

相似文献

相关主题

期刊订阅