首页> 外文期刊>Multimedia Tools and Applications >Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
【24h】

Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet

机译:使用国际语音字母在多语言视听文档中基于内容的搜索

获取原文
获取原文并翻译 | 示例
       

摘要

We present in this paper an approach based on the use of the International Phonetic Alphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents. The approach works even if the languages of the document are unknown. It has been validated in the context of the "Star Challenge" search engine competition organized by the Agency for Science, Technology and Research (A~*STAR) of Singapore. Our approach includes the building of an IPA-based multilingual acoustic model and a dynamic programming based method for searching document segments by "IPA string spotting". Dynamic programming allows for retrieving the query string in the document string even with a significant transcription error rate at the phone level. The methods that we developed ranked us as first and third on the monolingual (English) search task, as fifth on the multilingual search task and as first on the multimodal (audio and image) search task.
机译:我们在本文中提出了一种基于国际语音字母(IPA)的基于内容的多语言视听文档索引和检索方法。即使文档的语言未知,该方法仍然有效。它已经在新加坡科学技术研究局(A〜* STAR)组织的“明星挑战”搜索引擎竞赛中得到了验证。我们的方法包括建立基于IPA的多语言声学模型和基于动态编程的方法,以通过“ IPA字符串发现”来搜索文档片段。动态编程允许检索文档字符串中的查询字符串,即使电话级别的转录错误率很高。我们开发的方法使我们在单语(英语)搜索任务上排名第一和第三,在多语种搜索任务上排名第五,在多模式(音频和图像)搜索任务上排名第一。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号