Conference: Signal and Image Processing

AN APPROACH FOR RETRIEVING INQUIRIES IN TV BROADCASTS IN A DISASTER BY SUBWORD BASED SPEECH RETRIEVAL BY SPEECH

Abstract

We propose a new type of video retrieval system that identifies target video sections from a text or speech query. The system is applied to retrieving inquiries in special TV broadcast programs aired during a disaster, such as the Niigata Chuetsu Earthquake in Japan. The system uses subword models, such as phone or triphone models, which do not impose vocabulary constraints on the system. This flexibility in query words is needed for retrieval because most keywords are proper nouns that correspond to the person a user wants to search for. A system based on speech recognition does not work well because the proper nouns cannot be prepared beforehand. The system exploits phonetic similarities between subword models to improve retrieval performance. The phonetic similarity is obtained by defining a statistical distance between any two subword models that are composed of HMMs. We conducted experiments to show the effectiveness and feasibility of our method, and the system works well for retrieving inquiries in real TV disaster broadcasts.
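
The idea of matching a query against a subword sequence using phonetic similarity can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the matching algorithm, so this example assumes a simple dynamic-programming keyword spotter over phone strings. The names `TOY_DISTANCE`, `phone_distance`, and `spot_query` are hypothetical, and the distance values are made up for illustration, whereas the paper derives them from a statistical distance between HMMs.

```python
from typing import Dict, Sequence, Tuple

# Hypothetical phone-to-phone distances (illustrative values only).
# In the paper these would come from a statistical distance between HMMs.
TOY_DISTANCE: Dict[Tuple[str, str], float] = {
    ("b", "p"): 0.3,
    ("d", "t"): 0.3,
    ("m", "n"): 0.4,
}


def phone_distance(a: str, b: str, default: float = 1.0) -> float:
    """Return 0 for identical phones, a table value for similar ones, else `default`."""
    if a == b:
        return 0.0
    return TOY_DISTANCE.get((a, b), TOY_DISTANCE.get((b, a), default))


def spot_query(
    query: Sequence[str], stream: Sequence[str], indel: float = 1.0
) -> Tuple[float, int]:
    """Find where `query` best matches inside `stream`.

    Returns (cost, end), where `end` is the exclusive index in `stream`
    at which the lowest-cost match ends. The match may start anywhere.
    """
    # prev[j]: best cost of aligning the query prefix so far, ending at stream position j.
    prev = [0.0] * (len(stream) + 1)  # an empty query matches anywhere at no cost
    for i in range(1, len(query) + 1):
        cur = [i * indel]  # i query phones against an empty stream prefix
        for j in range(1, len(stream) + 1):
            cur.append(min(
                prev[j - 1] + phone_distance(query[i - 1], stream[j - 1]),  # substitute or match
                prev[j] + indel,      # query phone missing from the stream
                cur[j - 1] + indel,   # extra phone in the stream
            ))
        prev = cur
    end = min(range(len(prev)), key=prev.__getitem__)
    return prev[end], end


if __name__ == "__main__":
    # A query name compared with a phone sequence containing a similar-sounding name.
    cost, end = spot_query(list("tanaka"), list("sodamakaer"))
    print(cost, end)
```

Because similar phones such as `t`/`d` or `n`/`m` are cheap to substitute in this sketch, a misrecognized name can still be found, and no word vocabulary is needed at any point.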