首页> 外文会议>Signal and Image Processing >AN APPROACH FOR RETRIEVING INQUIRIES IN TV BROADCASTS IN A DISASTER BY SUBWORD BASED SPEECH RETRIEVAL BY SPEECH

【24h】

AN APPROACH FOR RETRIEVING INQUIRIES IN TV BROADCASTS IN A DISASTER BY SUBWORD BASED SPEECH RETRIEVAL BY SPEECH

机译：基于子词的语音检索中的灾难性电视广播查询检索方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a new type of a video retrieval system that identifies target video sections by a text or speech query. The system is applied to retrieve inquiries in a special TV broadcast program in a disaster, such as the Niigata Chuetsu Earthquake in Japan. The system uses a subword model such as phone or tri-phone models. Subword models do not impose vocabulary constraints to the system. This flexibility of query words is needed for retrieval systems because most keywords are basically proper nouns that correspond to the person a user wants to search for. The system based on speech recognition does not work well because the proper nouns cannot be prepared beforehand. The system utilizes phonetic similarities between subword models to improve the retrieval performance. The phonetic similarity used in the system is obtained by defining the statistical distance between any two subword models that are composed of HMMs. We conducted some experiments to show the effectiveness and possibility of our method, and the system works well for the retrieval of inquiries in real TV disaster broadcasting.

机译：我们提出了一种新型的视频检索系统，该系统可通过文本或语音查询来识别目标视频部分。该系统适用于在特殊情况下（例如日本的新泻县中越地震）通过特殊电视广播节目检索查询。系统使用子词模型，例如电话或三电话模型。子词模型不会对系统施加词汇限制。检索系统需要查询词的这种灵活性，因为大多数关键字基本上都是与用户想要搜索的人相对应的专有名词。基于语音识别的系统无法正常运行，因为无法事先准备专有名词。该系统利用子词模型之间的语音相似性来提高检索性能。通过定义由HMM组成的任何两个子词模型之间的统计距离，可以获得系统中使用的语音相似度。我们进行了一些实验以证明该方法的有效性和可能性，并且该系统可以很好地用于实际电视灾难广播中的查询检索。

著录项

来源
《Signal and Image Processing》|2005年|P.34-39|共6页
会议地点 HonoluluHI(US)
作者
Kohei Iwata; Yoshiaki Itoh; Kazunori Kojima; Masaaki Ishigame; Kazuyo Tanaka; Shi-wook Lee;
展开▼
作者单位

Faculty of Software and Information Science, Iwate Prefectural University, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;图像信号处理;
关键词
3D object extraction; surface reconstruction; I3D; remote sensing; image visualization; computer vision;

机译：3D对象提取;表面重建; I3D;遥感;图像可视化;计算机视觉;

相似文献

外文文献
中文文献
专利

1. Experiments in syllable-based retrieval of broadcast news speech in Mandarin Chinese [J] . Hsin-min Wang 20f Speech Communication . 2000,第1a2期

机译：基于音节的普通话广播新闻语音检索实验
2. An efficient retrieval approach for encrypted speech based on biological hashing and spectral subtraction [J] . Qiu-yu Zhang, Gai-li Li, Yi-bo Huang Multimedia Tools and Applications . 2020,第39a40期

机译：基于生物散列的加密语音的有效检索方法和光谱减法
3. Spotting words in silent speech videos: a retrieval-based approach [J] . Jha Abhishek, Namboodiri Vinay P., Jawahar C. V. Machine Vision and Applications . 2019,第2期

机译：在无声语音视频中发现单词：基于检索的方法
4. AN APPROACH FOR RETRIEVING INQUIRIES IN TV BROADCASTS IN A DISASTER BY SUBWORD BASED SPEECH RETRIEVAL BY SPEECH [C] . Kohei Iwata, Yoshiaki Itoh, Kazunori Kojima, IASTED International Conference on Signal and Image Processing . 2005

机译：语音通过语音检索次字的语音检索在灾难中检索电视广播的探讨方法
5. Women's speech as reflected in the television series, Friends. [D] . Del Moral, Gema. 2015

机译：电视连续剧《朋友》中反映的女性演讲。
6. The Presence of ‘Um’ as a Marker of Truthfulness in the Speech of TV Personalities [O] . Gina Villar, Paola Castillo 2017

机译：电视人物演说中 Um作为真实性标记的存在
7. AN INFORMATION GAIN AND GRAMMAR COMPLEXITY BASED APPROACH TO ATTRIBUTE SELECTION IN SPEECH ENABLED INFORMATION RETRIEVAL DIALOGS [O] . Haiping Li, Haixin Chai 2014

机译：基于信息增益和语法复杂度的语音选择方法在语音启用信息检索对话中的应用

AN APPROACH FOR RETRIEVING INQUIRIES IN TV BROADCASTS IN A DISASTER BY SUBWORD BASED SPEECH RETRIEVAL BY SPEECH

摘要

著录项

相似文献

相关主题

期刊订阅