首页> 外文会议>International Conference on Emerging Smart Computing and Informatics >Searching a Video Database using Natural Language Queries
【24h】

Searching a Video Database using Natural Language Queries

机译:使用自然语言查询搜索视频数据库

获取原文

摘要

This paper describes an application that achieves voice based natural language query, search and extracted video segment playing after the search in order to query the content of the videos in a user-friendly manner. Two different models were explored for the same. The first model is implemented using an image captioning approach. Two different image captioning methods are used for creating tracklets, namely Densecap and NeuralTalk2. NeuralTalk2 generates a single appropriate caption for the entire image whereas Densecap generates multiple captions corresponding to specific regions of interest in the image. These captions are used to preprocess the video and create semantically similar tracklets. Given a video and a voice based natural language query, this system will produce video tracklets from the video that are semantically relevant to the query. The second model uses an audio processing approach. Here, first the transcripts generated by YouTube are collected. The voice query is taken as input and the most relevant segments of the video are retrieved using on-the-fly generation of tracks and merging if required. For finding Semantic similarity in both the models, first Universal Sentence Encoder (by Google) which uses a deep averaging network encoder (DAN) for converting the sentences into 512 dimensional vectors is used and then cosine similarity between the vectors is calculated.
机译:本文介绍了一个应用程序,实现基于语音的自然语言查询,搜索和提取的视频段播放,以便以用户友好的方式查询视频的内容。为此探索了两种不同的模型。使用图像标题方法来实现第一模型。两个不同的图像标题方法用于创建Tracklet,即Densecap和NeuralTalk2。 NeuralTalk2为整个图像生成一个合适的标题,而Decsecap生成对应于图像中的特定感兴趣区域的多个标题。这些标题用于预处理视频并创建语义相似的Tracklet。鉴于视频和基于语音的自然语言查询,该系统将从从语义上与查询相关的视频产生视频轨迹。第二种模型使用音频处理方法。在这里,首先收集由YouTube生成的成绩单。语音查询被视为输入,并且使用当机的曲目的轨道和使用时,检索视频的最相关的段。为了在模型中找到语义相似性,使用用于将用于将用于将用于将用于将句子转换为512个维度向量的网络编码器(DAN)的第一通用句子编码器(谷歌),然后计算向量之间的余弦相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号