IEEE International Conference on Technology for Education

Automated Tagging to Enable Fine-Grained Browsing of Lecture Videos

Abstract

Many universities offer distance learning by recording classroom lectures and making them accessible to remote students over the Internet. A university's repository usually contains hundreds of such lecture videos. Each lecture video is typically about an hour long and is often monolithic. It is cumbersome for students to search through an entire video, or across many videos, to find the portions of immediate interest. It is desirable to have a system that takes user-supplied keywords as a query and links not only to the corresponding lecture videos but also to the relevant section within each video. To this end, lecture videos are sometimes tagged with metadata so that the different sections can be identified easily. However, such tagging is often done manually and is time-consuming. In this paper, we propose a technique to automatically generate tags for lecture videos. It is based on generating speech transcripts automatically with a speech recognition engine and then automatically indexing and searching the transcripts. We also describe the system we implemented for easily browsing a lecture video repository. The system takes keywords from users as a query and returns a list of videos as results. In each retrieved video, the portion that matches the query is highlighted so that users can navigate directly to that location. By following the approach and using the open source tools mentioned in the paper, a lecture video repository can give users easy access to the content they need. We used open source libraries for speech recognition and text search. In experiments to test the performance of our system, we achieved a recall of 0.72 and an average precision of 0.84 for video retrieval.
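
The retrieval idea described in the abstract, indexing time-stamped transcript text and mapping keyword hits back to positions inside a video, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the libraries used, so the example relies only on the Python standard library. The segment data, the names `SEGMENTS`, `build_index`, and `search`, and the all-keywords-must-match rule are assumptions made for illustration.

```python
import re
from collections import defaultdict

# Hypothetical transcript segments: (video_id, start_time_in_seconds, text).
# A real system would obtain these from a speech recognizer's time-aligned output.
SEGMENTS = [
    ("lec01", 0.0, "welcome to the course on operating systems"),
    ("lec01", 95.0, "a process is a program in execution"),
    ("lec02", 30.0, "virtual memory uses paging and page tables"),
    ("lec02", 410.0, "the scheduler picks the next process to run"),
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(segments):
    """Build an inverted index mapping each term to the set of segment ids containing it."""
    index = defaultdict(set)
    for seg_id, (_video_id, _start, text) in enumerate(segments):
        for term in tokenize(text):
            index[term].add(seg_id)
    return index


def search(index, segments, query):
    """Return {video_id: [start times]} for segments that contain every query term."""
    terms = tokenize(query)
    if not terms:
        return {}
    # Assumption: a segment matches only if it contains all keywords (AND semantics).
    hits = set.intersection(*(index.get(term, set()) for term in terms))
    results = defaultdict(list)
    for seg_id in sorted(hits):
        video_id, start, _text = segments[seg_id]
        results[video_id].append(start)
    return dict(results)


if __name__ == "__main__":
    index = build_index(SEGMENTS)
    print(search(index, SEGMENTS, "process"))
```

Each returned start time is an offset a player could seek to, which corresponds to the highlighted portion of the video described above. A production system would more plausibly delegate indexing and ranking to a dedicated text search library instead of the in-memory dictionary used here.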
