NTT Technical Review

Reducing the Cost of Metadata Generation by Using Video/Audio Indexing and Natural Language Processing Techniques



Abstract

Reducing the cost of generating metadata will allow more broadcast content to be transmitted with advanced viewing options. In this article, we describe SceneCabinet, a system that automatically extracts scene-based semantic metadata from video content. It extracts meaningful video slices and their associated textual information, such as the title, synopsis, and keywords, by applying natural language processing to the results of speech and on-screen text recognition. Moreover, it can import video program scripts and use them for automatic keyword extraction. SceneCabinet provides an intuitive user interface, including a video browser with key images detected automatically based on scene changes, on-screen text, camerawork, speech, and music information. Experiments showed that SceneCabinet can significantly reduce metadata generation costs.
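
The abstract does not say how keywords are derived from the recognition results, so the following is only a minimal sketch of the general idea: the speech and on-screen text recognized for one scene are merged, and the most frequent non-trivial words are kept as keywords. The `Scene` record, the `extract_keywords` function, the stopword list, and the plain term-frequency ranking are all assumptions made for illustration and are not taken from SceneCabinet.

```python
import re
from collections import Counter
from dataclasses import dataclass
from typing import List

# Tiny illustrative stopword list (an assumption, not from the article).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "on", "for", "with", "at"}


@dataclass
class Scene:
    """Hypothetical per-scene record holding recognition results."""
    start_sec: float
    end_sec: float
    speech_text: str = ""    # assumed output of a speech recognizer
    onscreen_text: str = ""  # assumed output of on-screen text recognition


def extract_keywords(scene: Scene, top_n: int = 3) -> List[str]:
    """Rank words by frequency across both text sources (a stand-in technique)."""
    text = f"{scene.speech_text} {scene.onscreen_text}".lower()
    tokens = [t for t in re.findall(r"[a-z]+", text)
              if t not in STOPWORDS and len(t) > 2]
    counts = Counter(tokens)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:top_n]]


if __name__ == "__main__":
    scene = Scene(
        start_sec=0.0,
        end_sec=10.0,
        speech_text="The typhoon is approaching the coast. Typhoon warnings issued.",
        onscreen_text="TYPHOON WARNING",
    )
    print(extract_keywords(scene))
```

A production system would likely weight on-screen text and imported scripts differently from recognized speech and would use language-specific morphological analysis rather than a regular expression. This sketch only shows how the two recognition outputs can feed one keyword list per scene.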
