Journal: Communications of the ACM
Complementary Video and Audio Analysis for Broadcast News Archives

Abstract

The Informedia Digital Video Library project, initiated in 1994, uniquely combines speech, image, and natural language understanding to process broadcast video. The project's goal is to allow search and retrieval in the video medium, similar to what is available today for text only. To enable this access to video, fast, high-accuracy automatic transcriptions of broadcast news stories are generated through Carnegie Mellon's Sphinx speech recognition system, and closed captions are incorporated where available. Image processing determines scene boundaries, recognizes faces, and allows for image similarity comparisons.