Semantic Multimedia; Lecture Notes in Computer Science; 4306

Automated Speech and Audio Analysis for Semantic Access to Multimedia

Abstract

The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content and, as a consequence, improve the effectiveness of conceptual access tools. This paper gives an overview of the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques will be presented, including the alignment of speech and text resources, large vocabulary speech recognition, keyword spotting and speaker classification. The applicability of the techniques will be discussed from a media-crossing perspective. The added value of the techniques and their potential contribution to the content value chain will be illustrated by the description of two (complementary) demonstrators for browsing broadcast news archives.
