首页> 外文会议>International Workshop on Content-Based Multimedia Indexing >TOWARDS THE DETECTION AND THE CHARACTERIZATION OF CONVERSATIONAL SPEECH ZONES IN AUDIOVISUAL DOCUMENTS
【24h】

TOWARDS THE DETECTION AND THE CHARACTERIZATION OF CONVERSATIONAL SPEECH ZONES IN AUDIOVISUAL DOCUMENTS

机译:朝着视听文献中的会话语音区的检测和表征

获取原文

摘要

Giving access to the semantically rich content of large amounts of digital audiovisual data using an automatic and generic method is still an important challenge. The aim of our work is to address this issue while focusing on temporal aspects. Our approach is based on a method previously developed for analyzing temporal relations from a data mining point of view. This method is used to detect zones of a document in which two characteristics are active. These characteristics can result from low-level segmentations of the audio or video components, or from more semantic processings. Once "activity zones" have been detected, we propose to compute a set of additional descriptors in order to better characterize them. The method is applied in the scope of the EPAC project that focuses on the detection and the characterization of conversational speech.
机译:使用自动和通用方法提供对大量数字视听数据的语义丰富的内容仍然是一个重要的挑战。我们的作品的目的是解决这个问题,同时关注时间方面。我们的方法基于先前开发的用于分析来自数据挖掘的时间关系的方法。该方法用于检测一个文档的区域,其中两个特征是活动的。这些特征可以由音频或视频组件的低级分割,或来自更多的语义处理来导致。一旦检测到“活动区”,我们建议计算一组附加描述符,以便更好地表征它们。该方法应用于EPAC项目的范围,专注于检测和对话语音的表征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号