首页> 外文会议>IEEE International Symposium on Multimedia >A New Multimedia Content Skimming Method based on Speech Emphasis Extraction and Its Application to Content Variations
【24h】

A New Multimedia Content Skimming Method based on Speech Emphasis Extraction and Its Application to Content Variations

机译:基于语音强调提取的新型多媒体内容撇杀方法及其在内容变化中的应用

获取原文

摘要

We propose Choco-Para, a multimedia content skimming technique; its application to a variety of content types is described. Based on automatic speech emphasis extraction, Choco-Para extracts speech attributes, prosodic parameters such as pitch, power, and speaking rate, and uses the data to estimate the degree of emphasis of each spoken phrase. By computing the degree of the emphasis curve, Choco-Para can generate a skimmed edition at an arbitrary skimming rate by selecting emphasized speech portions via dynamic threshold logic. Choco-Para uses three types of prosodic parameters and both short term and long term deviation. Experiments assess the contributions of each prosodic parameter and deviation type. They show that estimation accuracy is optimized by using both short and long term deviation with regard to pitch, power, and speaking rate. The results confirm that Choco-Para supports a wide variety of multimedia content.
机译:我们提出了Choco-Para,一种多媒体内容撇杀技术;描述了其在各种内容类型中的应用。基于自动语音重点提取,Choco-are-para提取语音属性,韵律参数,如音调,功率和说话率,并使用数据来估计每个口语短语的强调程度。通过计算重点曲线的程度,Choco-Para可以通过通过动态阈值逻辑选择强调的语音部分来以任意撇码率生成脱脂版本。 Choco-para使用三种类型的韵律参数和短期和长期偏差。实验评估每个韵律参数和偏差类型的贡献。它们表明,通过使用短期和长期偏差在音高,功率和说话速率方面使用短期和长期偏差来优化估计精度。结果证实,Choco-Para支持各种多媒体内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号