首页> 外国专利> SOUND PROCESSING METHOD, SOUND PROCESSING SYSTEM, VIDEO PROCESSING METHOD, VIDEO PROCESSING SYSTEM, SOUND PROCESSING DEVICE, AND METHOD AND PROGRAM FOR CONTROLLING SAME

SOUND PROCESSING METHOD, SOUND PROCESSING SYSTEM, VIDEO PROCESSING METHOD, VIDEO PROCESSING SYSTEM, SOUND PROCESSING DEVICE, AND METHOD AND PROGRAM FOR CONTROLLING SAME

机译:声音处理方法,声音处理系统,视频处理方法,视频处理系统,声音处理设备以及用于控制相同声音的方法和程序

摘要

To provide a device which accomplishes real-time sound identification and matching, by solving both of the problem of reducing time length of a frame and improving temporal accuracy and the problem of being robust against mixing with other sounds.;A sound processing device according to the present invention includes: a time-frequency analysis means which generates a time-frequency plane from a sound signal through time-frequency analysis; a region characteristic amount extraction means which, for a plurality of partial region pairs which is defined on the time-frequency plane and of which at least either of shapes of two partial regions or positions of the two partial regions differ from one another, extracts a region characteristic amount from each partial region; and a sound identifier generation means which generates a sound identifier which identifies the sound by using the region characteristic amount from the each partial region.
机译:提供一种通过解决减小帧的时间长度和提高时间精度的问题以及抵抗与其他声音混合的鲁棒性的问题这两者来实现实时声音识别和匹配的设备。本发明包括:时频分析装置,其通过时频分析从声音信号生成时频平面。区域特征量提取装置,对于在时频平面上定义的多个局部区域对,提取至少两个局部区域的形状或两个局部区域的位置中的任一个彼此不同的区域。每个局部区域的区域特征量;声音识别器生成装置,其通过使用来自每个局部区域的区域特征量来生成识别声音的声音识别器。

著录项

  • 公开/公告号US2014139739A1

    专利类型

  • 公开/公告日2014-05-22

    原文格式PDF

  • 申请/专利权人 NAOTAKE FUJITA;TOSHIYUKI NOMURA;

    申请/专利号US201214131580

  • 发明设计人 NAOTAKE FUJITA;TOSHIYUKI NOMURA;

    申请日2012-07-13

  • 分类号H04N5/04;G10L19/018;

  • 国家 US

  • 入库时间 2022-08-21 16:08:51

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号