首页> 外国专利> METHOD AND APPARATUS FOR AUDIO OBJECT CODING BASED ON INFORMED SOURCE SEPARATION

METHOD AND APPARATUS FOR AUDIO OBJECT CODING BASED ON INFORMED SOURCE SEPARATION

机译:基于信息源分离的音频对象编码方法和装置

摘要

To represent and recover the constituent sources present in an audio mixture, informed source separation techniques are used. In particular, a universal spectral model (USM) is used to obtain a sparse time activation matrix for an individual audio source in the audio mixture. The indices of non-zero groups in the time activation matrix are encoded as the side information into a bitstream. The non-zero coefficients of the time activation matrix may also be encoded into the bitstream. At the decoder side, when the coefficients of the time activation matrix are included in the bitstream, the matrix can be decoded from the bitstream. Otherwise, the time activation matrix can be estimated from the audio mixture, the non-zero indices included in the bitstream, and the USM model. Given the time activation matrix, the constituent audio sources can be recovered based on the audio mixture and the USM model.
机译:为了表示和恢复音频混合中存在的组成源,使用了知情的源分离技术。特别地,通用频谱模型(USM)用于获得音频混合中单个音频源的稀疏时间激活矩阵。时间激活矩阵中的非零组的索引被编码为边信息到比特流中。时间激活矩阵的非零系数也可以被编码到比特流中。在解码器侧,当时间激活矩阵的系数包括在比特流中时,可以从比特流对矩阵进行解码。否则,可以从音频混合,比特流中包含的非零索引和USM模型估计时间激活矩阵。给定时间激活矩阵,可以基于音频混合和USM模型恢复组成的音频源。

著录项

  • 公开/公告号BR112018011005A2

    专利类型

  • 公开/公告日2018-12-04

    原文格式PDF

  • 申请/专利权人 THOMSON LICENSING;

    申请/专利号BR20181111005

  • 发明设计人 ALEXEY OZEROV;QUANG KHANH NGOC DUONG;

    申请日2016-11-25

  • 分类号G10L19/008;G10L19/26;

  • 国家 BR

  • 入库时间 2022-08-21 12:03:36

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号