首页> 外国专利> SPEECH CODING AND DECODING METHOD USING A NEURAL NETWORK MODEL THAT RECOGNIZES SOUND SOURCES AND AN ENCODING AND DECODING APPARATUS THAT PERFORMS THE SAME

SPEECH CODING AND DECODING METHOD USING A NEURAL NETWORK MODEL THAT RECOGNIZES SOUND SOURCES AND AN ENCODING AND DECODING APPARATUS THAT PERFORMS THE SAME

机译:语音编码和解码方法使用神经网络模型识别执行相同的声源和编码和解码设备

摘要

Disclosed are a method for encoding and decoding a voice signal using a neural network model for recognizing a sound source, and an encoding and decoding apparatus for performing the same. A method of encoding a voice signal according to an embodiment of the present invention includes: identifying input signals for a plurality of sound sources; encoding the input signal to generate a latent signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining the number of bits used for quantization of each of the plurality of sound source signals according to the type of the sound source; quantizing each of the plurality of sound source signals according to the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
机译:公开了一种用于使用神经网络模型对语音信号进行编码和解码的方法,用于识别声源,以及用于执行该声源的编码和解码设备。 根据本发明的实施例的编码语音信号的方法包括:识别多个声源的输入信号; 编码输入信号以产生潜在信号; 通过分离多个声源中的每一个的潜信号来获得多个声源信号; 根据声源的类型确定用于量化多个声源信号中的每一个的比特数; 根据所确定的比特量来量化多个声源信号中的每一个; 通过组合多个量化的声源信号来生成比特流。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号