首页>
外国专利>
SPEECH CODING AND DECODING METHOD USING A NEURAL NETWORK MODEL THAT RECOGNIZES SOUND SOURCES AND AN ENCODING AND DECODING APPARATUS THAT PERFORMS THE SAME
SPEECH CODING AND DECODING METHOD USING A NEURAL NETWORK MODEL THAT RECOGNIZES SOUND SOURCES AND AN ENCODING AND DECODING APPARATUS THAT PERFORMS THE SAME
展开▼
机译:语音编码和解码方法使用神经网络模型识别执行相同的声源和编码和解码设备
展开▼
页面导航
摘要
著录项
相似文献
摘要
Disclosed are a method for encoding and decoding a voice signal using a neural network model for recognizing a sound source, and an encoding and decoding apparatus for performing the same. A method of encoding a voice signal according to an embodiment of the present invention includes: identifying input signals for a plurality of sound sources; encoding the input signal to generate a latent signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining the number of bits used for quantization of each of the plurality of sound source signals according to the type of the sound source; quantizing each of the plurality of sound source signals according to the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.
展开▼