首页> 外国专利> Neural network classifier for seperating audio sources from a monophonic audio signal

Neural network classifier for seperating audio sources from a monophonic audio signal

机译：神经网络分类器，用于将音频源与单声道音频信号分离

页面导航

摘要
著录项
相似文献

摘要

A neural network classifier provides the ability to separate and categorize multiple arbitrary and previously unknown audio sources down-mixed to a single monophonic audio signal. This is accomplished by breaking the monophonic audio signal into baseline frames (possibly overlapping), windowing the frames, extracting a number of descriptive features in each frame, and employing a pre-trained nonlinear neural network as a classifier. Each neural network output manifests the presence of a pre-determined type of audio source in each baseline frame of the monophonic audio signal. The neural network classifier is well suited to address widely changing parameters of the signal and sources, time and frequency domain overlapping of the sources, and reverberation and occlusions in real-life signals. The classifier outputs can be used as a front-end to create multiple audio channels for a source separation algorithm (e.g., ICA) or as parameters in a post-processing algorithm (e.g. categorize music, track sources, generate audio indexes for the purposes of navigation, re-mixing, security and surveillance, telephone and wireless communications, and teleconferencing).

机译：神经网络分类器提供了将向下混合为单个单声道音频信号的多个任意和以前未知的音频源进行分离和分类的功能。这是通过将单声道音频信号分解为基准帧（可能重叠），在帧上加窗，在每个帧中提取许多描述性特征以及使用预训练的非线性神经网络作为分类器来实现的。每个神经网络输出表明在单声道音频信号的每个基线帧中存在预定类型的音频源。神经网络分类器非常适合解决信号和信号源变化很大的参数，信号源的时域和频域重叠以及现实信号中的混响和遮挡问题。分类器的输出可用作前端，以创建用于源分离算法（例如ICA）的多个音频通道，或用作后处理算法中的参数（例如，对音乐进行分类，跟踪源，出于以下目的生成音频索引）导航，重新混合，安全和监视，电话和无线通信以及电话会议）。

著录项

公开/公告号AU2006302549A1

专利类型
公开/公告日2007-04-19

原文格式PDF
申请/专利权人 DTS INC.;
展开▼

申请/专利号AU20060302549
发明设计人 DMITRI V. SHMUNK;
展开▼

申请日2006-10-03
分类号G10L15/16;
国家 AU
入库时间 2022-08-21 20:52:52

相似文献

专利
外文文献
中文文献