
Deep learning driven multi-channel filtering for speech enhancement



Abstract

A number of features are extracted from a current frame of a multi-channel speech pickup and from side information that is a linear echo estimate, a diffuse signal component, or a noise estimate of the multi-channel speech pickup. A speech presence probability (SPP) value is produced for the current frame by a deep neural network (DNN), in response to the extracted features being input to the DNN. The DNN-based SPP value is applied to configure a multi-channel filter whose input is the multi-channel speech pickup and whose output is a single audio signal. In one aspect, the system is designed to run online, at low enough latency for real-time applications such as voice trigger detection. Other aspects are also described and claimed.
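The processing chain described in the abstract (features, then SPP, then filter configuration) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the feature definition in `extract_features`, the sigmoid `placeholder_spp` that stands in for the trained DNN, the recursive covariance smoothing, and the choice of a multichannel Wiener filter as the SPP-configured filter. The abstract itself only states that the SPP value configures a multi-channel filter with a single-channel output.

```python
import numpy as np

def extract_features(frame, noise_estimate):
    """Hypothetical features: per-bin log ratio of pickup power to noise-estimate power."""
    pickup_power = np.mean(np.abs(frame) ** 2, axis=-1)
    noise_power = np.mean(np.abs(noise_estimate) ** 2, axis=-1)
    return np.log((pickup_power + 1e-12) / (noise_power + 1e-12))

def placeholder_spp(features):
    """Hypothetical stand-in for the DNN: maps the features to one value in [0, 1]."""
    return float(1.0 / (1.0 + np.exp(-features.mean())))

class SppDrivenFilter:
    """Assumed design: an SPP-weighted multichannel Wiener filter, one filter per bin."""

    def __init__(self, n_bins, n_channels, smoothing=0.9, ref_channel=0):
        eye = np.eye(n_channels, dtype=complex)
        self.noisy_cov = np.tile(eye, (n_bins, 1, 1))  # covariance of the pickup
        self.noise_cov = np.tile(eye, (n_bins, 1, 1))  # covariance of the noise
        self.smoothing = smoothing
        self.ref = ref_channel

    def process(self, frame, spp):
        # frame: complex STFT coefficients of the current frame, shape (n_bins, n_channels)
        outer = frame[:, :, None] * frame[:, None, :].conj()  # y y^H for each bin
        s = self.smoothing
        self.noisy_cov = s * self.noisy_cov + (1.0 - s) * outer
        # The noise statistics adapt when speech is unlikely and freeze as spp approaches 1.
        alpha = s + (1.0 - s) * spp
        self.noise_cov = alpha * self.noise_cov + (1.0 - alpha) * outer
        # Wiener weights for the reference channel: w = Phi_y^{-1} (Phi_y - Phi_n) e_ref
        speech_col = (self.noisy_cov - self.noise_cov)[:, :, self.ref]
        weights = np.linalg.solve(self.noisy_cov, speech_col[:, :, None])[:, :, 0]
        # Single-channel output w^H y for each bin
        return np.einsum("bc,bc->b", weights.conj(), frame)

# Usage on synthetic data: 4 frequency bins, 3 microphones, 20 frames.
rng = np.random.default_rng(0)
spp_filter = SppDrivenFilter(n_bins=4, n_channels=3)
for _ in range(20):
    frame = rng.standard_normal((4, 3)) + 1j * rng.standard_normal((4, 3))
    noise_estimate = 0.1 * (rng.standard_normal((4, 3)) + 0j)
    spp = placeholder_spp(extract_features(frame, noise_estimate))
    enhanced = spp_filter.process(frame, spp)
```

Because each frame is processed as it arrives and only running covariance estimates are kept, a structure like this is compatible with the online, low-latency operation the abstract mentions, although the actual latency would depend on the DNN and the frame size.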


Bibliographic data

Publication number: US10546593B2

Patent type:
Publication date: 2020-01-28

Original format: PDF
Applicant/Assignee: APPLE INC.

Application number: US201715830955
Inventors: JASON WUNG; MEHREZ SOUDEN; RAMIN PISHEHVAR; JOSHUA D. ATKINS

Filing date: 2017-12-04
Classification: G10L21; G10L19; G10L21/02; G10L15/02; G10L21/0232; G10L25/30; H04R1/40; G10L25/03; G10L21/0208
Country: US
Date indexed: 2022-08-21 11:26:12
