AUDIO-SPECTRAL-MASKING-DEEP-NEURAL-NETWORK CROWD SEARCH

Abstract

A system includes a memory having instructions therein and at least one processor in communication with the memory. The at least one processor is configured to execute the instructions to communicate, into a user device, a deep neural network comprising a predictive audio spectral mask. The at least one processor is also configured to execute the instructions to: generate data corresponding to ambient sound via a multi-microphone device; separate amplitude data and/or phase data from the data via the deep neural network comprising the predictive audio spectral mask; and determine, via the user device and based on the amplitude data and/or phase data, a location of origin of target speech relative to the user device. The at least one processor is configured to execute the instructions to display, via the user device, the location of origin of the target speech relative to the user device.
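
The abstract does not say how the location is computed from the separated amplitude and phase data. As a rough illustration only, the sketch below swaps in a classical technique, mask-weighted GCC-PHAT on two microphones, and is not the method claimed in the patent. The function names, the `mask` array (a hypothetical stand-in for the network's predicted spectral mask), the frame sizes, and the far-field two-microphone geometry are all assumptions.

```python
import numpy as np

def stft(x, frame=256, hop=128):
    """Short-time Fourier transform with a Hann window; returns (frames, bins)."""
    window = np.hanning(frame)
    starts = range(0, len(x) - frame + 1, hop)
    return np.array([np.fft.rfft(window * x[s:s + frame]) for s in starts])

def estimate_delay(left, right, mask, frame=256, max_lag=16):
    """Delay of `right` relative to `left`, in samples (positive: right lags).

    `mask` is a hypothetical stand-in for a predicted spectral mask:
    shape (frames, bins), values in [0, 1], larger where target speech dominates.
    """
    L, R = stft(left, frame), stft(right, frame)
    cross = np.conj(L) * R                    # cross-spectrum carries the phase difference
    phat = cross / (np.abs(cross) + 1e-12)    # PHAT: drop amplitude, keep phase only
    spectrum = (mask * phat).sum(axis=0)      # let the mask pick which bins vote
    corr = np.fft.irfft(spectrum, n=frame)    # back to the lag domain
    lags = np.arange(-max_lag, max_lag + 1)
    return int(lags[np.argmax(corr[lags])])   # negative indices wrap to negative lags

def delay_to_angle(delay_samples, sample_rate, mic_distance, speed_of_sound=343.0):
    """Far-field arrival angle in degrees from broadside for a two-microphone pair."""
    sine = speed_of_sound * delay_samples / (sample_rate * mic_distance)
    return float(np.degrees(np.arcsin(np.clip(sine, -1.0, 1.0))))
```

With a mask of all ones this reduces to plain GCC-PHAT. A mask that suppresses bins dominated by other talkers would, under these assumptions, bias the delay estimate toward the target speaker, which is the role the abstract assigns to the predictive audio spectral mask.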

Bibliographic data

Publication number: US2021295828A1

Patent type:
Publication date: 2021-09-23

Original format: PDF
Applicant/assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Application number: US202016823725
Inventors: JONATHAN SAMN; POOJITHA BIKKI; JEB R. LINTON; MINSIK LEE

Filing date: 2020-03-19
Classification: G10L15/16; G10L15/30; G10L15/32; G06N3/04
Country: US
Indexed: 2022-08-24 21:12:21
