首页> 外国专利> AUDIO-SPECTRAL-MASKING-DEEP-NEURAL-NETWORK CROWD SEARCH

AUDIO-SPECTRAL-MASKING-DEEP-NEURAL-NETWORK CROWD SEARCH

机译:音频光谱 - 深神经网络人群搜索

摘要

A system includes a memory having instructions therein and at least one processor in communication with the memory. The at least one processor is configured to execute the instructions to communicate, into a user device, a deep neural network comprising a predictive audio spectral mask. The at least one processor is also configured to execute the instructions to: generate data corresponding to ambient sound via a multi-microphone device; separate amplitude data and/or phase data from the data via the deep neural network comprising the predictive audio spectral mask; and determine, via the user device and based on the amplitude data and/or phase data, a location of origin of target speech relative to the user device. The at least one processor is configured to execute the instructions to display, via the user device, the location of origin of the target speech relative to the user device.
机译:系统包括具有其中的指令和至少一个与存储器通信的处理器的存储器。 该至少一个处理器被配置为执行将传送到用户设备的指令,该指令是包括预测音频谱掩模的深神经网络。 至少一个处理器还被配置为执行指令:通过多麦克风设备生成与环境声音相对应的数据; 通过包括预测音频光谱掩模的深神经网络分离来自数据的幅度数据和/或相位数据; 并通过用户设备确定并基于幅度数据和/或相位数据,相对于用户设备的目标语音原点的位置。 该至少一个处理器被配置为经由用户设备相对于用户设备执行要通过用户设备的原点的原点的位置来执行指令。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号