METHOD AND SYSTEM TO ESTIMATE SPEAKER CHARACTERISTICS ON-THE-FLY FOR UNKNOWN SPEAKER WITH HIGH ACCURACY AND LOW LATENCY

Abstract

A computer-implemented technique is presented for profiling an unknown speaker. A DNN-based frame selection module selects the frames that carry information relevant to estimating a given speaker characteristic, which contributes to both the accuracy and the low latency of the system. Real-time estimation lets the system estimate the speaker characteristics at any given time from the speech segment formed by the selected frames accumulated so far. Frame-level processing keeps latency low: the system does not have to wait for the whole speech utterance, and instead estimates a speaker characteristic from only a few reliable frames. Different stopping criteria also contribute to the accuracy and the low latency of the system.
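The flow described in the abstract (select frames, accumulate them, re-estimate, stop early) can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `StreamingProfiler`, `profile_stream`, `toy_frame_score`, and `toy_estimate` are hypothetical, and a simple energy-like score stands in for the DNN-based frame selector. The abstract does not say what the stopping criteria are, so the two used here (a confidence threshold and a cap on selected frames) are assumptions.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List, Optional, Sequence, Tuple

Frame = Sequence[float]
Estimate = Tuple[str, float]  # (label, confidence in [0, 1])


@dataclass
class StreamingProfiler:
    # Hypothetical stand-in for the DNN-based frame selector: higher = more relevant.
    frame_score: Callable[[Frame], float]
    # Hypothetical stand-in for the speaker-characteristic model.
    estimate: Callable[[List[Frame]], Estimate]
    select_threshold: float = 0.5  # assumed value
    stop_confidence: float = 0.9   # assumed stopping criterion 1
    max_selected: int = 50         # assumed stopping criterion 2
    selected: List[Frame] = field(default_factory=list)

    def push(self, frame: Frame) -> Optional[Estimate]:
        """Drop irrelevant frames; otherwise accumulate and re-estimate."""
        if self.frame_score(frame) < self.select_threshold:
            return None
        self.selected.append(frame)
        return self.estimate(self.selected)

    def should_stop(self, result: Estimate) -> bool:
        """Stop when confident enough or when the frame budget is spent."""
        confident = result[1] >= self.stop_confidence
        return confident or len(self.selected) >= self.max_selected


def profile_stream(
    frames: Iterable[Frame], profiler: StreamingProfiler
) -> Tuple[Optional[Estimate], int]:
    """Consume frames one at a time; return (latest estimate, frames consumed)."""
    latest: Optional[Estimate] = None
    consumed = 0
    for frame in frames:
        consumed += 1
        result = profiler.push(frame)
        if result is None:
            continue  # frame rejected by the selector
        latest = result
        if profiler.should_stop(result):
            break  # no need to wait for the rest of the utterance
    return latest, consumed


def toy_frame_score(frame: Frame) -> float:
    # Toy relevance score: mean absolute value of the frame's features.
    return sum(abs(x) for x in frame) / len(frame)


def toy_estimate(frames: List[Frame]) -> Estimate:
    # Toy model: label from the mean first feature; confidence grows with frame count.
    mean_first = sum(f[0] for f in frames) / len(frames)
    label = "high" if mean_first > 0 else "low"
    return label, min(1.0, len(frames) / 4)
```

Because `profile_stream` returns as soon as a stopping criterion fires, the latency is governed by how quickly enough reliable frames arrive, not by the length of the utterance. An estimate is also available after every selected frame, which matches the "at any given time" behaviour the abstract describes.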