首页> 外文学位 >Detection of usable speech using linear prediction coefficients.
【24h】

Detection of usable speech using linear prediction coefficients.

机译:使用线性预测系数检测可用语音。

获取原文
获取原文并翻译 | 示例

摘要

"Co-channel speech" refers to a speech input where a single speaker's speech is corrupted by an additional speaker's speech. If such a two-speaker speech can still be used as an input for a speaker identification system, it is termed as "usable speech". The purpose of a usable speech detection system is to be able to determine such usable segments of speech. It would act as a front-end to a speaker identification system thereby improving the performance and accuracy of such a system.; Portions of usable speech occur when high energy voiced speech from the target speaker overlaps with low-energy speech from an interfering speaker, or visa versa. The current research proposes a novel technique to detect usable speech using Linear Prediction Coefficients. The width of the first peak of the vocal tract frequency response is used as a measure to detect usable speech. The Target-to-Interferer Ratio (TIR) is used as a benchmark for comparing the results.
机译:“同频道语音”是指语音输入,其中单个讲话者的语音被另一讲话者的语音破坏。如果这样的两个发言者语音仍然可以用作发言者识别系统的输入,则称为“可用语音”。可用语音检测系统的目的是能够确定此类可用语音段。它可以作为说话人识别系统的前端,从而提高这种系统的性能和准确性。当来自目标说话者的高能语音与来自干扰说话者的低能语音重叠时,会发生部分可用语音,反之亦然。当前的研究提出了一种使用线性预测系数来检测可用语音的新技术。声道频率响应的第一个峰值的宽度用作检测可用语音的度量。目标干扰比(TIR)用作比较结果的基准。

著录项

  • 作者

    Vaidyanathan, Ramprakash.;

  • 作者单位

    Texas A&M University - Kingsville.;

  • 授予单位 Texas A&M University - Kingsville.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 M.S.
  • 年度 2004
  • 页码 88 p.
  • 总页数 88
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号