首页> 外文期刊>Computer speech and language >Using quality measures for multilevel speaker recognition
【24h】

Using quality measures for multilevel speaker recognition

机译:使用质量指标进行多级说话人识别

获取原文
获取原文并翻译 | 示例

摘要

The use of quality information for multilevel speaker recognition systems is addressed in this contribution. From a definition of what constitutes a quality measure, two applications are proposed at different phases of the recognition process: scoring and multilevel fusion stages. The traditional likelihood scoring stage is further developed providing guidelines for the practical application of the proposed ideas. Conventional user-independent multilevel support vector machine (SVM) score fusion is also adapted for the inclusion of quality information in the fusion process. In particular, quality measures meeting three different goodness criteria: SNR, F0 deviations and the ITU P.563 objective speech quality assessment are used in the speaker recognition process. Experiments carried out in the Switchboard-I database assess the benefits of the proposed quality-guided recognition approach for both the score computation and score fusion stages.
机译:此贡献解决了将质量信息用于多级说话人识别系统的问题。根据对质量度量的定义,在识别过程的不同阶段提出了两个应用程序:评分和多级融合阶段。进一步发展了传统的可能性评分阶段,为提出的想法的实际应用提供了指导。常规的用户独立的多级支持向量机(SVM)分数融合也适用于在融合过程中包含质量信息。特别是,在说话人识别过程中使用了满足三个不同善良标准的质量度量:SNR,F0偏差和ITU P.563客观语音质量评估。在Switchboard-I数据库中进行的实验评估了针对分数计算和分数融合阶段提出的质量指导识别方法的益处。

著录项

  • 来源
    《Computer speech and language》 |2006年第3期|p. 192-209|共18页
  • 作者单位

    ATVS (Speech and Signal Processing Group), Escuela Politecnica Superior, Universidad Autonoma de Madrid, Ctra. Colmenar km. 15 Campus de Cantoblanco, E-28049 Madrid, Spain;

    ATVS (Speech and Signal Processing Group), Escuela Politecnica Superior, Universidad Autonoma de Madrid, Ctra. Colmenar km. 15 Campus de Cantoblanco, E-28049 Madrid, Spain;

    ATVS (Speech and Signal Processing Group), Escuela Politecnica Superior, Universidad Autonoma de Madrid, Ctra. Colmenar km. 15 Campus de Cantoblanco, E-28049 Madrid, Spain;

    ATVS (Speech and Signal Processing Group), Escuela Politecnica Superior, Universidad Autonoma de Madrid, Ctra. Colmenar km. 15 Campus de Cantoblanco, E-28049 Madrid, Spain;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号