Unsupervised classification of speaker roles in multi-participant conversational speech


Abstract

This paper proposes an unsupervised method for analyzing speaker roles in multi-participant conversational speech. First, features that characterize the differences between roles are extracted from the outputs of speaker diarization. Then, a role-clustering algorithm based on the criterion of maximizing the inter-cluster distance, without using any convergence threshold, is proposed to determine the number of roles and to merge the utterances belonging to the same role into one cluster. The contributions of different combinations of individual feature subsets are compared for the proposed method on the outputs of speaker diarization; the combined feature subsets obtain higher F scores than the individual ones for clustering speaker roles. The impacts of both speaker diarization errors and feature dimensions on the performance of the proposed method are also discussed. Experiments are conducted on the outputs of both manual annotation and automatic speaker diarization to compare the proposed method with a state-of-the-art clustering method and a supervised method. Evaluations show that, in terms of F scores, the proposed method is superior to the previous clustering method and close to the conventional supervised method under both experimental conditions.
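The abstract does not spell out the clustering criterion, so the following is only a rough sketch of the general idea of choosing the number of role clusters without a preset threshold. It is not the paper's algorithm: it swaps in a common stand-in, average-linkage agglomerative clustering cut at the largest jump in merge distance. The function names, the two per-speaker features, and the toy values are all hypothetical.

```python
from itertools import combinations
from math import dist


def average_linkage(a, b, points):
    """Mean Euclidean distance over all cross-cluster pairs of points."""
    return sum(dist(points[i], points[j]) for i in a for j in b) / (len(a) * len(b))


def cluster_roles(points):
    """Group speakers without a preset threshold or cluster count.

    Stand-in heuristic (an assumption, not the paper's criterion):
    merge the closest clusters step by step, then return the partition
    that existed just before the merge with the largest distance jump.
    """
    clusters = [[i] for i in range(len(points))]
    if len(clusters) < 2:
        return clusters
    snapshots = []    # partition as it stood before each merge
    merge_dists = []  # linkage distance of each merge, in order
    while len(clusters) > 1:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: average_linkage(clusters[p[0]], clusters[p[1]], points),
        )
        snapshots.append([sorted(c) for c in clusters])
        merge_dists.append(average_linkage(clusters[a], clusters[b], points))
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)] + [merged]
    # Jump of each merge relative to the previous one (the first is relative to 0).
    jumps = [d - prev for prev, d in zip([0.0] + merge_dists, merge_dists)]
    best = max(range(len(jumps)), key=jumps.__getitem__)
    return sorted(snapshots[best])


# Toy, made-up features per speaker: (share of turns, normalized mean turn length).
FEATURES = [
    (0.90, 0.10), (0.85, 0.15),  # speakers 0, 1
    (0.10, 0.90), (0.15, 0.85),  # speakers 2, 3
    (0.10, 0.10), (0.15, 0.12),  # speakers 4, 5
]

if __name__ == "__main__":
    print(cluster_roles(FEATURES))  # [[0, 1], [2, 3], [4, 5]]
```

On the toy data the largest jump occurs when the three tight pairs would have to be merged with each other, so the sketch returns three groups. In a real setting the rows would come from diarization output, and the feature set and distance measure would need to follow the paper.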

Bibliographic record

  • Source
    Computer Speech and Language | 2017, Issue 3 | pp. 81-99 | 19 pages
  • Author affiliations

    School of Electronic and Information Engineering, South China University of Technology, Room 223, Shaw Science Building, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

    School of Electronic and Information Engineering, South China University of Technology, 381 Wushan Road, Guangzhou, China;

  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA);
  • Original format: PDF
  • Text language: English
  • Chinese Library Classification:
  • Keywords

    Speaker role; Speaker diarization; Role clustering; Multi-participant conversational speech;
