Method of speaker clustering for unknown speakers in conversational audio data

Abstract

A method for clustering speaker data from a plurality of unknown speakers. The method includes the steps of providing a portion of audio data containing speech from all the speakers in the audio data and dividing the portion into data clusters. A pairwise distance between each pair of clusters is computed, the pairwise distance being based on a likelihood that the two clusters were created by the same speaker, the likelihood measurement being biased by the prior probability of speaker changes. The two clusters with the minimum pairwise distance are combined into a new cluster, and speaker models are trained for each of the remaining clusters, including the new cluster. The likelihood that two clusters were created by the same speaker may be biased by a Markov duration model based on speaker changes over the length of the initial data clusters.
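The merge loop described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and every modelling choice in it is an assumption made for illustration: features are one-dimensional, each cluster is modelled by a single Gaussian refit from its frames (standing in for "training speaker models"), the distance is the negative log of a generalized likelihood ratio, and the prior bias comes from a symmetric two-speaker Markov chain with a hypothetical per-segment change probability `p_change`. The names `gaussian_loglik`, `log_prior_same`, `pair_distance`, and `cluster_speakers` are invented here.

```python
import math

def gaussian_loglik(frames):
    """Maximized log-likelihood of 1-D frames under one Gaussian fit to them."""
    n = len(frames)
    mean = sum(frames) / n
    var = max(sum((x - mean) ** 2 for x in frames) / n, 1e-6)  # variance floor
    return -0.5 * n * (math.log(2 * math.pi * var) + 1.0)

def log_prior_same(gap, p_change=0.1):
    """Log prior that two segments `gap` steps apart share a speaker.

    Assumption: symmetric two-state Markov chain, 0 < p_change < 0.5.
    """
    return math.log(0.5 * (1.0 + (1.0 - 2.0 * p_change) ** gap))

def pair_distance(a, b, p_change=0.1):
    """Negative log likelihood ratio, biased by the speaker-change prior."""
    frames_a, idx_a = a
    frames_b, idx_b = b
    # How much worse one shared model fits than two separate models (>= 0).
    glr = (gaussian_loglik(frames_a) + gaussian_loglik(frames_b)
           - gaussian_loglik(frames_a + frames_b))
    # Simplification: use the closest pair of original segments as the gap.
    gap = min(abs(i - j) for i in idx_a for j in idx_b)
    return glr - log_prior_same(gap, p_change)

def cluster_speakers(segments, n_speakers, p_change=0.1):
    """Merge the closest pair of clusters until n_speakers clusters remain."""
    clusters = [(list(seg), [k]) for k, seg in enumerate(segments)]
    while len(clusters) > n_speakers:
        _, i, j = min(
            (pair_distance(clusters[i], clusters[j], p_change), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        merged = (clusters[i][0] + clusters[j][0],
                  sorted(clusters[i][1] + clusters[j][1]))
        # Models are refit implicitly: gaussian_loglik re-estimates on each call.
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return sorted(idx for _, idx in clusters)

# Toy input: segments alternate between two speakers with different means.
segments = [
    [0.0, 0.1, -0.1, 0.05],
    [5.0, 5.1, 4.9, 5.05],
    [0.02, -0.03, 0.08, -0.07],
    [5.02, 4.97, 5.08, 4.93],
]
print(cluster_speakers(segments, n_speakers=2))
```

The function returns the indices of the original segments grouped by inferred speaker. Stopping at a known `n_speakers` is a further simplification; the abstract does not state a stopping rule.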

Bibliographic details

Publication number: US5598507A
Publication date: 1997-01-28
Original format: PDF
Applicant/assignee: XEROX CORPORATION
Application number: US19940226523
Inventors: DONALD G. KIMBER; FRANCINE R. CHEN; LYNN D. WILCOX
Filing date: 1994-04-12
Classification: G10L5/06
Country: US
Database entry time: 2022-08-22 03:10:39