首页> 外国专利> Methods and apparatus for unknown speaker labeling using concurrent speech recognition, segmentation, classification and clustering

Methods and apparatus for unknown speaker labeling using concurrent speech recognition, segmentation, classification and clustering

机译：使用并发语音识别，分段，分类和聚类为未知说话人添加标签的方法和设备

页面导航

摘要
著录项
相似文献

摘要

A method and apparatus are disclosed for identifying speakers participating in an audio-video source, whether or not such speakers have been previously registered or enrolled. The speaker identification system uses an enrolled speaker database that includes background models for unenrolled speakers, such as “unenrolled male” or “unenrolled female,” to assign a speaker label to each identified segment. Speaker labels are identified for each speech segment by comparing the segment utterances to the enrolled speaker database and finding the “closest” speaker, if any. A speech segment having an unknown speaker is initially assigned a general speaker label from the set of background models. The “unenrolled” segment is assigned a segment number and receives a cluster identifier assigned by the clustering system. If a given segment is assigned a temporary speaker label associated with an unenrolled speaker, the user can be prompted by the present invention to identify the speaker. Once the user assigns a speaker label to an audio segment having an unknown speaker, the same speaker name can be automatically assigned to any segments that are assigned to the same cluster and the enrolled speaker database can be automatically updated to enroll the previously unknown speaker.

机译：公开了一种用于识别参与音频视频源的说话者的方法和装置，无论这种说话者先前是否已经被注册或登记。说话者识别系统使用包括包括未注册说话者（例如“未注册男性”）的背景模型的已注册说话者数据库。或“未注册女性”，为每个识别出的片段分配说话者标签。通过将片段话语与已注册的说话者数据库进行比较并找到“最接近的”，从而为每个语音片段识别出说话者标签。扬声器（如果有）。首先从背景模型集中为具有未知讲话者的语音片段分配一个通用讲话者标签。 “未注册”段被分配一个段号，并接收由集群系统分配的集群标识符。如果给定的片段被分配了与未注册的讲话者相关的临时讲话者标签，则本发明可以提示用户识别讲话者。一旦用户将扬声器标签分配给具有未知扬声器的音频片段，就可以将相同的扬声器名称自动分配给分配给同一群集的任何片段，并且可以自动更新已注册的扬声器数据库以注册先前未知的扬声器。

著录项

公开/公告号US6424946B1

专利类型
公开/公告日2002-07-23

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US19990434604
发明设计人 ALAIN CHARLES LOUIS TRITSCHLER;MAHESH VISWANATHAN;
展开▼

申请日1999-11-05
分类号G10L152/20;
国家 US
入库时间 2022-08-22 00:48:15

相似文献

专利
外文文献
中文文献