METHOD AND APPARATUS FOR DISCOVERING AND LABELING SPEAKERS IN A LARGE AND GROWING COLLECTION OF VIDEOS WITH MINIMAL USER EFFORT

Abstract

In one embodiment, an audio stream is partitioned into a plurality of segments such that the plurality of segments are clustered into one or more clusters, each of the one or more clusters identifying a subset of the plurality of segments in the audio stream and corresponding to one of a first set of one or more speaker models, each speaker model in the first set of speaker models representing one of a first set of hypothetical speakers. The speaker models in the first set of speaker models are compared with a second set of one or more speaker models, where each speaker model in the second set of speaker models represents one of a second set of hypothetical speakers. Labels associated with one or more speaker models in the second set of speaker models are propagated to one or more speaker models in the first set of speaker models according to a result of the comparing step.
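The flow in the abstract (cluster segments into speaker models, compare them with a second set of models, propagate labels) can be illustrated with a minimal sketch. The abstract does not say how a speaker model is represented or how two models are compared, so everything concrete here is an assumption made for illustration: each model is the mean of its segments' feature vectors, comparison is cosine similarity against a fixed threshold, and the names `build_speaker_models`, `propagate_labels`, and `threshold` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_speaker_models(segment_features, cluster_ids):
    """Build one model per cluster: here, the mean feature vector (assumption)."""
    grouped = {}
    for features, cluster_id in zip(segment_features, cluster_ids):
        grouped.setdefault(cluster_id, []).append(features)
    return {
        cluster_id: [sum(column) / len(vectors) for column in zip(*vectors)]
        for cluster_id, vectors in grouped.items()
    }


def propagate_labels(first_models, second_models, second_labels, threshold=0.9):
    """Give each first-set model the label of its best-matching labeled
    second-set model, if the similarity reaches the threshold."""
    labels = {}
    for first_id, model in first_models.items():
        best_id, best_score = None, threshold
        for second_id, other in second_models.items():
            if second_id not in second_labels:
                continue  # unlabeled models have nothing to propagate
            score = cosine(model, other)
            if score >= best_score:
                best_id, best_score = second_id, score
        if best_id is not None:
            labels[first_id] = second_labels[best_id]
    return labels


# Toy data: three segments from a new audio stream, already clustered
# into two hypothetical speakers (cluster ids 0 and 1).
segment_features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
cluster_ids = [0, 0, 1]
first = build_speaker_models(segment_features, cluster_ids)

# Second set of models, e.g. from earlier videos; only "a" carries a label.
second = {"a": [1.0, 0.05], "b": [-1.0, 0.0]}
labels = propagate_labels(first, second, {"a": "Alice"})
print(labels)
```

In this toy run, cluster 0 inherits the label of model "a", while cluster 1 matches nothing above the threshold and stays unlabeled, which is the case where a user would have to supply a label.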

Bibliographic details

Publication number: US2013144414A1

Patent type:
Publication date: 2013-06-06

Original format: PDF
Applicant/patentee: SACHIN KAJAREKAR; ANANTH SANKAR; SATTISH GANNU; APARNA KHARE

Application/patent number: US201113312800
Inventors: SACHIN KAJAREKAR; APARNA KHARE; SATTISH GANNU; ANANTH SANKAR

Filing date: 2011-12-06
Classification: G06F17/00
Country: US
Date added to database: 2022-08-21 16:47:06