首页> 外国专利> METHOD AND APPARATUS FOR DISCOVERING AND LABELING SPEAKERS IN A LARGE AND GROWING COLLECTION OF VIDEOS WITH MINIMAL USER EFFORT

METHOD AND APPARATUS FOR DISCOVERING AND LABELING SPEAKERS IN A LARGE AND GROWING COLLECTION OF VIDEOS WITH MINIMAL USER EFFORT

机译:在用户的最少努力下发现和标记大型和不断增长的视频中的讲话者的方法和装置

摘要

In one embodiment, an audio stream is partitioned into a plurality of segments such that the plurality of segments are clustered into one or more clusters, each of the one or more clusters identifying a subset of the plurality of segments in the audio stream and corresponding to one of a first set of one or more speaker models, each speaker model in the first set of speaker models representing one of a first set of hypothetical speakers. The speaker models in the first set of speaker models are compared with a second set of one or more speaker models, where each speaker model in the second set of speaker models represents one of a second set of hypothetical speakers. Labels associated with one or more speaker models in the second set of speaker models are propagated to one or more speaker models in the first set of speaker models according to a result of the comparing step.
机译:在一个实施例中,音频流被划分为多个片段,使得多个片段被群集为一个或多个群集,一个或多个群集中的每一个标识音频流中多个片段的子集并且对应于一个或多个扬声器模型的第一组中的一个,第一组扬声器模型中的每个扬声器模型代表第一组假设扬声器中的一个。将第一组扬声器模型中的扬声器模型与第二组一个或多个扬声器模型进行比较,其中第二组扬声器模型中的每个扬声器模型代表第二组假设扬声器中的一个。根据比较步骤的结果,将与第二组扬声器模型中的一个或多个扬声器模型相关联的标签传播到第一组扬声器模型中的一个或多个扬声器模型。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号